RNAcentral text search supports advanced queries using the following syntax:
Use double quotes ("") to search for exact matches.
"hsa-mir-21" will find only hsa-mir-21 and not hsa-mir-212
A wildcard character (*) can match any number of characters. Wildcards are added automatically to all search terms that are not enclosed in double quotes.
Example: a search for
HOTAIR (no double quotes) will find both HOTAIR and HOTAIRM1 genes
and a search for
"HOTAIR" (with double quotes) will find only HOTAIR.
Search can be restricted to specific fields using the field_name:"field value" syntax. Please note that "field value" must be enclosed in double quotes.
|NCBI taxonomic identifier||
|scientific species name||
|common species name||
|Digital Object Identifier||
Multiple search terms separated by white spaces are combined using AND,
so a query like
Homo sapiens is treated as
Homo AND sapiens and only entries having both terms will be found.
or (to indicate equivalence)
rna_type:"pirna" or rna_type:"mirna"
not (to indicate exclusion)
expert_db:"lncrnadb" not expert_db:"rfam".
Use parentheses to group and nest logical terms.
(expert_db:"mirbase" OR expert_db:"lncrnadb") NOT expert_db:"rfam"
Make sure your spelling is correct.
Example: misspelled terms like
Esherichia (missing "c") won't find any results.
Use full species names.
Escherichia coli and not
E. coli as your search terms.
The RNAcentral text search can only export up to 1,000,000 search results. If you need to export more sequences, you can use the public Postgres database
Our latest article describes different ways of accessing the data.
RNAcentral is powered by the EBI search, which provides a publicly available REST interface for querying the data.