| Literature DB >> 22641850 |
Abstract
GGRNA (http://GGRNA.dbcls.jp/) is a Google-like, ultrafast search engine for genes and transcripts. The web server accepts arbitrary words and phrases, such as gene names, IDs, gene descriptions, annotations of gene and even nucleotide/amino acid sequences through one simple search box, and quickly returns relevant RefSeq transcripts. A typical search takes just a few seconds, which dramatically enhances the usability of routine searching. In particular, GGRNA can search sequences as short as 10 nt or 4 amino acids, which cannot be handled easily by popular sequence analysis tools. Nucleotide sequences can be searched allowing up to three mismatches, or the query sequences may contain degenerate nucleotide codes (e.g. N, R, Y, S). Furthermore, Gene Ontology annotations, Enzyme Commission numbers and probe sequences of catalog microarrays are also incorporated into GGRNA, which may help users to conduct searches by various types of keywords. GGRNA web server will provide a simple and powerful interface for finding genes and transcripts for a wide range of users. All services at GGRNA are provided free of charge to all users.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22641850 PMCID: PMC3394333 DOI: 10.1093/nar/gks448
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Screenshot from GGRNA. (A) Top page (http://GGRNA.dbcls.jp/). (B) Typical search output of GGRNA. The phrases ‘PAZ domain’ and ‘RNase’ are searched in human transcripts. (C) Amino acid sequence search. The sequences MTCQSC, MHCKSC, MTCASC, CPC, DKTGT, SEHPL and GDGVND are searched simultaneously. (D) Affymetrix microarray probe set ID, 1552311_a_at, is automatically expanded into eleven corresponding probe sequences and searched for their binding sites. (E) Advanced search page. All of the search terms are transformed into the single query string shown at the bottom of the page. (F) An example of searching PCR primer binding sites. Note that each number corresponds to the first base in the matched sequence. Clicking on a transcript displays the complete record from RefSeq in GenBank Flat File (GBFF) format.
Search operators in GGRNA
| Search operators | Description | Alias |
|---|---|---|
| refid:NM_001518 | Search by RefSeq ID. Version number following a dot is ignored: [refid:NM_003380.2] and [refid:NM_003380] will return the same results. Words starting with NM_, XM_, NR_ or XR_ are automatically treated as refid: search without operator. |
refseqid: refseq: id:NM_, id:XM_, id:NR_, id:XR_ |
| geneid:10579 | Search by Gene ID. An integer is automatically treated as geneid: search without operator. |
gene:integer id:integer |
| symbol:VIM | Search for gene symbols and synonyms which partially match to the query. For example, query EIF2C will return EIF2C1, 2, 3 and 4. | name: |
| aa:KDEL | Search for amino acid sequence. | |
|
ref:Naito ref:1327585 | Full text search within cited references. PubMed ID can also be queried. | reference: |
|
probe:1552311_a_at probe:A_23_P101434 | Search for nucleotide sequences by microarray probe ID. Words ending with _at, _st (Affymetrix ID) and starting with A_ (Agilent ID) are automatically treated as probe: search without operator. When probe ID is not converted into sequences, the probe ID is subjected to a regular search. | probeid: |
|
anot:GO:0006915 anot:[apoptosis] anot:“EC 2.3.1.51” |
Search for annotation. - Search by Gene Ontology ID and term - Search by Enzyme Commission (EC) number |
annotation: annot: |
|
seq:caagaagagattg seq1:caagaagagattg seq2:caagaagagattgcc seq3:caaggagagatgggacac | Search for nucleotide sequence. Query containing letters A, T, G, C and U only will automatically be treated as seq: search without the operator. U and T will be treated identically. seq1:, seq2: and seq3: will return results with 1-, 2- and 3-nt mismatch tolerance. |
sequence: sequence1: sequence2: sequence3: |
|
comp:caagaagagattg comp1:caagaagagattg comp2:caagaagagattgcc comp3:caaggagagatgggacac | Search for complementary sequence. comp1:, comp2: and comp3: will return results with 1-, 2- and 3-nt mismatch tolerance. |
complementary: complementary1: complementary2: complementary3: |
|
both:caagaagagattg both1:caagaagagattg both2:caagaagagattgcc both3:caaggagagatgggacac | Simultaneously retrieve sense and antisense nucleotide sequences corresponding to the query. both1:, both2: and both3: will return results with 1-, 2- and 3-nt mismatch tolerance. |
bothseq: bothseq1: bothseq2: bothseq3: |
|
iub:yyaaggnnnagacac iubcomp:yyaaggnnnagacac iubboth:yyaaggnnnagacac | Search for nucleotide sequence containing IUB code letters (e.g. N, R, Y, S). iubcomp: will return complementary sequences to the query; iubboth: will return both strands. | iubseq: → iub: |