| Literature DB >> 22123747 |
Paolo D'Onorio de Meo1, Mattia D'Antonio, Francesca Griggio, Renato Lupi, Massimiliano Borsani, Giulio Pavesi, Tiziana Castrignanò, Graziano Pesole, Carmela Gissi.
Abstract
The MITOchondrial genome database of metaZOAns (MitoZoa) is a public resource for comparative analyses of metazoan mitochondrial genomes (mtDNA) at both the sequence and genomic organizational levels. The main characteristics of the MitoZoa database are the careful revision of mtDNA entry annotations and the possibility of retrieving gene order and non-coding region (NCR) data in appropriate formats. The MitoZoa retrieval system enables basic and complex queries at various taxonomic levels using different search menus. MitoZoa 2.0 has been enhanced in several aspects, including: a re-annotation pipeline to check the correctness of protein-coding gene predictions; a standardized annotation of introns and of precursor ORFs whose functionality is post-transcriptionally recovered by RNA editing or programmed translational frameshifting; updates of taxon-related fields and a BLAST sequence similarity search tool. Database novelties and the definition of standard mtDNA annotation rules, together with the user-friendly retrieval system and the BLAST service, make MitoZoa a valuable resource for comparative and evolutionary analyses as well as a reference database to assist in the annotation of novel mtDNA sequences. MitoZoa is freely accessible at http://www.caspur.it/mitozoa.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22123747 PMCID: PMC3245153 DOI: 10.1093/nar/gkr1144
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Mitochondrial data sets searchable with BLAST, together with the data set size in MitoZoa Release 9.1
| Data set name | FTkey used as data set source | Additional data to the sequence header | No. of sequences |
|---|---|---|---|
| mtDNA | Full entry | mtDNA | 2894 |
| CDS_nt | CDS | Standard gene name | 37 022 |
| tRNA | tRNA | Standard gene name | 61 228 |
| rRNA | rRNA | Standard gene name | 5699 |
| NCR ≥ 25 nt | NCR ≥ 25 nt | NCR code | 8761 |
| Protein | CDS translation, excluding pseudogenes | Standard gene name | 37 016 |
aThe NCR code defined by MitoZoa relates to species, flanking genes and NCR length (in bp). See also the online MitoZoa Help.
Inconsistencies of protein-coding genes (CDS) corrected or pointed out with a warning message in MitoZoa Release 9.1
| CDS inconsistency | No. of CDS | No. of entries |
|---|---|---|
| Modification of name | 2 | 1 |
| Modification of strand and boundaries | 2 | 1 |
| Modification of boundaries | 203 | 184 |
| Internal stop codons resolved by adding a ‘join’ | 9 | 8 |
| Unusual start codon resolved by deleting a ‘join’ | 2 | 2 |
| Warning on ‘loss of highly conserved regions’ | 107 | 84 |
| MitoZoa Release 9.1 | 27 022 | 2894 |
aExchanged annotation between atp8 and atp6 in the snake Anilius scytale (FJ755180, v2 EMBL entry).
batp8 and nad3 of the gastropod Platevindex mortoni (GU475132).
cSpecial cases of the category ‘modification of boundaries’. The ‘join’ operator, defined by GenEMBL, is used to exclude internal positions from CDS or other FTkeys.
dIn nad2 of the gastropod Ilyanassa obsoleta (NC_007781), the addition of the ‘join’ operator is also accompanied by modification of the start codon position. In all remaining cases, the CDS boundary modification consists of only the addition of the ‘join’ operator.
eIn both cases (DQ340844 and NC_000844), the presence of the join operator was due to the hypothesis of the existence of a four-base start codon in cox1, recently rejected by experimental data (32).