| Literature DB >> 24980130 |
Conrad L Schoch1, Barbara Robbertse1, Vincent Robert1, Duong Vu1, Gianluigi Cardinali1, Laszlo Irinyi1, Wieland Meyer1, R Henrik Nilsson1, Karen Hughes1, Andrew N Miller1, Paul M Kirk1, Kessy Abarenkov1, M Catherine Aime1, Hiran A Ariyawansa1, Martin Bidartondo1, Teun Boekhout1, Bart Buyck1, Qing Cai1, Jie Chen1, Ana Crespo1, Pedro W Crous1, Ulrike Damm1, Z Wilhelm De Beer1, Bryn T M Dentinger1, Pradeep K Divakar1, Margarita Dueñas1, Nicolas Feau1, Katerina Fliegerova1, Miguel A García1, Zai-Wei Ge1, Gareth W Griffith1, Johannes Z Groenewald1, Marizeth Groenewald1, Martin Grube1, Marieka Gryzenhout1, Cécile Gueidan1, Liangdong Guo1, Sarah Hambleton1, Richard Hamelin1, Karen Hansen1, Valérie Hofstetter1, Seung-Beom Hong1, Jos Houbraken1, Kevin D Hyde1, Patrik Inderbitzin1, Peter R Johnston1, Samantha C Karunarathna1, Urmas Kõljalg1, Gábor M Kovács1, Ekaphan Kraichak1, Krisztina Krizsan1, Cletus P Kurtzman1, Karl-Henrik Larsson1, Steven Leavitt1, Peter M Letcher1, Kare Liimatainen1, Jian-Kui Liu1, D Jean Lodge1, Janet Jennifer Luangsa-ard1, H Thorsten Lumbsch1, Sajeewa S N Maharachchikumbura1, Dimuthu Manamgoda1, María P Martín1, Andrew M Minnis1, Jean-Marc Moncalvo1, Giuseppina Mulè1, Karen K Nakasone1, Tuula Niskanen1, Ibai Olariaga1, Tamás Papp1, Tamás Petkovits1, Raquel Pino-Bodas1, Martha J Powell1, Huzefa A Raja1, Dirk Redecker1, J M Sarmiento-Ramirez1, Keith A Seifert1, Bhushan Shrestha1, Soili Stenroos1, Benjamin Stielow1, Sung-Oui Suh1, Kazuaki Tanaka1, Leho Tedersoo1, M Teresa Telleria1, Dhanushka Udayanga1, Wendy A Untereiner1, Javier Diéguez Uribeondo1, Krishna V Subbarao1, Csaba Vágvölgyi1, Cobus Visagie1, Kerstin Voigt1, Donald M Walker1, Bevan S Weir1, Michael Weiß1, Nalin N Wijayawardene1, Michael J Wingfield1, J P Xu1, Zhu L Yang1, Ning Zhang1, Wen-Ying Zhuang1, Scott Federhen1.
Abstract
DNA phylogenetic comparisons have shown that morphology-based species recognition often underestimates fungal diversity. Therefore, the need for accurate DNA sequence data, tied to both correct taxonomic names and clearly annotated specimen data, has never been greater. Furthermore, the growing number of molecular ecology and microbiome projects using high-throughput sequencing require fast and effective methods for en masse species assignments. In this article, we focus on selecting and re-annotating a set of marker reference sequences that represent each currently accepted order of Fungi. The particular focus is on sequences from the internal transcribed spacer region in the nuclear ribosomal cistron, derived from type specimens and/or ex-type cultures. Re-annotated and verified sequences were deposited in a curated public database at the National Center for Biotechnology Information (NCBI), namely the RefSeq Targeted Loci (RTL) database, and will be visible during routine sequence similarity searches with NR_prefixed accession numbers. A set of standards and protocols is proposed to improve the data quality of new sequences, and we suggest how type and other reference sequences can be used to improve identification of Fungi. Database URL: http://www.ncbi.nlm.nih.gov/bioproject/PRJNA177353. Published by Oxford University Press 2013. This work is written by US Government employees and is in the public domain in the US.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24980130 PMCID: PMC4075928 DOI: 10.1093/database/bau061
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Workflow of the ITS verification for RTL ITS.
List of collection databases with specimen pages to which links were established from records in GenBank
| Acronym | Collection Institute | Database link |
|---|---|---|
| ACBR | Austrian Center of Biological Resources and Applied Mycology | |
| ATCC | American Type Culture Collection | |
| BCRC | Bioresource Collection and Research Center | |
| BPI | US National Fungus Collections, Systematic Botany and Mycology Laboratory | |
| CBS | Centraalbureau voor Schimmelcultures, Fungal and Yeast Collection | |
| CFMR | Center for Forest Mycology Research | |
| DSM | Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH | |
| FRR | Food Science Australia, Ryde | |
| ICMP | International Collection of Microorganisms from Plants | |
| JCM | Japan Collection of Microorganisms | |
| MA | Real Jardín Botánico de Madrid Herbarium | |
| MAFF | MAFF Genebank, Ministry of Agriculture, Forestry and Fisheries | |
| MICH | University of Michigan | |
| MTCC | Microbial Type Culture Collection and Gene Bank | |
| MUCL | Mycotheque de l'Universite Catholique de Louvain | |
| NBRC | NITE Biological Resource Center | |
| NRRL | Agricultural Research Service Culture Collection | |
| PDD | New Zealand Fungal and Plant Disease Herbarium | |
| PYCC | Portuguese Yeast Culture Collection | |
| SAG | Sammlung von Algenkulturen at Universitat Gottingen | |
| UAMH | University of Alberta Microfungus Collection and Herbarium |
Unique acronyms were taken from the GenBank collections database and, where possible, agree with labels used by Index Herbariorum, WFCC and GRBio
Figure 2.ITS length variation of complete ITS regions in the RTL data set according to class.
Figure 3.Diagram showing the proportion of accessions associated with UNITE (version 6) data.
Figure 4.Anatomy of an RTL record. The marked areas indicate most common additions to the original nucleotide record. (A) New RTL accession number; (B) new simplified definition line; (C) Bioproject number for the ITS-targeted loci project; (D) GenBank synonym of current taxonomic name (used in cases of common usage); (E) label indicating that this is a RefSeq record; (F) comment regarding the source of the record; (G) the culture collection or specimen voucher presented as a validated structured triplet or doublet that can link directly to a relevant outside culture or specimen page; (H) additional information on the type and basionym name; (I) the ITS entry of all records was re-annotated to indicate the spacers and ribosomal genes.
Figure 5.Diagram showing the proportion of accessions that originated from specimens associated with a collection that has an online database.
Figure 6.Multidimensional scaling clustering of RTL ITS sequences and coloring, according to the NCBI Taxonomy classification at class rank. Each marker represents an individual sequence.
Figure 7.BioloMICS centrality scores of ITS sequences at genus rank, showing genera with ≥20 ITS records in the RefSeq data set. Each marker represents an individual sequence.