Literature DB >> 29718202

The global catalogue of microorganisms 10K type strain sequencing project: closing the genomic gaps for the validly published prokaryotic and fungi species.

Linhuan Wu1,2,3, Kevin McCluskey4,5, Philippe Desmeth4,6, Shuangjiang Liu2,4, Sugawara Hideaki7, Ye Yin8, Ohkuma Moriya9, Takashi Itoh9, Cha Young Kim10, Jung-Sook Lee10, Yuguang Zhou11, Hiroko Kawasaki12, Manzour Hernando Hazbón13, Vincent Robert14, Teun Boekhout14,15,16, Nelson Lima17, Lyudmila Evtushenko18, Kyria Boundy-Mills4,19, Boyke Bunk20, Edward R B Moore21, Lily Eurwilaichitr4,22, Supawadee Ingsriswang22, Heena Shah23, Su Yao24, Tao Jin8, Jinqun Huang8, Wenyu Shi1, Qinglan Sun1, Guomei Fan1, Wei Li1, Xian Li1, Ipek Kurtböke4,25, Juncai Ma1,2,3,4.   

Abstract

Genomic information is essential for taxonomic, phylogenetic, and functional studies to comprehensively decipher the characteristics of microorganisms, to explore microbiomes through metagenomics, and to answer fundamental questions of nature and human life. However, large gaps remain in the available genomic sequencing information published for bacterial and archaeal species, and the gaps are even larger for fungal type strains. The Global Catalogue of Microorganisms (GCM) leads an internationally coordinated effort to sequence type strains and close gaps in the genomic maps of microorganisms. Hence, the GCM aims to promote research by deep-mining genomic data.

Entities:  

Mesh:

Year:  2018        PMID: 29718202      PMCID: PMC5941136          DOI: 10.1093/gigascience/giy026

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524


Introduction

Microorganisms are the most abundant organisms on Earth. The total diversity of prokaryotes may comprise up to 109 species [1]. For prokaryotic species, only new names published in the International Journal of Systematic andEvolutionary Microbiology (IJSEM) as an original article or in the “Validation Lists” are considered valid. As of the end of 2017, 15,081 valid prokaryotic species and subspecies were published compared to 12,981 at the end of December 2015 [2]. The number of publications increased by 1,088 in 2016 and by 1,012 in 2017. The most commonly accepted estimate of the number of existing fungal species is 2.27 million, as hypothesized by Hawksworth [3], while the number of species reported in the Dictionary of Fungi is only about 100,000. The taxonomy of microorganisms, including their classification, identification, and nomenclature, has developed from morphological and metabolic characterization to incorporate numerical taxonomy based on phenetic analyses, chemotaxonomy, and finally polyphasic approaches that combine phenotypic, chemotaxonomic, genotypic, and genomic information. The IJSEM recently announced that since January 2018, authors of new taxa descriptions have been asked to provide genome sequence data with descriptions of novel taxa with their manuscript submissions [4]. As such, the taxonomy of microorganisms has entered the genomics era. A genomic “gold standard” for consistent microbial species definitions is urgently needed. To meet this end, a fundamental step is to sequence the type strains of validly published prokaryotic and fungal species. In addition to the microorganisms that can be isolated and maintained in situ, the vast majority of microorganisms cannot yet be cultivated and thus are relatively poorly studied. Culture-independent approaches have been developed to investigate the compositions and functions of environmental and human microbiomes. However, accurate taxonomic and functional predictions based on metagenomic data are dependent on the availability of high-quality reference genomic data [5]. Therefore, sequencing the genomes of type strains of recognized microbial species will provide a taxonomic context for metagenomic data analysis, which is commonly comprised of short and incomplete sequences from complex environmental communities. Microorganisms possess extensive genomic and metabolic diversity, which makes them ideal biotechnological tools. Decoding the full genomes of the type strains of various species in order to provide reference genomes will thus enable genes to be associated with functions, such as metabolic activity, virulence, antibiotic production or resistance, biomass deconstruction, cellulose agricultural nitrogen fixation, and the liberation of environmental phosphorus. Access to microbial genomic sequences will significantly contribute to future studies in microbial biology, ecology, and biochemistry and these will, in turn, accelerate the discovery of new natural products and drugs [6].

The Current Status of the Strain Sequencing Project

Descriptions of prokaryotic species are based on living cultures, and one representative strain is designated as the nomenclatural “type.” The IJSEM and the International Committee on Systematics of Prokaryotes require that the type strains of new species be deposited in at least two recognized collections in two countries. The type strains of 15,081 prokaryotic species are widely preserved in more than 130 culture collections. In mycology the trend is similar, although hitherto a fungal type specimen must be metabolically inactive. Presently, the selection of strains for whole-genome sequencing is based predominantly on medical, ecological, or industrial importance, which often leads to bias in assessing phylogenetic relationships. There are several ongoing phylogenetic-based microbial sequencing projects. The Genomic Encyclopedia of Bacteria and Archaea (GEBA), led by the US Department of Energy Joint Genome Institute (US DOE JGI), has pioneered the partnership between culture collections and sequencing projects. The GEBA project published 1,003 whole-genome sequences of type strains in 2017 as the outcome of its first stage [7]. GEBA started the new stage of the project in 2015, which has a focus on the genomes of soil, plant-associated, and newly described type strains [8]. Similarly, the US DOE JGI, in collaboration with international research teams, conducted a 5-year project to sequence 1,000 fungal genomes from across the Fungal Tree of Life [9]. The overall plan is to fill in gaps in the Fungal Tree of Life by sequencing at least two reference genomes from the more than 500 recognized families of fungi. Many type strains of microbial species remain unsequenced. Hence, the World Federation of Culture Collections (WFCC) and the World Data Centre for Microorganisms (WDCM) have initiated an international community-led project to sequence the full genomes of microbial type strains to support continued scientific discovery and biotechnological utilization. Considering the wide distribution of type strains, cooperation across the global culture collection community is essential for success.

Emerging Enhancements of Culture Collection in the Genomic Era

Efforts made by culture collection curators to explore the diversity of microorganisms and to harness their genes, properties, and products remain insufficient. While type collections are not always large or diverse, the genome sequencing efforts of the Global Catalogue of Microorganisms (GCM) will increase access to resources in smaller collections. The WDCM is the data center of the WFCC and the Microbial Resources Centers Network. The WDCM is working on facilitating the application of cutting-edge information technology to improve the interoperability of microbial data, promote access and use of data, and coordinate international cooperation among culture collections, scientists, and other user communities. The first stage of the GCM project, started in 2012, focused on sharing strain catalogue data from culture collections [10]. The proposed type strain sequencing project is the continuation of the GCM project as its second stage, GCM 2.0.

Project Development and Current Progress

The project was first announced during the 14th International Culture Collections Conference, held in Singapore in July 2017 in conjunction with the International Union of Microbiological Societies (IUMS) conferences. Following that, in October 2017, a ceremony was held in Beijing, China, to launch the project, at which representatives from the following culture collections were present: the American Type Culture Collection (ATCC), Belgian Coordinated Collections of Microorganisms (BCCM), Biological Resource Center, Japan National Institute of Technology and Evaluation (NBRC), Culture Collection of the University of Gothenburg (CCUG), China General Microbiological Culture Collection Center (CGMCC), Fungal Genetics Stock Center (Kansas State University, USA) (FGSC), Japan Collection of Microorganisms (JCM), Korean Collection for Type Cultures (KCTC), Leibniz-Institut DSMZ-Deutsche Sammlung von Mikroorganismen und Zellkulturen (DSMZ), Micoteca da Universidade do Minho (Portugal) (MUM), Thailand Bioresource Research Center (TBRC), and Westerdijk Fungal Biodiversity Institute (CBS). The meeting representatives held detailed discussions on the organization and operational procedures of the GCM 2.0 project. Some culture collections (e.g., All-Russian Collection of Microorganisms [VKM], China Center of Industrial Culture Collection [CICC], Phaff Yeast Culture Collection [University of California Davis, USA;(UCD-FST], Thailand Institute of Scientific and Technological Research Culture Collection [TISTR], and UK National Collection of Type Cultures [NCTC]) did not have a representative present at the launch ceremony but expressed their willingness to join the collaboration. It is expected that the project will be completed within 5 years, including a pilot stage in the first year. GCM 2.0 includes two core subprojects, sequencing 10,000 bacterial and archaeal type strains and sequencing of some of the fungal type strains. It will also embrace several satellite projects on specific scientific targets. GCM 2.0 is coordinated by a steering committee and five interlinked working groups: Bacteria Selection, Fungi Selection, Standard Operational Procedures, Databases, and Intellectual Property Rights and Legal Issues. The project has established standard operational procedures for DNA extraction, sample submission, sequencing, and data processing to ensure that all genetic resources, data, and metadata associated with type strains are appropriately obtained, recorded, and stored. A project proposed by the WDCM, “AWI 20170: Specification on Data Integration and Publication in Microbial Resource Centers,” which would meet International Organization for Standardization standards, is under development. The raw data and annotation results generated from this project will be published on the GCM portal. Following norms established for genome projects coming from the Bermuda Principles and Fort Lauderdale agreement, the resulting genomic data will also be made freely available in public databases, including those maintained by the National Center for Biotechnology Information, the European Molecular Biology Laboratory, and the DNA Data Bank of Japan, after completion of data analysis and annotation and ensurance that the data has met a set of quality criteria. All validly published bacterial and archaeal type strains, as well as selected reference fungal type strains that are frequently used for functional or phylogenetic studies, will be on the list of candidates for sequencing. Each strain has documentation issued by the providing culture collection, which ensures the purity and identity of the type strain. BGI-Shenzhen will support the microbial genome sequencing and assist with the data analysis for this project. Sampling works for the pilot stage have been initiated. The project has established a global network to collect approximately 800 candidate type strain samples from American, British, Belgian, Chinese, Dutch, Japanese, Korean, Portuguese, Russian, Swedish, and Thai collections. Although extracted DNA samples are much preferred, it is also acceptable to send cultured cell samples of the strains. Importantly, under the terms of Nagoya Protocol, GCM 2.0 will respect the access and benefit-sharing regulations of all countries. The GCM type strain sequencing project encourages all culture collections to participate in this international collaborative project. Interested parties should be willing to provide DNA for type strains held in their collections. All microbiologists and institutions from related fields are welcome to submit subprojects for genomic data-related research questions. Brief proposals, including questions to be addressed and type strains to be sequenced and analyzed, should be emailed to Dr Juncai Ma at ma@im.ac.cn. Once a proposal is granted as a subproject, the scientist(s) will be asked to lead the full genome analysis and jointly publish the generated outcomes.

Conclusion

As a collaborative network of international culture collections, GCM 2.0 will contribute to a genome-based microbial taxonomic framework, establishing high-quality complete genome sequences as the new gold standard. The resulting knowledge and tools generated through this project will not only directly facilitate the identification of microorganisms but will also improve our ability to predict new gene complexes and their functions from microbial communities. Thus, our knowledge of the hitherto undiscovered microbial diversity will be expanded, which may lead to the sustainable utilization of microbial resources for human benefit.

Abbreviations

GEBA, Genomic Encyclopedia of Bacteria and Archaea; GCM, Global Catalogue of Microorganisms; IJSEM,International Journal of Systematic and Evolutionary Microbiology; US DOE JGI, Uniteed States Department of Energy Joint Genome Institute; WDCM, World Data Centre for Microorganisms; WFCC, World Federation of Culture Collections. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  8 in total

1.  Estimating prokaryotic diversity and its limits.

Authors:  Thomas P Curtis; William T Sloan; Jack W Scannell
Journal:  Proc Natl Acad Sci U S A       Date:  2002-07-03       Impact factor: 11.205

Review 2.  Deciphering the human microbiome using next-generation sequencing data and bioinformatics approaches.

Authors:  Yihwan Kim; InSong Koh; Mina Rho
Journal:  Methods       Date:  2014-10-28       Impact factor: 3.608

3.  1,003 reference genomes of bacterial and archaeal isolates expand coverage of the tree of life.

Authors:  Supratim Mukherjee; Rekha Seshadri; Neha J Varghese; Emiley A Eloe-Fadrosh; Jan P Meier-Kolthoff; Markus Göker; R Cameron Coates; Michalis Hadjithomas; Georgios A Pavlopoulos; David Paez-Espino; Yasuo Yoshikuni; Axel Visel; William B Whitman; George M Garrity; Jonathan A Eisen; Philip Hugenholtz; Amrita Pati; Natalia N Ivanova; Tanja Woyke; Hans-Peter Klenk; Nikos C Kyrpides
Journal:  Nat Biotechnol       Date:  2017-06-12       Impact factor: 54.908

Review 4.  Phylogeny-guided (meta)genome mining approach for the targeted discovery of new microbial natural products.

Authors:  Hahk-Soo Kang
Journal:  J Ind Microbiol Biotechnol       Date:  2016-11-24       Impact factor: 3.346

Review 5.  A New Genomics-Driven Taxonomy of Bacteria and Archaea: Are We There Yet?

Authors:  George M Garrity
Journal:  J Clin Microbiol       Date:  2016-05-18       Impact factor: 5.948

6.  Proposed minimal standards for the use of genome data for the taxonomy of prokaryotes.

Authors:  Jongsik Chun; Aharon Oren; Antonio Ventosa; Henrik Christensen; David Ruiz Arahal; Milton S da Costa; Alejandro P Rooney; Hana Yi; Xue-Wei Xu; Sofie De Meyer; Martha E Trujillo
Journal:  Int J Syst Evol Microbiol       Date:  2018-01       Impact factor: 2.747

7.  Genomic Encyclopedia of Bacterial and Archaeal Type Strains, Phase III: the genomes of soil and plant-associated and newly described type strains.

Authors:  William B Whitman; Tanja Woyke; Hans-Peter Klenk; Yuguang Zhou; Timothy G Lilburn; Brian J Beck; Paul De Vos; Peter Vandamme; Jonathan A Eisen; George Garrity; Philip Hugenholtz; Nikos C Kyrpides
Journal:  Stand Genomic Sci       Date:  2015-05-17

8.  World data centre for microorganisms: an information infrastructure to explore and utilize preserved microbial strains worldwide.

Authors:  Linhuan Wu; Qinglan Sun; Philippe Desmeth; Hideaki Sugawara; Zhenghong Xu; Kevin McCluskey; David Smith; Vasilenko Alexander; Nelson Lima; Moriya Ohkuma; Vincent Robert; Yuguang Zhou; Jianhui Li; Guomei Fan; Supawadee Ingsriswang; Svetlana Ozerskaya; Juncai Ma
Journal:  Nucleic Acids Res       Date:  2016-10-07       Impact factor: 16.971

  8 in total
  13 in total

1.  Quantitative global studies reveal differential translational control by start codon context across the fungal kingdom.

Authors:  Edward W J Wallace; Corinne Maufrais; Jade Sales-Lee; Laura R Tuck; Luciana de Oliveira; Frank Feuerbach; Frédérique Moyrand; Prashanthi Natarajan; Hiten D Madhani; Guilhem Janbon
Journal:  Nucleic Acids Res       Date:  2020-03-18       Impact factor: 16.971

Review 2.  Development of the SeqCode: A proposed nomenclatural code for uncultivated prokaryotes with DNA sequences as type.

Authors:  William B Whitman; Maria Chuvochina; Brian P Hedlund; Philip Hugenholtz; Konstantinos T Konstantinidis; Alison E Murray; Marike Palmer; Donovan H Parks; Alexander J Probst; Anna-Louise Reysenbach; Luis M Rodriguez-R; Ramon Rossello-Mora; Iain Sutcliffe; Stephanus N Venter
Journal:  Syst Appl Microbiol       Date:  2022-07-20       Impact factor: 4.064

3.  Comparative transcriptome analysis of pollen and anther wall reveals novel insights into the regulatory mechanisms underlying anther wall development and its dehiscence in rice.

Authors:  Woo-Jong Hong; Su Kyoung Lee; Seok-Hui Kim; Yu-Jin Kim; Sunok Moon; Eui-Jung Kim; Jeniffer Silva; Ki-Hong Jung
Journal:  Plant Cell Rep       Date:  2022-03-06       Impact factor: 4.964

4.  Constructing a Defined Starter for Multispecies Vinegar Fermentation via Evaluation of the Vitality and Dominance of Functional Microbes in an Autochthonous Starter.

Authors:  Ting Huang; Zhen-Ming Lu; Ming-Ye Peng; Li-Juan Chai; Xiao-Juan Zhang; Jin-Song Shi; Qi Li; Zheng-Hong Xu
Journal:  Appl Environ Microbiol       Date:  2021-11-24       Impact factor: 5.005

5.  Characterization of the Potential Probiotic Vibrio sp. V33 Antagonizing Vibrio Splendidus Based on Iron Competition.

Authors:  Ningning Liu; Tongxiang Song; Shanshan Zhang; Huijie Liu; Xuelin Zhao; Yina Shao; Chenghua Li; Weiwei Zhang
Journal:  Iran J Biotechnol       Date:  2020-01-01       Impact factor: 1.671

6.  Draft Genome Sequence of the Type Strain Streptomyces armeniacus ATCC 15676.

Authors:  Puneet Labana; Jessica T Gosse; Christopher N Boddy
Journal:  Microbiol Resour Announc       Date:  2018-09-20

Review 7.  Advances in understanding the evolution of fungal genome architecture.

Authors:  Shelby J Priest; Vikas Yadav; Joseph Heitman
Journal:  F1000Res       Date:  2020-07-27

8.  gcMeta: a Global Catalogue of Metagenomics platform to support the archiving, standardization and analysis of microbiome data.

Authors:  Wenyu Shi; Heyuan Qi; Qinglan Sun; Guomei Fan; Shuangjiang Liu; Jun Wang; Baoli Zhu; Hongwei Liu; Fangqing Zhao; Xiaochen Wang; Xiaoxuan Hu; Wei Li; Jia Liu; Ye Tian; Linhuan Wu; Juncai Ma
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

9.  Platanus_B: an accurate de novo assembler for bacterial genomes using an iterative error-removal process.

Authors:  Rei Kajitani; Dai Yoshimura; Yoshitoshi Ogura; Yasuhiro Gotoh; Tetsuya Hayashi; Takehiko Itoh
Journal:  DNA Res       Date:  2020-06-01       Impact factor: 4.458

10.  gcType: a high-quality type strain genome database for microbial phylogenetic and functional research.

Authors:  Wenyu Shi; Qinglan Sun; Guomei Fan; Sugawara Hideaki; Ohkuma Moriya; Takashi Itoh; Yuguang Zhou; Man Cai; Song-Gun Kim; Jung-Sook Lee; Ivo Sedlacek; David R Arahal; Teresa Lucena; Hiroko Kawasaki; Lyudmila Evtushenko; Bevan S Weir; Sarah Alexander; Dlauchy Dénes; Somboon Tanasupawat; Lily Eurwilaichitr; Supawadee Ingsriswang; Bruno Gomez-Gil; Manzour H Hazbón; Marco A Riojas; Chatrudee Suwannachart; Su Yao; Peter Vandamme; Fang Peng; Zenghui Chen; Dongmei Liu; Xiuqiang Sun; Xinjiao Zhang; Yuanchun Zhou; Zhen Meng; Linhuan Wu; Juncai Ma
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.