| Literature DB >> 32246067 |
Christophe Djemiel1, Damien Plassard2, Sébastien Terrat1, Olivier Crouzet3, Joana Sauze4, Samuel Mondy1, Virginie Nowak1, Lisa Wingate4, Jérôme Ogée4, Pierre-Alain Maron5.
Abstract
Studying the ecology of photosynthetic microeukaryotes and prokaryotic cyanobacterial communities requires molecular tools to complement morphological observations. These tools rely on specific genetic markers and require the development of specialised databases to achieve taxonomic assignment. We set up a reference database, called µgreen-db, for the 23S rRNA gene. The sequences were retrieved from generalist (NCBI, SILVA) or Comparative RNA Web (CRW) databases, in addition to a more original approach involving recursive BLAST searches to obtain the best possible sequence recovery. At present, µgreen-db includes 2,326 23S rRNA sequences belonging to both eukaryotes and prokaryotes encompassing 442 unique genera and 736 species of photosynthetic microeukaryotes, cyanobacteria and non-vascular land plants based on the NCBI and AlgaeBase taxonomy. When PR2/SILVA taxonomy is used instead, µgreen-db contains 2,217 sequences (399 unique genera and 696 unique species). Using µgreen-db, we were able to assign 96% of the sequences of the V domain of the 23S rRNA gene obtained by metabarcoding after amplification from soil DNA at the genus level, highlighting good coverage of the database. µgreen-db is accessible at http://microgreen-23sdatabase.ea.inra.fr.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32246067 PMCID: PMC7125122 DOI: 10.1038/s41598-020-62555-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Pie chart and histograms showing (A) the origin and number, and (B) the length of the 23S rDNA sequences available in the database.
Figure 2Taxonomic coverage at different ranks from the PR2/SILVA, NCBI and AlgaeBase taxonomy.
Figure 3Sequence distribution of the µgreen-db database at the Phylum level and grouped by Kingdom or supergroups. (A) Based on NCBI taxonomy according to Adl et al. (2012)[55] for the group classification, (B) Based on AlgaeBase taxonomy, (C) Based on PR2 and SILVA taxonomy.
Figure 4Relative sequence abundance of photosynthetic microeukaryotes and cyanobacteria at Phylum (A) and Genus (B) level.
Figure 5Workflow describing the different steps performed to generate the curated and annotated 23S rDNA reference database constructed from various databases and methods.