| Literature DB >> 31641158 |
Frédéric Rimet1,2, Evgenuy Gusev3, Maria Kahlert4, Martyn G Kelly5, Maxim Kulikovskiy3, Yevhen Maltsev3, David G Mann6,7, Martin Pfannkuchen8, Rosa Trobajo7, Valentin Vasselon9,10, Jonas Zimmermann11, Agnès Bouchez9,10.
Abstract
Diatoms (Bacillariophyta) are ubiquitous microalgae which produce a siliceous exoskeleton and which make a major contribution to the productivity of oceans and freshwaters. They display a huge diversity, which makes them excellent ecological indicators of aquatic ecosystems. Usually, diatoms are identified using characteristics of their exoskeleton morphology. DNA-barcoding is an alternative to this and the use of High-Throughput-Sequencing enables the rapid analysis of many environmental samples at a lower cost than analyses under microscope. However, to identify environmental sequences correctly, an expertly curated reference library is needed. Several curated libraries for protists exists; none, however are dedicated to diatoms. Diat.barcode is an open-access library dedicated to diatoms which has been maintained since 2012. Data come from two sources (1) the NCBI nucleotide database and (2) unpublished sequencing data of culture collections. Since 2017, several experts have collaborated to curate this library for rbcL, a chloroplast marker suitable for species-level identification of diatoms. For the latest version of the database (version 7), 605 of the 3482 taxonomical names originally assigned by the authors of the rbcL sequences were modified after curation. The database is accessible at https://www6.inra.fr/carrtel-collection_eng/Barcoding-database .Entities:
Mesh:
Substances:
Year: 2019 PMID: 31641158 PMCID: PMC6805954 DOI: 10.1038/s41598-019-51500-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Results of the curation procedure of 23 Feb 2017 corresponding to version 7 of the database.
| Marker | ||
|---|---|---|
| Imported sequences in v7 (23 feb 2017) | 703 | |
| Sequences in the former version v6 (20 feb 2016) | 2784 | |
| Curation steps | New sequences having a different id. from the sequence of the same clade | 176 |
| Nomenclatural and taxonomic changes according to peer review papers | 33 | |
| Check of photos - modifications of determinations | 157 | |
| homogenization of taxonomy/synonymies based on phylogenetic results | 1 | |
| Rejected sequences | 5 | |
| Sequence in the new version (v7) | 3482 |
Sequences were imported from NCBI (nucleotide database of the National Center for Biotechnology Information) and the TCC (Thonon Culture Collection) between 20 Mar 2016 and 23 Feb 2017. Values in the table give the number of sequences.
Number of sequences in Diat.barcode version 7 for rbcL marker.
| Total (published in NCBI and in the TCC and UK barcoding project) | 3315 | 167 |
| From TCC and UK barcoding project | 955 | 167 |
Figure 1Amount and length of the rbcL barcodes present in Diat.barcode version 7 (February 2018).
Figure 2Location of the sampled sites gathered in Diat.barcode (version 7 of February 2018). Map was generated on Google Maps, Map data © 2019 Google (https://www.google.com/maps), accessible at: https://www.google.com/maps/d/u/0/edit?mid = 1Edfju_8jL_lkwSK5pZzBabECXYhMS3y2&ll = -0.7877939911851115%2C0&z = 2
Number of rbcL barcodes and taxa (species or sub species level) per order in Diat.barcode version 7.
| Class | Order | # of barcodes | # of taxa |
|---|---|---|---|
| Bacillariophyceae | Achnanthales | 260 | 55 |
| Bacillariales | 569 | 114 | |
| Cymbellales | 322 | 90 | |
| Eunotiales | 66 | 12 | |
| Lyrellales | 3 | 3 | |
| Mastogloiales | 1 | 1 | |
| Naviculales | 878 | 263 | |
| Rhopalodiales | 31 | 11 | |
| Surirellales | 219 | 68 | |
| Thalassiophysales | 144 | 39 | |
| Coscinodiscophyceae | Asterolamprales | 1 | 1 |
| Aulacoseirales | 25 | 9 | |
| Chrysanthemodiscales | 1 | 1 | |
| Corethrales | 2 | 1 | |
| Coscinodiscales | 24 | 14 | |
| Leptocylindrales | 3 | 2 | |
| Melosirales | 28 | 10 | |
| Paraliales | 76 | 6 | |
| Rhizosoleniales | 16 | 9 | |
| Fragilariophyceae | Ardissoneales | 4 | 4 |
| Climacospheniales | 1 | 1 | |
| Cyclophorales | 5 | 4 | |
| Fragilariales | 333 | 81 | |
| Licmophorales | 10 | 8 | |
| Protoraphidales | 1 | 1 | |
| Rhabdonematales | 3 | 2 | |
| Rhaphoneidales | 8 | 5 | |
| Striatellales | 13 | 10 | |
| Tabellariales | 11 | 3 | |
| Thalassionematales | 8 | 4 | |
| Mediophyceae | Anaulales | 1 | 1 |
| Biddulphiales | 14 | 11 | |
| Chaetocerotales | 29 | 15 | |
| Cymatosirales | 42 | 16 | |
| Hemiaulales | 10 | 9 | |
| Lithodesmiales | 19 | 6 | |
| Thalassiosirales | 212 | 86 | |
| Toxariales | 2 | 2 | |
| Triceratiales | 83 | 32 |
Figure 3General flowchart of the curation and integration of new sequences in Diat.barcode library. Taxonomic curation procedure is detailed in Fig. 2. Diamonds are conditions, the arrow from the bottom point of the diamond corresponds to « Yes », the arrow from the right point of the diamond corresponds to « No ». Rectangles are processing steps.
Figure 4Flowchart of the taxonomic curation procedure. Diamonds are conditions, the arrow from the bottom point of the diamond corresponds to « Yes », the arrow from the right point of the diamond corresponds to « No ». Rectangles are processing steps.