| Literature DB >> 27901108 |
Thomas Vannier1,2,3, Jade Leconte1,2,3, Yoann Seeleuthner1,2,3, Samuel Mondy1,2,3, Eric Pelletier1,2,3, Jean-Marc Aury1, Colomban de Vargas4, Michael Sieracki5, Daniele Iudicone6, Daniel Vaulot4, Patrick Wincker1,2,3, Olivier Jaillon1,2,3.
Abstract
Bathycoccus is a cosmopolitan green micro-alga belonging to the Mamiellophyceae, a class of picophytoplankton that contains important contributors to oceanic primary production. A single species of Bathycoccus has been described while the existence of two ecotypes has been proposed based on metagenomic data. A genome is available for one strain corresponding to the described phenotype. We report a second genome assembly obtained by a single cell genomics approach corresponding to the second ecotype. The two Bathycoccus genomes are divergent enough to be unambiguously distinguishable in whole DNA metagenomic data although they possess identical sequence of the 18S rRNA gene including in the V9 region. Analysis of 122 global ocean whole DNA metagenome samples from the Tara-Oceans expedition reveals that populations of Bathycoccus that were previously identified by 18S rRNA V9 metabarcodes are only composed of these two genomes. Bathycoccus is relatively abundant and widely distributed in nutrient rich waters. The two genomes rarely co-occur and occupy distinct oceanic niches in particular with respect to depth. Metatranscriptomic data provide evidence for gain or loss of highly expressed genes in some samples, suggesting that the gene repertoire is modulated by environmental conditions.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27901108 PMCID: PMC5128809 DOI: 10.1038/srep37900
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Assembly summaries of TOSAG39-1.
| SAG Assembly | Total Size (Mb) | N50 (kb) | NG50 | Genome Completion (%) |
|---|---|---|---|---|
| A | 3.5 | 14.8 | NA | 30.8 |
| B | 4.7 | 14.5 | NA | 27.7 |
| C | 3.7 | 24.1 | NA | 21.5 |
| D | 4.1 | 18.1 | NA | 26.0 |
| (A) + (B) + (C) + (D) | 8.0 | 16.6 | 0.9 | 44.6 |
| Combined ABCD | 10.1 | 14.1 | 6.0 | 64.0 |
1The longest assembly contigs covering together half of the genome size (15 Mbp) are each longer than the NG50. This evaluation was not possible for the four individual cell assemblies for which the total assembly sizes are shorter than half of the genome size.
2A + B + C + D corresponds to a non-redundant merging of contigs from individual assemblies.
3Combined ABCD corresponds to the co-assembly process.
Figure 1Comparisons of relative abundances of Bathycoccus in the 0.8–5 μm size fraction samples from Tara Oceans stations.
Left: relative 18S rRNA V9 amplicons abundance (percent of reads). Right: relative metagenomic abundances (percent of metagenomic reads) from direct mapping of metagenomic reads onto two genome sequence assemblies (strain RCC1105 and TOSAG39-1, single cell assembly from an Indian Ocean sample). Stations and depth (Surface or DCM) are indicated on the Y axis.
Figure 2Geographical distribution of two Bathycoccus genomes, RCC1105 and TOSAG39-1, along Tara Oceans expedition stations from recruitments of metagenomic reads.
Top and bottom maps correspond to the surface and deep chlorophyll maximum (DCM) samples respectively. Gray crosses indicate Tara Oceans sampling stations and the sizes of the red or blue circles indicate the relative genomic abundances of the two Bathycoccus types. We generated this map using R-package maps_2.1-6, mapproj_1.1-8.3, gplots_2.8.0 and mapplots_1.4 (version R-2.13, https://cran.r-project.org/web/packages/maps/index.html).
Figure 3Relationships between environmental parameters and Bathycoccus genome abundance.
Left: Principal component analysis. We only considered stations where we detected 98% of the genes for one Bathycoccus genome, and for which all environmental parameters were available (Oxygen, Nitrates, Phosphates, Chlorophyll, Sampling Depth, Water Temperature and Salinity). Crosses indicate stations, with a color scale corresponding to the water temperature. The distance to coast parameter corresponds to the shortest geographical distance to the coast. The two Bathycoccus are distributed along temperature and oxygen axes. Stars indicate parameters that statistically discriminate the two Bathycoccus. Right: Range of values of temperature, oxygen and sampling depth for parameters where a significant difference was detected between RCC1105 and TOSAG39-1.
Figure 4Evidence for cassettes of dispensable genes in B. prasinos RCC1105.
Left and right sides of the figures represent fragment recruitment and genomic alignments of dispensable gene cassettes, respectively. Fragment recruitments plots are displayed by marine zones (left legend). Each dot corresponds to a given number of mapped reads at a given identity percent (indicated on the Y-axis). The density of mapped read is displayed as the black line plotted below each fragment recruitment plot. Gene positions are represented by black boxes on the top of the first fragment recruitment plot and dispensable genes are highlighted in red. Genomic alignments are represented as circos graphs79 on which dispensable genes are colored in red, and other genes are represented by black boxes. Left side and right side of the genomic region are connected to metagenomics contigs (gray segments), leaving in-between the locus of the dispensable gene cassette that remains unconnected to any metagenomic contig. Connections correspond to blast alignments positions. (A) 100- and 8.6-kb regions of chromosome 1 are represented on a fragment recruitment plot and on the circos graph, respectively. A two gene long cassette is represented. A massive decrease of read coverage appears on the fragment recruitment plot in all oceanic zones except in the Mediterranean Sea, which indicates that the two genes are present only in a sub-population in this basin. A similar pattern is observed in panel (B) for four consecutive genes for which fragment recruitment plots representing 100 kb of chromosome 5 suggest a presence in a Mediterranean sub-population and absence in other marine areas. The circos graph represents alignments along the 15.6-kb cassette locus with metagenomics contigs, which resulted in a gap that included three small genes (in blue) in addition to the four automatically detected dispensable genes. Fragment recruitment confirmed a significant, but not total, decrease of read coverage for these three genes in every oceanic zone, indicating that their presence or absence in the two sub-populations was widely distributed.