| Literature DB >> 29184540 |
Juline M Walter1,2, Felipe H Coutinho1,2, Bas E Dutilh2,3, Jean Swings4, Fabiano L Thompson1,5, Cristiane C Thompson1.
Abstract
Cyanobacteria are major contributors to global biogeochemical cycles. The genetic diversity among Cyanobacteria enables them to thrive across many habitats, although only a few studies have analyzed the association of phylogenomic clades to specific environmental niches. In this study, we adopted an ecogenomics strategy with the aim to delineate ecological niche preferences of Cyanobacteria and integrate them to the genomic taxonomy of these bacteria. First, an appropriate phylogenomic framework was established using a set of genomic taxonomy signatures (including a tree based on conserved gene sequences, genome-to-genome distance, and average amino acid identity) to analyse ninety-nine publicly available cyanobacterial genomes. Next, the relative abundances of these genomes were determined throughout diverse global marine and freshwater ecosystems, using metagenomic data sets. The whole-genome-based taxonomy of the ninety-nine genomes allowed us to identify 57 (of which 28 are new genera) and 87 (of which 32 are new species) different cyanobacterial genera and species, respectively. The ecogenomic analysis allowed the distinction of three major ecological groups of Cyanobacteria (named as i. Low Temperature; ii. Low Temperature Copiotroph; and iii. High Temperature Oligotroph) that were coherently linked to the genomic taxonomy. This work establishes a new taxonomic framework for Cyanobacteria in the light of genomic taxonomy and ecogenomic approaches.Entities:
Keywords: charting biodiversity; ecological niches; genome-based microbial taxonomy; high-throughput sequencing technology; metagenome; microbial ecology
Year: 2017 PMID: 29184540 PMCID: PMC5694629 DOI: 10.3389/fmicb.2017.02132
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Details of all cyanobacterial genomes included in this study.
| PCC 7122T | Freshwater | Cambridge, UK | 7 | 7.06 | 38.79 | 6,182 | 99.44 | β | ||||
| PCC 7108 | Marine (coastal) | Intertidal zone, Moss Beach, CA, USA | 3 | 5.9 | 38.78 | 5,169 | 99.63 | β | ||||
| C1b | Freshwater | Alkaline salt lakes | 63 | 6.09 | 44.69 | 4,852 | 99.71 | β | ||||
| NIES-39 | Freshwater | Alkaline salt lakes | 1 | 6.78 | 44.27 | 6,676 | 99.13 | β | ||||
| Paraca | Freshwater | Alkaline salt lakes | 239 | 6.49 | 44.31 | 5,436 | 99.34 | β | ||||
| PCC 8005 | Unknown | Unknown | 119 | 6.27 | 44.7 | 5,171 | 99.93 | β | ||||
| PCC 7103 | Freshwater | Crawford Co., Wisconsin, USA | 12 | 11.58 | 38.55 | 9,371 | 99.39 | β | ||||
| PCC 6605 | Freshwater | Berkeley, CA, USA | 1 | 6.28 | 45.73 | 5,956 | 99.48 | β | ||||
| PCC 7203T | Soil | Greifswald, Germany | 3 | 6.68 | 44.47 | 5,618 | 99.63 | β | ||||
| PCC 7420cT | Marine (coastal) | Salt marsh in Woods Hole, Massachusetts, USA | 142 | 8.65 | 45.43 | 7,100 | 99.37 | β | ||||
| PCC 9333 | NA | NA | 1 | 5.31 | 40.16 | 5,002 | 99.48 | β | ||||
| ESFC-1 | Marine (coastal) | Extremophylic mat communities, Elkhorn Slough estuary, CA, USA | 52 | 5.62 | 46.51 | 4,857 | 99.59 | β | ||||
| JSC-12 | Freshwater | NA | 20 | 5.52 | 47.49 | 5,024 | 99.29 | β | ||||
| PCC 7202T | Freshwater | Thermal springs, alkaline pod | 1 | 3.16 | 38.66 | 2,886 | 99.52 | β | ||||
| PCC 7417T | Soil | Stockholm, Sweden | 4 | 7.61 | 42.2 | 6,127 | 99.78 | β | ||||
| PCC 8305T | Freshwater | Solar Lake, Israel | 1 | 3.78 | 42.44 | 3,412 | 99.55 | β | ||||
| JSC-11 | NA | NA | 34 | 5.38 | 41.05 | 4,627 | 99.76 | β | ||||
| PCC 9339 | NA | NA | 13 | 8 | 40.16 | 6,720 | 99.76 | β | ||||
| PCC 9431 | NA | NA | 8 | 7.16 | 40.19 | 6,104 | 99.76 | β | ||||
| PCC 9605 | Soil | Limestone, Jerucham, Har Rahama, Israel | 12 | 8.08 | 42.61 | 7,060 | 100 | β | ||||
| PCC 7105 | NA | USA | 8 | 6.15 | 51.59 | 4,735 | 93.75 | β | ||||
| PCC 7407 | Unknown | Unknown | 1 | 4.68 | 58.46 | 3,727 | 99.87 | β | ||||
| PCC 6308T | Freshwater | Lake near Madison, Wisconsin, USA | 1 | 4.26 | 34.28 | 3,887 | 99.78 | β | ||||
| PCC 7428 | Thermal - Freshwater | Moderate hot spring | 1 | 5.43 | 43.27 | 5,254 | 99.78 | β | ||||
| PCC 73106 | Freshwater | Sphagnum bog, Switzerland | 228 | 4.025 | 41.11 | 3,704 | 98.84 | β | ||||
| PCC 7418 | Freshwater | Solar Lake, Israel | 1 | 4.18 | 42.92 | 3,663 | 99.48 | β | ||||
| PCC 6306T | Freshwater | Lake near Madison, Wisconsin, USA | 5 | 7.26 | 47.02 | 6,827 | 99.41 | β | ||||
| PCC 7104d | Marine (coastal) | Rock at shoreline, Montauk Point, Long Island, NY, USA | 2 | 6.89 | 57.69 | 6,414 | 99.18 | β | ||||
| PCC 7375 | Marine (coastal) | Plankton, Woods Hole, Massachusetts, USA | 5 | 9.42 | 47.62 | 8,366 | 99.73 | β | ||||
| PCC 7376 | Marine (coastal) | Limestone, Crystal Cave, Bermuda | 1 | 5.12 | 43.87 | 4,601 | 99.42 | β | ||||
| PCC 6406 | Freshwater | California, USA | 3 | 5.77 | 55.18 | 5,156 | 98.64 | β | ||||
| BL-J | NA | NA | 432 | 6.87 | 41.16 | 5,597 | 99.74 | β | ||||
| BDU | Marine | India | 298 | 8.79 | 55.63 | 8,370 | 99.34 | β | ||||
| PCC 8106e | Marine (coastal) | NA | 110 | 7.03 | 41.11 | 5,854 | 99.3 | β | ||||
| PCC 7113 | Soil | San Francisco, California, USA | 1 | 7.47 | 46.21 | 6,734 | 99.56 | β | ||||
| FGP-2 | Soil | Canyonlands National Park, UT, USA | 40 | 6.69 | 46.04 | 5,519 | 99.67 | β | ||||
| 3LfT | NA | NA | 287 | 8.38 | 43.68 | 6,979 | 98.56 | β | ||||
| PCC 7107 | Freshwater | Point Reyes Peninsula, California, USA | 1 | 6.32 | 40.36 | 5,200 | 99.26 | β | ||||
| PCC 7524 | Freshwater | Hot spring, Amparai District, Maha Oya, Sri Lanka | 3 | 6.71 | 41.53 | 5,326 | 99.33 | β | ||||
| PCC 6304T | Soil | NA | 1 | 7.68 | 47.6 | 6,004 | 99.71 | β | ||||
| PCC 7112 | Soil | USA | 1 | 7.47 | 45.87 | 6,925 | 99.78 | β | ||||
| PCC 10802 | NA | NA | 9 | 8.59 | 54.1 | 7,012 | 100 | β | ||||
| PCC 6506 | NA | NA | 377 | 6.67 | 43.4 | 6,007 | 99.12 | β | ||||
| PCC 6407 | Freshwater | NA | 12 | 6.89 | 43.43 | 5,693 | 99.56 | β | ||||
| CC9605T | Marine | California current, Pacific, oligotrophic, 51 m | 1 | 2.51 | 59.2 | 2,583 | 99.73 | α | ||||
| CC9902T | Marine | California current, Pacific, oligotrophic, 5 m | 1 | 2.23 | 54.2 | 2,289 | 99.46 | α | ||||
| WH8109T | Marine | Sargasso Sea | 1 | 2.12 | 60.1 | 2,661 | 99.32 | α | ||||
| WH8102T | Marine | Sargasso Sea | 1 | 2.43 | 59.4 | 2,461 | 99.46 | α | ||||
| BL107T | Marine | Blanes Bay, Mediterranean Sea, 1,800 m | 1 | 2.29 | 54.2 | 2,322 | 99.46 | α | ||||
| CC9311T | Marine | California current, Pacific, coastal, 95 m | 1 | 2.61 | 52.4 | 2,627 | 99.73 | α | ||||
| RS9917T | Marine | Gulf of Aqaba, Red Sea, 10 m | 1 | 2.58 | 64.4 | 2,575 | 99.46 | α | ||||
| RS9916T | Marine | Gulf of Aqaba, Red Sea, 10 m | 1 | 2.66 | 59.8 | 2,603 | 99.73 | α | ||||
| WH7803T | Marine | Sargasso Sea, 25 m | 1 | 2.37 | 60.2 | 2,439 | 99.18 | α | ||||
| WH7805T | Marine | Sargasso Sea | 3 | 2.63 | 57.6 | 2,595 | 99.73 | α | ||||
| WH8016T | Marine | Woods Hole, MA, USA | 16 | 2.69 | 54.1 | 2,990 | 99.18 | α | ||||
| WH5701T | Marine | Long Island Sound, Connecticut, USA | 116 | 3.28 | 65.4 | 2,917 | 99.46 | α | ||||
| CB0205 | Marine | Chesapeake Bay, Baltimore, Maryland, USA | 78 | 2.43 | 63 | 2,473 | 99.18 | α | ||||
| CB0101T | Marine | Chesapeake Bay, Baltimore, Maryland, USA | 94 | 2.69 | 64.2 | 2,757 | 99.73 | α | ||||
| RCC307T | Marine | Mediterranean Sea, 15 m | 1 | 2.22 | 60.8 | 2,348 | 99.64 | α | ||||
| NIVA-CYA 126/8 | Freshwater | NA | 13 | 5.04 | 39.57 | 4,188 | 100 | β | ||||
| NIVA-CYA 15 | Freshwater | NA | 238 | 5.38 | 39.48 | 4,606 | 100 | β | ||||
| NIVA-CYA 56/3 | Freshwater | NA | 185 | 5.48 | 39.48 | 4,674 | 99.78 | β | ||||
| NIVA-CYA 405 | Freshwater | NA | 240 | 5.46 | 39.47 | 4,697 | 99.56 | β | ||||
| NIVA-CYA 406 | Freshwater | NA | 375 | 5.62 | 39.51 | 4,873 | 100 | β | ||||
| NIVA-CYA 540 | Freshwater | NA | 157 | 5.5 | 39.48 | 4,710 | 99.78 | β | ||||
| NIVA-CYA 98 | Freshwater | NA | 346 | 5.61 | 39.52 | 4,862 | 99.78 | β | ||||
| NIVA-CYA 407 | Freshwater | NA | 219 | 5.39 | 39.46 | 4,658 | 100 | β | ||||
| PCC 7319 | Marine (coastal) | Arizona Station, Gulf of California, Puerto Penasco, Mexico | 10 | 7.38 | 38.74 | 4,516 | 99.56 | β | ||||
| PCC 7319 | Marine (coastal) | Arizona Station, Gulf of California, Puerto Penasco, Mexico | 10 | 7.38 | 38.74 | 4,516 | 99.56 | β | ||||
| AS9601T | Marine | Arabian Sea, 50 m | 1 | 1.66 | 31.32 | 1,769 | 99.64 | α | ||||
| CCMP1986 | E. marinus | Marine | Mediterranean Sea, 5 m | 1 | 1.65 | 30.8 | 1,777 | 99.46 | α | |||
| MIT9312T | Marine | Gulf Stream, 135 m | 1 | 1.7 | 31.21 | 1,815 | 99.73 | α | ||||
| MIT9202T | Marine | South Pacific, 79 m | 1 | 1.69 | 31.1 | 1,795 | 98.78 | α | ||||
| MIT9215 | Marine | Equatorial Pacific, surface | 1 | 1.73 | 31.15 | 1,840 | 99.73 | α | ||||
| MIT9301T | Marine | Sargasso Sea, 90 m | 1 | 1.64 | 31.34 | 1,774 | 99.46 | α | ||||
| MIT9515T | Marine | Equatorial Pacific, 15 m | 1 | 1.7 | 30.79 | 1,784 | 100 | α | ||||
| NATL1A | Marine | Northern Atlantic, 30 m | 1 | 1.86 | 34.98 | 2,204 | 99.73 | α | ||||
| NATL2AT | Marine | Northern Atlantic, 10 m | 1 | 1.84 | 35.12 | 1,930 | 99.45 | α | ||||
| CCMP1375T | Marine | Sargasso Sea, 120 m | 1 | 1.75 | 36.44 | 1,883 | 100 | α | ||||
| MIT9211T | Marine | Equatorial Pacific, 83 m | 1 | 1.68 | 38.01 | 1,748 | 99.73 | α | ||||
| MIT9303 | Marine | Sargasso Sea, 100 m | 1 | 2.68 | 50.01 | 2,504 | 100 | α | ||||
| MIT9313T | Marine | Gulf Stream, 135 m | 1 | 2.41 | 50.74 | 2,339 | 99.46 | α | ||||
| PCC 7429 | Freshwater | NA | 464 | 5.47 | 43.18 | 4,774 | 99.29 | β | ||||
| PCC 7367 | Marine (coastal) | Intertidal zone, Mexico | 1 | 4.55 | 46.31 | 3,960 | 98.23 | β | ||||
| PCC 6802 | Freshwater | California, USA | 6 | 5.62 | 47.83 | 5,363 | 99.76 | β | ||||
| PCC 7116 | Marine (coastal) | La Paz, Baja California Sur, Mexico | 3 | 8.72 | 37.53 | 6,612 | 99.78 | β | ||||
| PCC 9445 | NA | NA | 10 | 5.32 | 47.39 | 4,580 | 99.56 | β | ||||
| PCC 7437T | Freshwater | Havana, Cuba | 6 | 5.54 | 36.22 | 4,895 | 99.56 | β | ||||
| PCC 6301T | Freshwater | NA | 1 | 2.7 | 55.5 | 2,576 | 99.73 | β | ||||
| PCC 7942 | Freshwater | NA | 2 | 2.74 | 55.46 | 2,655 | 100 | β | ||||
| JA23Ba213T | Thermal-Freshwater | Octopus Spring, Yellowstone Park, USA | 1 | 3.05 | 58.5 | 3,064 | 100 | β | ||||
| JA33AbT | Thermal-Freshwater | Octopus Spring, Yellowstone Park, USA | 1 | 2.93 | 60.2 | 3,036 | 100 | β | ||||
| PCC 6312T | Freshwater | California, USA | 2 | 3.72 | 48.49 | 3,795 | 99.29 | β | ||||
| PCC 7002T | Unknown | Unknown | 7 | 3.41 | 49.16 | 3,121 | 100 | β | ||||
| PCC 7335T | Marine (coastal) | Snail shell, intertidal zone, Puerto Penasco, Mexico | 11 | 5.97 | 48.2 | 5,702 | 98.91 | β | ||||
| PCC 7336T | Marine (coastal) | Sea Water Tank, Berkeley University, CA, USA | 1 | 5.07 | 53.7 | 5,093 | 100 | β | ||||
| PCC 7502T | Sphagnum bog (peat bog) | NA | 3 | 3.58 | 40.6 | 3,703 | 99.76 | β | ||||
| PCC 7509 | Soil | Rock scraping, Switzerland | 4 | 4.9 | 41.67 | 4,859 | 99.67 | β | ||||
| IMS101T | Marine (coastal) | NA | 1 | 7.75 | 34.14 | 4,358 | 99.71 | β | ||||
| PCC 7305 | Marine (coastal) | Aquarium, La Jolla, CA, USA | 234 | 5.92 | 39.68 | 4,992 | 99.78 | β | ||||
| PCC 7421 | Soil | Calcareous (chalky) rock, Switzerland | 1 | 4.66 | 62 | 4,511 | 99.15 | β |
Ecological and molecular features were indicated, such as environment sampling, as well as number of contigs, genome size, GC % content, completeness score, and carboxysome type. The following classifications for detailed for comparison: NCBI (order and family), numeral identification and genera according to Kozlov et al. (.
Cyanobacterial genomes used in Komárek et al. (.
Arthrospira platensis is also called Spirulina platensis.
Coleofasciculus chthonoplastes PCC 7420 is also called Microcoleus chthonoplastes PCC 7420.
Leptolyngbya sp. PCC 7104 is also called Nodosilinea nodulosa PCC 7104.
Lyngbya aestuarii PCC8106 is also called L. aestuarii CCY9616, and even the former name Oscillatoria limosa PCC8106.
Moorea producens 3L is also called Moorea producta 3L.
Oscillatoria sp. PCC 6407 is also called Kamptonema formosum PCC 6407, and even O. formosa PCC 6407.
Outgroup used in the phylogenetic analysis.
New taxonomic identification proposed by Coutinho et al. (.
New taxonomic classification proposed by Thompson et al. (2013a)
Number of contigs, total length and GC content values were obtained using QUAST tool.
Values using CheckM tool.
Figure 1Phylogenomic tree of the Cyanobacteria phylum with the proposed new names. Tree construction was performed using 100 genomes (ninety-nine used in this study plus the outgroup), based on a set of conserved marker genes. The numbers at the nodes indicate bootstrap values as percentages greater than 50%. Bootstrap tests were conducted with 1,000 replicates. The unit of measure for the scale bars is the number of nucleotide substitutions per site. The Gloeobacter violaceus PCC 7421 sequence was designated as outgroup. Capital letters indicate environmental source: F, freshwater; M, marine; P, peat bog (sphagnum); S, soil; T, thermal; and §, other habitat. New names are highlighted in red. Overwritten T indicates type strain or type species. Ecogenomic groups are depicted in different colors as indicated in the legend: Low Temperature group; Low Temperature Copiotroph group; and High Temperature Oligotroph group. Cases depicted in the Results section are in bold.
Figure 2Heatmap displaying the AAI levels between cyanobacterial genomes. The intraspecies limit is assumed as ≥95%, whereas genera delimitation is assumed as ≥70% (dashed lines) AAI. Clustering the genomes by AAI similarity was done using a hierarchical clustering method in R (hclust), based on Manhattan distances. The AAI values are associated with the respective thermal color scale located at the bottom left corner of the figure. The proposed new genera and species names were adopted in this figure.
Figure 3Correlations between Cyanobacteria and environmental variables. Heatmap displays Spearman correlation scores between the abundance of cyanobacterial genomes and measured environmental parameters at Tara Ocean sampling sites. Correlations that showed q corrected p < 0.05 are marked with stars. Variables were grouped through the complete linkage clustering method using Manhattan distances as input. The proposed new genera and species names were adopted in this figure.
Figure 4Ecogenomic analysis of Cyanobacteria in global marine environments. (A) Distribution of the dominant ecogenomic groups (Low Temperature group; Low Temperature Copiotroph group; and High Temperature Oligotroph) along the Tara Ocean transect sampling from surface layer (5 m). (B) Distribution of the dominant ecogenomic groups along the Tara Ocean transect sampling from subsurface layer (>5 m). (C) Non-metric multidimensional scaling (NMDS) analysis of the marine metagenomes and environmental parameters. Ordination plot of physicochemical parameters. Dots indicate the metagenomes samples. Distances were calculated based on the Bray-Curtis Method. NMDS stress value = 0.15. (D) Non-metric multidimensional scaling (NMDS) analysis of the marine metagenomes and environmental parameters. Ordination plot of ecogenomic clusters. Dots indicate the metagenomes samples. Distances were calculated based on the Bray-Curtis Method. NMDS stress value = 0.15.