| Literature DB >> 26784354 |
Alexander Eiler1, Rhiannon Mondav1, Lucas Sinclair1, Leyden Fernandez-Vidal1, Douglas G Scofield2, Patrick Schwientek3, Manuel Martinez-Garcia4, David Torrents5,6, Katherine D McMahon7,8, Siv Ge Andersson9, Ramunas Stepanauskas4, Tanja Woyke3, Stefan Bertilsson1.
Abstract
Most free-living planktonic cells are streamlined and in spite of their limitations in functional flexibility, their vast populations have radiated into a wide range of aquatic habitats. Here we compared the metabolic potential of subgroups in the Alphaproteobacteria lineage SAR11 adapted to marine and freshwater habitats. Our results suggest that the successful leap from marine to freshwaters in SAR11 was accompanied by a loss of several carbon degradation pathways and a rewiring of the central metabolism. Examples for these are C1 and methylated compounds degradation pathways, the Entner-Doudouroff pathway, the glyoxylate shunt and anapleuretic carbon fixation being absent from the freshwater genomes. Evolutionary reconstructions further suggest that the metabolic modules making up these important freshwater metabolic traits were already present in the gene pool of ancestral marine SAR11 populations. The loss of the glyoxylate shunt had already occurred in the common ancestor of the freshwater subgroup and its closest marine relatives, suggesting that the adaptation to freshwater was a gradual process. Furthermore, our results indicate rapid evolution of TRAP transporters in the freshwater clade involved in the uptake of low molecular weight carboxylic acids. We propose that such gradual tuning of metabolic pathways and transporters toward locally available organic substrates is linked to the formation of subgroups within the SAR11 clade and that this process was critical for the freshwater clade to find and fix an adaptive phenotype.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26784354 PMCID: PMC5029164 DOI: 10.1038/ismej.2015.260
Source DB: PubMed Journal: ISME J ISSN: 1751-7362 Impact factor: 10.302
Description of the genome data
| HTCC1002 | 638341056 | 1a.1 | Marine | 1327604 | 0.99 | 1346462 | 1428 | 1448 | 5 | 0.3 | 95.9 | 35 |
| HTCC1013 | 2511231042 | 1a.1 | Marine | 1302704 | 1.00 | 1302704 | 1411 | 1411 | 22 | 0.3 | 96.6 | 36 |
| HTCC1062 | 637000058 | 1a.1 | Marine | 1308759 | 1.00 | 1308759 | 1394 | 1394 | 1 | 0.3 | 96.1 | 40 |
| HTCC9565 | 2503283022 | 1a.1 | Marine | 1279674 | 0.99 | 1294174 | 1386 | 1402 | 3 | 0.29 | 96.2 | 34 |
| HIMB083 | 2510461079 | 1a.2 | Marine | 1395997 | 1.00 | 1395997 | 1505 | 1505 | 1 | 0.29 | 96.8 | 37 |
| HIMB140 | 2503754000 | 1a.2 | Marine | 1437930 | 1.00 | 1437930 | 1535 | 1535 | 1 | 0.29 | 97.0 | 31 |
| HIMB5 | 2503982040 | 1a.2 | Marine | 1343202 | 1.00 | 1343202 | 1467 | 1467 | 1 | 0.29 | 96.8 | 36 |
| HTCC7211 | 2503283017 | 1a.2 | Marine | 1456888 | 1.00 | 1456888 | 1481 | 1481 | 1 | 0.29 | 96.7 | 34 |
| HTCC8051 | 2511231043 | 1a.2 | Marine | 1395115 | 0.99 | 1410923 | 1498 | 1515 | 2 | 0.29 | 96.8 | 36 |
| HTCC9022 | 2511231041 | 1a.2 | Marine | 1362422 | 0.99 | 1381774 | 1464 | 1485 | 24 | 0.3 | 96.5 | 37 |
| SCGCAAA288-G21 | 2236347014 | 1c | Marine | 909786 | 0.64 | 1421541 | 1103 | 1723 | 139 | 0.3 | 94.4 | 29 |
| SCGCAAA288-N07 | 2236347020 | 1c | Marine | 954664 | 0.66 | 1446461 | 1110 | 1682 | 81 | 0.29 | 94.8 | 27 |
| SCGCAAA240-E13 | 2236661011 | 1c | Marine | 1401625 | 0.86 | 1629797 | 1621 | 1885 | 151 | 0.29 | 93.6 | 40 |
| SCGCAAA288-E13 | 2236661012 | 1c | Marine | 812863 | 0.55 | 1477933 | 948 | 1724 | 106 | 0.29 | 93.6 | 25 |
| HIMB058 | 2265129005 | 2 | Marine | 1115049 | 0.99 | 1130887 | 1269 | 1287 | 59 | 0.3 | 96.9 | 39 |
| HIMB114 | 2503283019 | 3a | Marine | 1237371 | 1.00 | 1237371 | 1357 | 1357 | 1 | 0.3 | 97.0 | 36 |
| IMCC9063 | 650716017 | 3a | Marine | 1284727 | 1.00 | 1284727 | 1482 | 1482 | 1 | 0.32 | 94.9 | 35 |
| SCGCAAA023-L09 | 2236661000 | 3b/LD12 | Freshw | 774923 | 0.68 | 1147914 | 921 | 1364 | 76 | 0.29 | 96.0 | 29 |
| SCGCAAA024-N17 | 2236876027 | 3b/LD12 | Freshw | 328144 | 0.24 | 1394612 | 397 | 1687 | 45 | 0.3 | 95.5 | 14 |
| SCGCAAA027-C06 | 2264265094 | 3b/LD12 | Freshw | 775384 | 0.72 | 1072915 | 936 | 1295 | 90 | 0.3 | 95.9 | 20 |
| SCGCAAA027-J10 | 2236876030 | 3b/LD12 | Freshw | 792980 | 0.73 | 1080510 | 952 | 1297 | 82 | 0.3 | 96.0 | 31 |
| SCGCAAA027-L15 | 2236876031 | 3b/LD12 | Freshw | 719587 | 0.63 | 1141744 | 840 | 1333 | 56 | 0.29 | 96.5 | 19 |
| SCGCAAA028-C07 | 2236661008 | 3b/LD12 | Freshw | 846566 | 0.76 | 1111117 | 974 | 1278 | 32 | 0.3 | 96.3 | 19 |
| SCGCAAA028-D10 | 2236347069 | 3b/LD12 | Freshw | 925141 | 0.78 | 1192329 | 1091 | 1406 | 57 | 0.3 | 95.6 | 34 |
| SCGCAAA280-B11 | 2236876032 | 3b/LD12 | Freshw | 674250 | 0.54 | 1260247 | 815 | 1523 | 47 | 0.3 | 95.8 | 18 |
| SCGCAAA280-P20 | 2236876029 | 3b/LD12 | Freshw | 720523 | 0.68 | 1062920 | 838 | 1236 | 65 | 0.3 | 95.6 | 24 |
| SCGCAAA487-M09 | 2236347068 | 3b/LD12 | Freshw | 627365 | 0.61 | 1027382 | 800 | 1310 | 97 | 0.29 | 96.1 | 24 |
| HIMB59 | 2503982041 | 5 | Marine | 1410127 | 1.00 | 1410127 | 1532 | 1532 | 1 | 0.32 | 96.8 | 39 |
Abbreviations: Com, completeness; EGS, estimated genome size; Freshw, freshwater. Genomes including 'SCGCAAA' in the names are single amplified genomes while others are derived from isolates.
Figure 1Evolutionary relationships between marine and freshwater SAR11 genomes. Unrooted maximum likelihood phylogenetic tree of concatenated ribosomal protein sequences from single-cell genomes and isolate genomes of the SAR11 clade. In addition to bootstrap values as inferred by maximum likelihood, the second (large and bold) number at the branching points show proportion of protein trees with corresponding branching pattern. These proportions were generated by using the Robinson–Foulds metric summarizing 518 orthologous protein trees that had at least one representative in each of the seven subclades (for details, see Supplementary Table S1). Freshwater SAR11 (subgroup IIIb/LD12) are indicated in red, whereas marine SAR11 subgroups in other colors.
Figure 2Nine protein trees with evolutionary inconsistency when compared with the ribosomal tree. (a–i) Unrooted phylogenetic trees of orthologous protein clusters with alternative tree topologies (that is, sister clade to IIIb/LD12 is not IIIa) when compared with the ribosomal tree (Figure 1).
Figure 3Central carbon metabolism and other relevant metabolic pathways identified in SAR11 genomes. Glycogenesis, TCA cycle and glyoxylate shunt are shown in the center of the plot with adjunction pathways to either side such as the pentose phosphate and ED pathway. The color of the arrows indicates genes encoded in at least one genome of each SAR11 subgroup (see legend for details). The presence and absence of genes was determined within the IMG system based on automated and manual annotations. Detailed results are in Supplementary Tables S6–S9.
Figure 4Closest blast hits against refseq and rarefaction curves of orthologous SAR11 protein clusters. (a) Bar charts revealing top hits from BLASTP against all available genomes in refseq. Hits are classified based on their taxonomic affiliation with higher taxonomic levels removed from lower taxonomic levels (that is, SAR11 hits are removed from the alphaproteobacterial bin, with alphaproteobacterial hits removed from the proteobacterial bin and so on). Black bars indicate hits of genes specific to freshwaters as inferred by orthologous clustering. (b) Detailed representation of freshwater-specific orthologous protein clusters and their closest hits classified on their taxonomic affiliation with higher taxonomic levels removed from lower taxonomic levels. (c) Results from a rarefaction analysis of orthologous protein clusters after normalization to account for the partial single-cell genomes.