| Literature DB >> 29523891 |
E D Graham1, J F Heidelberg1,2, B J Tully3,4.
Abstract
Aerobic anoxygenic phototrophs (AAnPs) are common in marine environments and are associated with photoheterotrophic activity. To date, AAnPs that possess the potential for carbon fixation have not been identified in the surface ocean. Using the Tara Oceans metagenomic dataset, we have identified draft genomes of nine bacteria that possess the genomic potential for anoxygenic phototrophy, carbon fixation via the Calvin-Benson-Bassham cycle, and the oxidation of sulfite and thiosulfate. Forming a monophyletic clade within the Alphaproteobacteria and lacking cultured representatives, the organisms compose minor constituents of local microbial communities (0.1-1.0%), but are globally distributed, present in multiple samples from the North Pacific, Mediterranean Sea, the East Africa Coastal Province, and the Atlantic. This discovery may require re-examination of the microbial communities in the oceans to understand and constrain the role this group of organisms may play in the global carbon cycle.Entities:
Mesh:
Year: 2018 PMID: 29523891 PMCID: PMC6018677 DOI: 10.1038/s41396-018-0091-3
Source DB: PubMed Journal: ISME J ISSN: 1751-7362 Impact factor: 10.302
Summary of genome statistics for the putative members of the ‘Ca. Luxescamonaceae’
| Genome ID | PufM presence | RbcL presence | No. of contigs | Total length (bp) | Max. contig length (bp) | N50 | Mean length (bp) | GC (%) | No. of predicted CDSa | Est. completeness (%)b | Est. contamination (Est. strain heterogeneity) (%)b |
|---|---|---|---|---|---|---|---|---|---|---|---|
| NAT185c,d | + | + | 90 | 2,235,338 | 199,971 | 38,700 | 24,761 | 38.0 | 2,198 | 94.77 | 0.39 (66.67) |
| NAT102c,d | + | + | 105 | 1,995,055 | 67,548 | 22,200 | 18,995 | 37.7 | 1,936 | 86.20 | 0.67 (0.00) |
| NP970c,d | + | + | 119 | 2,435,294 | 84,644 | 32,121 | 20,380 | 36.1 | 2,359 | 82.71 | 2.00 (33.33) |
| SAT68c,d | + | + | 192 | 2,295,506 | 62,884 | 13,375 | 11,880 | 37.1 | 2,351 | 80.00 | 5.53 (71.88) |
| MED800c,d | + | + | 176 | 2,162,263 | 55,714 | 13,593 | 12,191 | 37.0 | 2,219 | 69.75 | 2.80 (100.00) |
| EAC638c,d | + | + | 197 | 2,499,174 | 80,634 | 14,967 | 12,620 | 30.9 | 2,599 | 66.59 | 1.20 (100.00) |
| TARA-MED-MAG-00011d,e | + | + | 100 | 1,971,082 | 70,540 | 31,135 | 19,710 | 37.7 | 1,949 | 88.20 | 0.40 (0.00) |
| TARA-MED-MAG-00076e | + | + | 86 | 1,475,546 | 86,697 | 22,424 | 17,157 | 37.4 | 1,475 | 75.60 | 0.53 (50.00) |
| TARA-ANE-MAG-00073d,e | + | + | 142 | 811,509 | 16,501 | 6438 | 5714 | 38.6 | 858 | 38.24 | 0.00 (0.00) |
| TARA-MED-MAG-00097e | − | + | 168 | 982,411 | 26,063 | 6725 | 5874 | 36.1 | 1,058 | 63.97 | 0.93 (100.00) |
| TARA-ANE-MAG-00058d,e | − | − | 136 | 907,139 | 38.673 | 7370 | 6670 | 37.6 | 966 | 59.48 | 1.16 (80.00) |
| TARA-MED-MAG-00111e | − | − | 133 | 848,198 | 26,144 | 6958 | 6377 | 34.9 | 960 | 59.27 | 1.81 (33.33) |
| TARA-PSE-MAG-00093d,e | − | − | 70 | 1,073,290 | 53,951 | 19,516 | 15,332 | 27.7 | 1,144 | 72.10 | 0.13 (100.00) |
| TARA-MED-MAG-00072e | − | − | 332 | 1,996,491 | 57,250 | 6935 | 6013 | 36.8 | 2,103 | 78.65 | 0.93 (66.67) |
| TARA-ANW-MAG-00032d,e | + | − | 211 | 1,576,546 | 29,415 | 9704 | 7471 | 39.2 | 1,674 | 73.03 | 1.20 (33.33) |
| TARA-MED-MAG-00129d,e | + | − | 259 | 1,489,684 | 20,564 | 6389 | 5751 | 39.2 | 1,579 | 50.59 | 0.00 (0.00) |
| TARA-IOS-MAG-00017e | + | − | 210 | 2,258,247 | 64,859 | 15147 | 10,753 | 38.7 | 2,269 | 91.57 | 1.10 (12.50) |
| TARA-ANE-MAG-00066e | − | + | 276 | 1,221,441 | 16,829 | 4451 | 4425 | 38.3 | 1,371 | 58.85 | 1.40 (57.14) |
N50—length of contig for which all contigs longer in length contain half of the total genome
CDS—coding DNA sequence
a Determined using Prodigal
b Determine using CheckM with the Alphaproteobacteria markers (388 markers in 250 sets)
c MAGs determined in Tully et al. [11]
d MAGs manually refined in this study
e MAGs determined in Delmont et al. [12]
Fig. 1a Phylogenomic tree of 31 concatenated marker genes for the Alphaproteobacteria. Numbers in parentheses represent the number of genomes collapsed within a branch. Black stars denote genomes within the ‘Ca. Luxescamonaceae’ that possess PufM and RbsL; gray stars denote genomes that possess RbsL only; white stars denote genomes that possess PufM only. Bootstrap values >0.75 are shown. Circle size representing the bootstrap value is scaled from 0.75–1.0. b Cellular schematic comparing the six AAnP genomes derived from Tully et al. [11]. c Comparison of predicted functions for the entire ‘Ca. Luxescamonaceae’ clade, selected neighbors, and previously described anaerobic anoxygenic phototrophs. Dendrogram represents the phylogenetic relationship between members of the ‘Ca. Luxescamonaceae’. Black boxes and blue arrows denote specific comparison discussed in the manuscript. Predicted functions are represented on a scale from 0 to 1 denoting the fraction of completeness a pathway or function has within a genome. KEGG module or ontology IDs used to determine function completeness are noted
Fig. 2Global map illustrating the Tara Oceans sampling sites. Sites at which the AAnP members of the ‘Ca. Luxescamonaceae’ were detected at >0.01% relative abundance are depicted. For each site, filter size fractions that were not collected are represented by an ‘X’ and each column represents one of the three Tara Oceans filter size fractions. Red asterisks denote filter fractions in which the relative abundance of genomes from Delmont et al. [12] contributed at least 0.01% (max. 0.04%) of the total relative abundance (this study). Squares highlighted in red denote filter samples in which ‘Ca. Luxescamonaceae’ genomes from Delmont et al. [12] had a reported relative fraction of the metagenome of 0.01–0.03% [12]