| Literature DB >> 21304748 |
Jack A Gilbert, Folker Meyer, Lynn Schriml, Ian R Joint, Martin Mühling, Dawn Field.
Abstract
Both metagenomic data and metatranscriptomic data were collected from surface water (0-2m) of the L4 sampling station (50.2518 N, 4.2089 W), which is part of the Western Channel Observatory long-term coastal-marine monitoring station. We previously generated from this area a six-year time series of 16S rRNA V6 data, which demonstrated robust seasonal structure for the bacterial community, with diversity correlated with day length. Here we describe the features of these metagenomes and metatranscriptomes. We generated 8 metagenomes (4.5 million sequences, 1.9 Gbp, average read-length 350 bp) and 7 metatranscriptomes (392,632 putative mRNA-derived sequences, 159 Mbp, average read-length 272 bp) for eight time-points sampled in 2008. These time points represent three seasons (winter, spring, and summer) and include both day and night samples. These data demonstrate the major differences between genetic potential and actuality, whereby genomes follow general seasonal trends yet with surprisingly little change in the functional potential over time; transcripts tended to be far more structured by changes occurring between day and night.Entities:
Keywords: Marine; aerobic; coastal; diel; metagenome; metatranscriptome; pyrosequencing; seasonal; surface water; temperate; time-series
Year: 2010 PMID: 21304748 PMCID: PMC3035373 DOI: 10.4056/sigs.1202536
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Group-average dendrogram showing relationship between all metagenomes and metatranscriptomes, based on comparison of annotated protein fragments via BLAST x using the SEED database in MG-RAST for each dataset. MTS – metatranscriptome. MGS – metagenome.
Classification and general feature of 8 metagenome datasets according to the MIMS recommendations [5].
| | | | |
|---|---|---|---|
| Current classification | Metagenome ecological | TAS [ | |
| 5 | Collection date | Jan Day: 2008-01-28T15:30 | TAS [ |
| 6 | Latitude Longitude | Jan Day: 50.2518:4.2089 | NAS |
| 7 | Depth | 0 | NAS |
| 8 | Altitude | 0 | NAS |
| 9 | Geographic location/Country | England | NAS |
| 10 | Environment | Coastal Marine | |
| 11a | Environmental Package | See | |
| 29 | Sample collection device or method | Large bore peristaltic filtration pump | |
| 30 | Sample material processing | Water filtered on to a 0.22 µm Sterivex (Millipore) | |
| 31 | Amount or size of sample collected | 10L |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [7]. If the evidence code is IDA, then the property was directly observed for a live isolate by one of the authors or an expert mentioned in the acknowledgements.
Environmental variables for each sampling occasion
| Sample Collection date (MIGS-5) | | ||||||||
| Sample collection time | 15:38 | 19:30 | 16:00 | 22:00 | 16:00 | 22.00 | 04:00 | 10:00 | |
| Temperature (ºC) | 10.1 | 10.1 | 9.7 | 9.6 | 15.9 | 15.8 | 15.7 | 15.8 | IDA |
| Density (kg m-2) | 1025.6 | 1026.3 | 1027.2 | 1027.1 | 1023.5 | 1024.3 | 1024.5 | 1024.4 | |
| Salinity (PSU) | 33.3 | 34.2 | 35.1 | 35.0 | 32.1 | 33.0 | 33.3 | 33.2 | |
| Chlorophyll a (µg/L) | 0.8 | 0.9 | 2.2 | 1.3 | 9.2 | 8.2 | 9.8 | 11.9 | IDA |
| Total Organic Nitrogen (µmol L-1) | 1.3 | 3.5 | 2.9 | 2.8 | 2.8 | 2.3 | 3.0 | 4.1 | IDA |
| Total Organic Carbon (µmol L-1) | 33.2 | 38.2 | 27.2 | 19.4 | 26.8 | 26.5 | 22.0 | 23.7 | IDA |
| NO2 + NO3 (µmol L-1) | 10.9 | 10.0 | 4.0 | 3.8 | 0.1 | 0.1 | 0.9 | 0.1 | |
| Ammonia (µmol L-1) | 0.0 | 0.0 | 0.5 | 0.3 | 0.1 | 0.1 | 0.1 | 0.1 | IDA |
| SRP (µmol L-1) | 0.5 | 0.5 | 0.4 | 0.3 | 0.0 | 0.1 | 0.0 | 0.1 | |
| Silicate (µmol L-1) | 6.0 | 5.8 | 2.6 | 2.7 | 0.1 | 0.2 | 0.3 | 0.2 | |
a Samples collected January – August, 2008. Evidence codes: MIGS-5: TAS [5].
Figure 2Monthly annual averages for all environmental parameters and species richness (S). TO – total organic; SRP – Soluble Reactive Phosphorous; PAR – Photosynthetically Active Radiation; NAO – North Atlantic Oscillation. Data taken from Gilbert et al., 2010.
Figure 3Principal component analysis of environmental variables showing the seasonal differences in variables outlined in Table 2. Classification and general features of the 15 datasets in accordance with the MIMS recommendations [5]
Metagenome sequencing project information (MIMS compliance)
| MIGS ID | Property | | | | | | | | |
|---|---|---|---|---|---|---|---|---|---|
| | 616,793 | 784,823 | 637,801 | 493,003 | 620,759 | 524,953 | 500,117 | 326,475 | |
| | Gilbert et al. 2008 | ||||||||
| | 454 Titanium pyrosequencing (GS flx) | ||||||||
| | none | ||||||||
| | SRA009436 | ||||||||
| | 01-12-2009 | ||||||||
| | GM00104 | ||||||||
Metatranscriptome sequencing project information (MIMS compliance)
| | | | | | | | | |
|---|---|---|---|---|---|---|---|---|
| | 139,880 | 130,826 | 124,925 | 147,492 | 139,375 | 193,254 | 154,865 | |
| | Gilbert et al. 2008 | |||||||
| | 454 Titanium pyrosequencing (GS flx) | |||||||
| | none | |||||||
| | SRA009436 | |||||||
| | 01-12-2009 | |||||||
| | GM00104 | |||||||
Metagenome statistics
| | | | | | | | | |
|---|---|---|---|---|---|---|---|---|
| No. Original DNA Sequences | 616,793 | 784,823 | 637,801 | 493,003 | 620,759 | 524,953 | 500,117 | 326,475 |
| Predicted ORFs (>40aa pORFs) | 862,695 | 1,287,412 | 1,003,799 | 745,305 | 986,269 | 846,209 | 779,951 | 491,330 |
| No. of pORF clusters (95%) | 615,374 | 1,123,829 | 779,342 | 588,387 | 881,113 | 703,712 | 675,210 | 444,729 |
| No. of pORF singletons (95%) | 546,463 | 1,031,865 | 682,586 | 526,233 | 805,284 | 634,042 | 608,785 | 410,616 |
| No. of pORF ‘families’ (60%) | 423,674 | 1,031,904 | 678,547 | 528,213 | 801,760 | 637,542 | 620,403 | 419,461 |
| No. of pORF singletons (60%) | 352,938 | 962,073 | 609,351 | 486,712 | 740,032 | 589,839 | 577,027 | 398,202 |
| No. of pORF clusters (95%) (66529) | 56337 | 64446 | 61187 | 59904 | 65601 | 63032 | 64729 | 65075 |
| No. of pORF singletons (95%) (66529) | 52891 | 63378 | 58691 | 57779 | 64818 | 61068 | 63359 | 63945 |
| Good’s Coverage (66529) | 20.5 | 4.7 | 11.8 | 13.2 | 2.6 | 8.2 | 4.8 | 3.9 |
| No. DNA seqs with | 122,936 | 291,953 | 258,658 | 164,249 | 283,761 | 196,369 | 196,972 | 126,392 |
| No. DNA seqs without | 493,857 | 492,870 | 379,143 | 328,754 | 336,998 | 328,584 | 303,145 | 200,083 |
| Percent DNA seqs without | 80% | 63% | 59% | 67% | 54% | 63% | 61% | 61% |
| No. DNA seqs with taxonomic | 190,326 | 417,920 | 349,888 | 241,541 | 379,911 | 288,356 | 304,003 | 186,421 |
| Number of archaeal sequences (186,421) | 19,055 | 15,150 | 777 | 561 | 1,370 | 1,093 | 1,585 | 1,244 |
| Number of bacterial sequences (186,421) | 161,899 | 146,911 | 182,850 | 180,674 | 182,717 | 176,825 | 180,725 | 182,332 |
Metatranscriptome statistics
| Jan 3pm | Jan 7pm | Apr 4pm | Apr 10pm | Aug 27 4pm | Aug 27 10pm | Aug 28 4am | |
|---|---|---|---|---|---|---|---|
| No. Original cDNA Sequences | 139,880 | 130,826 | 124,925 | 147,492 | 139,375 | 193,254 | 154,865 |
| No. of sequences following filtering*** | 94,024 | 106,864 | 84,916 | 109,577 | 87,799 | 118,360 | 111,568 |
| No. mRNA following removal of rRNA | 61,831 | 96,026 | 41,378 | 53,413 | 33,149 | 51,829 | 55,006 |
| Predicted ORFs (>40aa pORFs) | 143,169 | 211,374 | 81,642 | 107,699 | 77,985 | 66,529 | 159,909 |
| No. of pORF clusters (95%) | 98,871 | 78,278 | 35,648 | 51,088 | 28,167 | 24,136 | 68,080 |
| No. of pORF singletons (95%) | 82,464 | 54,870 | 25,925 | 38,960 | 19,600 | 17,177 | 50,246 |
| No. of pORF ‘families’ (60%) | 84,598 | 45,049 | 19,131 | 37,628 | 15,146 | 12,735 | 41,480 |
| No. of pORF singletons (60%) | 76,655 | 30,720 | 13,869 | 30,919 | 9,857 | 9,134 | 32,662 |
| No. of pORF clusters (95%) (66529) | 31026 | 50354 | 30334 | 34217 | 24848 | 24136 | 33191 |
| No. of pORF singletons (95%) (66529) | 23038 | 43687 | 22394 | 26840 | 17373 | 17177 | 25636 |
| Good’s Coverage (66529) | 65.37 | 34.33 | 66.34 | 59.66 | 73.89 | 74.18 | 61.47 |
| No. mRNA seqs with | 11,513 | 31,990 | 8,845 | 16,315 | 11,720 | 5,907 | 15,384 |
| No. mRNA seqs | 50,318 | 64,036 | 32,533 | 37,098 | 21,429 | 45,922 | 39,622 |
| Percent DNA seqs | 81% | 67% | 79% | 69% | 65% | 89% | 72% |
| No. mRNA seqs with | 29,521 | 30,778 | 20,899 | 26,398 | 15,456 | 29,605 | 38,304 |
| Number of archaeal sequences (15,456) | 625 | 49 | 1 | 16 | 4 | 4 | 11 |
| Number of bacterial sequences (15,456) | 13,633 | 11,926 | 13,702 | 8,449 | 14,469 | 15,071 | 14,803 |
Number of genes associated with the general SEED functional categories
| | | | | | | | | |
|---|---|---|---|---|---|---|---|---|
| Amino Acids and Derivatives | 13,515 | 12,346 | 13,913 | 12,089 | 13,279 | 12,517 | 11,966 | 12,074 |
| Carbohydrates | 14,181 | 13,087 | 14,884 | 13,829 | 14,801 | 13,929 | 13,258 | 13,780 |
| Cell Division and Cell Cycle | 2,136 | 2,026 | 2,286 | 2,243 | 2,243 | 2,231 | 2,175 | 2,234 |
| Cell Wall and Capsule | 5,632 | 5,363 | 5,336 | 6,051 | 5,553 | 5,674 | 6,079 | 6,347 |
| Clustering-based subsystems | 18,051 | 17,585 | 19,425 | 19,647 | 19,055 | 19,441 | 20,434 | 19,860 |
| Cofactors, Vitamins, Prosthetic Groups, Pigments | 8,497 | 7,675 | 8,188 | 8,606 | 8,142 | 8,227 | 8,582 | 8,001 |
| DNA Metabolism | 5,461 | 5,331 | 5,191 | 5,559 | 5,321 | 5,717 | 5,824 | 5,855 |
| Fatty Acids and Lipids | 2,165 | 1,919 | 1,883 | 1,891 | 1,955 | 2,025 | 1,960 | 1,934 |
| Macromolecular Synthesis | 148 | 147 | 287 | 163 | 213 | 151 | 136 | 109 |
| Membrane Transport | 2,764 | 2,322 | 2,839 | 2,375 | 2,606 | 2,507 | 2,234 | 2,234 |
| Metabolism of Aromatic Compounds | 1,817 | 1,357 | 1,473 | 1,527 | 1,632 | 1,409 | 1,629 | 1,489 |
| Miscellaneous | 381 | 367 | 448 | 423 | 417 | 446 | 454 | 393 |
| Motility and Chemotaxis | 1,034 | 994 | 879 | 1,227 | 977 | 1,203 | 1,311 | 1,348 |
| Nitrogen Metabolism | 668 | 688 | 587 | 574 | 747 | 718 | 628 | 660 |
| Nucleosides and Nucleotides | 5,152 | 4,820 | 4,701 | 4,578 | 4,836 | 4,752 | 4,639 | 4,706 |
| Phosphorus Metabolism | 1,796 | 1,706 | 1,747 | 1,926 | 1,832 | 1,958 | 2,085 | 1,879 |
| Photosynthesis | 212 | 4,373 | 160 | 1,489 | 127 | 197 | 270 | 203 |
| Potassium metabolism | 648 | 591 | 586 | 631 | 620 | 755 | 838 | 817 |
| Protein Metabolism | 11,912 | 11,717 | 11,254 | 11,534 | 11,473 | 11,597 | 11,210 | 11,715 |
| RNA Metabolism | 5,133 | 4,889 | 4,660 | 4,813 | 4,811 | 4,744 | 5,068 | 4,981 |
| Regulation and Cell signaling | 1,196 | 1,127 | 1,400 | 966 | 1,356 | 1,360 | 1,076 | 1,056 |
| Respiration | 5,298 | 8,480 | 5,455 | 5,570 | 5,432 | 5,579 | 4,926 | 4,994 |
| Secondary Metabolism | 116 | 124 | 63 | 87 | 93 | 83 | 86 | 83 |
| Stress Response | 2,497 | 2,133 | 2,338 | 2,419 | 2,306 | 2,524 | 2,508 | 2,605 |
| Sulfur Metabolism | 1,604 | 1,354 | 1,673 | 1,430 | 1,446 | 1,240 | 1,320 | 1,317 |
| Unclassified | 6,235 | 5,677 | 6,567 | 5,763 | 6,672 | 6,019 | 5,555 | 5,794 |
| Virulence | 4,686 | 4,733 | 4,711 | 5,521 | 4,989 | 5,929 | 6,684 | 6,467 |
Number of transcripts associated with the general SEED functional categories
| | | | | | | | |
|---|---|---|---|---|---|---|---|
| Amino Acids and Derivatives | 261 | 536 | 204 | 198 | 21 | 144 | 443 |
| Carbohydrates | 886 | 1767 | 546 | 1302 | 530 | 1381 | 1256 |
| Cell Division and Cell Cycle | 83 | 191 | 52 | 63 | 96 | 56 | 80 |
| Cell Wall and Capsule | 154 | 353 | 317 | 297 | 153 | 113 | 221 |
| Clustering-based subsystems | 641 | 657 | 294 | 451 | 111 | 157 | 427 |
| Cofactors, Vitamins, Prosthetic Groups, Pigments | 215 | 457 | 130 | 248 | 24 | 13 | 469 |
| DNA Metabolism | 102 | 108 | 83 | 122 | 24 | 26 | 85 |
| Fatty Acids and Lipids | 84 | 28 | 17 | 27 | 0 | 28 | 10 |
| Macromolecular Synthesis | 0 | 0 | 5 | 2 | 2 | 0 | 0 |
| Membrane Transport | 44 | 19 | 237 | 83 | 2673 | 13 | 440 |
| Metabolism of Aromatic Compounds | 47 | 6 | 16 | 4 | 0 | 24 | 14 |
| Miscellaneous | 53 | 80 | 54 | 55 | 672 | 43 | 75 |
| Motility and Chemotaxis | 40 | 10 | 438 | 58 | 3 | 8 | 180 |
| Nitrogen Metabolism | 11 | 0 | 0 | 2 | 9 | 8 | 3 |
| Nucleosides and Nucleotides | 144 | 87 | 42 | 48 | 4 | 13 | 56 |
| Phosphorus Metabolism | 79 | 83 | 64 | 94 | 25 | 18 | 31 |
| Photosynthesis | 67 | 0 | 17 | 2 | 0 | 1 | 0 |
| Potassium metabolism | 29 | 13 | 3 | 13 | 4 | 2 | 7 |
| Protein Metabolism | 439 | 95 | 129 | 625 | 81 | 112 | 172 |
| RNA Metabolism | 1631 | 160 | 1813 | 702 | 907 | 2883 | 874 |
| Regulation and Cell signaling | 65 | 136 | 16 | 354 | 30 | 18 | 41 |
| Respiration | 174 | 20 | 26 | 97 | 125 | 31 | 109 |
| Secondary Metabolism | 18 | 3 | 1 | 0 | 0 | 0 | 1 |
| Stress Response | 100 | 175 | 42 | 229 | 5 | 43 | 56 |
| Sulfur Metabolism | 42 | 18 | 19 | 14 | 13 | 11 | 40 |
| Unclassified | 346 | 58 | 957 | 101 | 10 | 110 | 271 |
| Virulence | 152 | 847 | 385 | 716 | 385 | 651 | 546 |