| Literature DB >> 25177534 |
Yan Wei Lim1, Daniel A Cuevas2, Genivaldo Gueiros Z Silva2, Kristen Aguinaldo3, Elizabeth A Dinsdale1, Andreas F Haas1, Mark Hatay1, Savannah E Sanchez1, Linda Wegley-Kelly1, Bas E Dutilh4, Timothy T Harkins5, Clarence C Lee6, Warren Tom5, Stuart A Sandin7, Jennifer E Smith7, Brian Zgliczynski7, Mark J A Vermeij8, Forest Rohwer1, Robert A Edwards9.
Abstract
Genomics and metagenomics have revolutionized our understanding of marine microbial ecology and the importance of microbes in global geochemical cycles. However, the process of DNA sequencing has always been an abstract extension of the research expedition, completed once the samples were returned to the laboratory. During the 2013 Southern Line Islands Research Expedition, we started the first effort to bring next generation sequencing to some of the most remote locations on our planet. We successfully sequenced twenty six marine microbial genomes, and two marine microbial metagenomes using the Ion Torrent PGM platform on the Merchant Yacht Hanse Explorer. Onboard sequence assembly, annotation, and analysis enabled us to investigate the role of the microbes in the coral reef ecology of these islands and atolls. This analysis identified phosphonate as an important phosphorous source for microbes growing in the Line Islands and reinforced the importance of L-serine in marine microbial ecosystems. Sequencing in the field allowed us to propose hypotheses and conduct experiments and further sampling based on the sequences generated. By eliminating the delay between sampling and sequencing, we enhanced the productivity of the research expedition. By overcoming the hurdles associated with sequencing on a boat in the middle of the Pacific Ocean we proved the flexibility of the sequencing, annotation, and analysis pipelines.Entities:
Keywords: Coral reef; Environmental microbiology; Expeditions; Genomics; Metagenomics; Sequencing; Vibrio
Year: 2014 PMID: 25177534 PMCID: PMC4145072 DOI: 10.7717/peerj.520
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Workflow for the preparation of bacterial isolates and water samples for genome and metagenome sequencing.
We started with sampling the coral reefs using supreme supersuckers, and isolates were cultured on different media. Isolates were assayed using multi-phenotype assay plates and sequenced.
Isolate characteristics.
Isolates and metagenomic sample information. The identifier of Vibrio spp. isolates are indicated by “V” and non-Vibrio spp. are indicated by “Z”. Potential genus was predicted based on the genome sequences.
| Identifier | Island | Site (depth) | Potential genus | Culture media | Barcode |
|---|---|---|---|---|---|
| VRT1 | Flint | Tent (10 m) |
| TCBS | 1 |
| VRT2 | Flint | Tent (10 m) |
| TCBS | 2 |
| VRT3 | Flint | Tent (10 m) |
| TCBS | 3 |
| VRT4 | Flint | Tent (10 m) |
| TCBS | 4 |
| VAR1 | Flint | Tent (10 m) |
| TCBS | 5 |
| VAR2 | Flint | Tent (10 m) |
| TCBS | 6 |
| VAR3 | Flint | Tent (10 m) |
| TCBS | 7 |
| VAR4 | Flint | Tent (10 m) |
| TCBS | 8 |
| VRT5B | Flint | Tent–20 m |
| TCBS | 8 |
| VRT11 | Vostok | Supersucker (10 m - Coral) |
| TCBS | 7 |
| VRT14 | Vostok | In-reef (10 m) |
| TCBS | 10 |
| VRT18 | Starbuck | Ambient water (10 m) |
| TCBS | 6 |
| VRT22 | Starbuck | In-reef (10 m) |
| TCBS | 3 |
| VRT23 | Starbuck | In-reef (10 m) |
| TCBS | 5 (4) |
| VRT25 | Starbuck | Black-reef water (10 m) |
| TCBS | 4 |
| VRT30 | Starbuck | Black-reef (surface) |
| TCBS | 9 |
| VRT35 | Malden | Supersucker (10 m - Coral) |
| TCBS | 11 |
| VRT37 | Malden | Supersucker (10 m - Algae) |
| TCBS | 12 |
| VRT38 | Millenium | Supersucker (10 m - Coral) |
| TCBS | 13 |
| VRT41 | Millenium | Supersucker (10 m - Algae) |
| TCBS | 14 |
| ZRT1 | Flint | Tent (10 m) |
| Zobell | 9 (3) |
| ZRT3 | Flint | Tent (10 m) |
| Zobell | 11 |
| ZAR1 | Flint | Tent (10 m) |
| Zobell | 13 (1) |
| ZAR2 | Flint | Tent (10 m) |
| Zobell | 14 (2) |
| ZRT28 | Malden | Supersucker (10 m - Coral) |
| Marine | 16 |
| ZRT32 | Malden | Supersucker (10 m - Algae) |
| Marine | 15 |
| Starbuck | Tent (10 m) | Mix | N/A | 1 | |
| Starbuck | Black-reef water (10 m) | Mix | N/A | 2 |
Notes.
A new library was made and different barcode was used during the second library preparation.
Figure 2A field guide in setting up sequencing workflow, specifically on a moving ship.
(A) Molecular bench for procedure including DNA extraction and library preparation was set-up in a clean room, which also serves as one of the bedrooms of scientists on-board. (B) Emulsion PCR was performed using the One-Touch technology located at the laundry room close to the hull of the ship. Damaged touch screen was replaced by re-wiring the display onto a laptop. (C) The sequencer was run at the owner’s cabin where there is an easy and safe access to the nitrogen tank that fuels the microfluidics of the sequencing technology. (D) A modified version of Ion Torrent server was set up to run the data analysis.
Sequencing data.
Library characteristics of the genomes and metagenomes.
| Identifier | Total reads | Total bases (bp) | Contigs (>1 kbp) | Longest contig (bp) | Size (Mbp) |
|---|---|---|---|---|---|
| VRT1 | 370,046 | 67,150,180 | 784 | 46,040 | 4.97 |
| VRT2 | 873,103 | 163,538,506 | 320 | 169,085 | 5.73 |
| VRT3 | 437,353 | 81,088,282 | 771 | 54,972 | 5.61 |
| VRT4 | 432,329 | 80,084,392 | 626 | 45,526 | 5.10 |
| VAR1 | 305,860 | 55,268,349 | 1,549 | 20,978 | 5.58 |
| VAR2 | 651,865 | 114,981,090 | 463 | 67,906 | 4.95 |
| VAR3 | 553,761 | 102,526,008 | 2,715 | 15,535 | 5.71 |
| VAR4 | 415,376 | 71,395,671 | 1,386 | 31,755 | 5.65 |
| VRT5B | 446,202 | 68,187,702 | 1,624 | 28,687 | 4.76 |
| VRT11 | 296,834 | 50,764,788 | 1,219 | 23,083 | 4.45 |
| VRT14 | 591,693 | 105,091,668 | 612 | 94,938 | 5.52 |
| VRT18 | 407,884 | 71,662,502 | 1,330 | 43,832 | 5.66 |
| VRT22 | 500,878 | 79,757,251 | 1,506 | 27,655 | 5.53 |
| VRT23 | 1,913,266 | 321,123,402 | 34 | 994,789 | 5.62 |
| VRT25 | 237,978 | 35,973,495 | 1,655 | 7,836 | 2.88 |
| VRT30 | 1,135,838 | 195,189,151 | 1,872 | 34,535 | 5.5 |
| VRT35 | 5,074,872 | 836,924,832 | 1,227 | 157,763 | 12.27 |
| VRT37 | 3,505,390 | 602,639,979 | 191 | 172,237 | 5.16 |
| VRT38 | 4,739,179 | 852,206,121 | 47 | 742,807 | 5.97 |
| VRT41 | 5,189,926 | 807,748,928 | 40 | 1,680,777 | 5.76 |
| ZRT1 | 5,927,108 | 885,069,802 | 104 | 413,587 | 5.78 |
| ZRT3 | 515,406 | 76,920,334 | 1,039 | 29,051 | 5.07 |
| ZAR1 | 3,127,071 | 289,378,345 | 423 | 111,609 | 5.30 |
| ZAR2 | 2,975,125 | 269,722,069 | 2,380 | 50,061 | 7.18 |
| ZRT28 | 6,114,713 | 1,110,443,467 | 113 | 263,705 | 5.16 |
| ZRT32 | 4,932,760 | 903,726,978 | 900 | 214,396 | 6.28 |
| 5,381,011 | 665,058,375 | – | – | – | |
| 186,516 | 26,325,947 | – | – | – | |
|
| 9,093,303,263 | ||||
Notes.
Libraries sequenced by Life Technologies after the expedition.
These libraries were under-sequenced during the expedition. Additional sequence data was provided by Life Technologies.
Accession numbers.
Data accession numbers for the sequences. All sequences are in NCBI bioproject PRJNA253472.
| Sample | RAST | NCBI | NCBI | NCBI locus | NCBI SRA | NCBI SRA | NCBI SRA |
|---|---|---|---|---|---|---|---|
| SLI-3.1 |
| 408172 | – |
|
|
| |
| SLI-3.2 |
| 408172 | – |
|
|
| |
| VAR1 | 163649 |
| 1515472 | HQ00 |
|
|
|
| VAR2 | 163650 |
| 1515473 | HQ01 |
|
|
|
| VAR3 | 163651 |
| 1515474 | HQ02 |
|
|
|
| VAR4 | 163652 |
| 1515475 | HQ03 |
|
|
|
| VRT1 | 165046 |
| 1519375 | HM87 |
|
|
|
| VRT11 | 163654 |
| 1515495 | HQ22 |
|
|
|
| VRT14 | 163656 |
| 1515496 | HQ23 |
|
|
|
| VRT18 | 163657 |
| 1515490 | HQ17 |
|
|
|
| VRT2 | 163646 |
| 1515476 | HQ04 |
|
|
|
| VRT22 | 163658 |
| 1515491 | HQ18 |
|
|
|
| VRT23 | 163659 |
| 1515492 | HQ19 |
|
|
|
| VRT25 | 163660 |
| 1515493 | HQ20 |
|
|
|
| VRT3 | 163647 |
| 1515477 | HQ05 |
|
|
|
| VRT30 | 163661 |
| 1515494 | HQ21 |
|
|
|
| VRT35 | 163662 |
| 1515484 | HQ11 |
|
|
|
| VRT37 | 163663 |
| 1515485 | HQ12 |
|
|
|
| VRT38 | 163664 |
| 1515488 | HQ15 |
|
|
|
| VRT4 | 163648 |
| 1515478 | HQ06 |
|
|
|
| VRT41 | 163665 |
| 1515489 | HQ16 |
|
|
|
| VRT5B | 163653 |
| 1515479 | HQ07 |
|
|
|
| ZAR1 | 163669 |
| 1515480 | HQ08 |
|
|
|
| ZAR2 | 163670 |
| 1515481 | HR57 |
|
|
|
| ZRT1 | 163666 |
| 1515482 | HQ09 |
|
|
|
| ZRT28 | 163671 |
| 1515486 | HQ13 |
|
|
|
| ZRT3 | 163667 |
| 1515483 | HQ10 |
|
|
|
| ZRT32 | 163672 |
| 1515487 | HQ14 |
|
|
|
Figure 3Heatmap comparing all genomes sequenced to Line Islands metagenomes.
The genomes, along the horizontal axis, are clustered by genera. The metagenomes on the vertical axis are organized by the island from which the samples were isolated. Solid gray bars on the top indicate the assembled genome size (bp), and gray bars on the right indicate metagenome size (bp). The cells are colored by distance calculated as described in the text (Eq. (1)) and as shown in the legend on the lower left.
Figure 4Distance tree comparing isolates based on genotypes and phenotypes.
(A) Distance tree based on the growth of isolates on 48 different carbon sources, using the optical density measurement at 40 h. (B–D) Distance tree based on the 16S rRNA (B), RecA (C), and RpoB (D) genes extracted from each genome.