| Literature DB >> 20428247 |
Tanja Woyke1, Damon Tighe, Konstantinos Mavromatis, Alicia Clum, Alex Copeland, Wendy Schackwitz, Alla Lapidus, Dongying Wu, John P McCutcheon, Bradon R McDonald, Nancy A Moran, James Bristow, Jan-Fang Cheng.
Abstract
While the bulk of the finished microbial genomes sequenced to date are derived from cultured bacterial and archaeal representatives, the vast majority of microorganisms elude current culturing attempts, severely limiting the ability to recover complete or even partial genomes from these environmental species. Single cell genomics is a novel culture-independent approach, which enables access to the genetic material of an individual cell. No single cell genome has to our knowledge been closed and finished to date. Here we report the completed genome from an uncultured single cell of Candidatus Sulcia muelleri DMIN. Digital PCR on single symbiont cells isolated from the bacteriome of the green sharpshooter Draeculacephala minerva bacteriome allowed us to assess that this bacteria is polyploid with genome copies ranging from approximately 200-900 per cell, making it a most suitable target for single cell finishing efforts. For single cell shotgun sequencing, an individual Sulcia cell was isolated and whole genome amplified by multiple displacement amplification (MDA). Sanger-based finishing methods allowed us to close the genome. To verify the correctness of our single cell genome and exclude MDA-derived artifacts, we independently shotgun sequenced and assembled the Sulcia genome from pooled bacteriomes using a metagenomic approach, yielding a nearly identical genome. Four variations we detected appear to be genuine biological differences between the two samples. Comparison of the single cell genome with bacteriome metagenomic sequence data detected two single nucleotide polymorphisms (SNPs), indicating extremely low genetic diversity within a Sulcia population. This study demonstrates the power of single cell genomics to generate a complete, high quality, non-composite reference genome within an environmental sample, which can be used for population genetic analyzes.Entities:
Mesh:
Year: 2010 PMID: 20428247 PMCID: PMC2859065 DOI: 10.1371/journal.pone.0010314
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Sulcia cell isolation and sequence coverage, closure and polishing locations along the Sulcia DMIN single cell genome.
(A) Micromanipulation of the single Sulcia cell from the sharpshooter bacteriome metasample. (B) Sequence coverage including closure and polishing locations along the finished, circular Sulcia DMIN genome with circles corresponding to following features, starting with outermost circle: (1) Illumina sequence coverage ranging from 0–3276 (mean 303+−386), (2) pyrosequence sequence coverage ranging from 0–231 (mean 42+−39), (3) Sanger sequence coverage ranging from 0–30 (mean 10+−7), (4) locations of captured (green) and uncaptured gaps (orange), (5) polishing locations corrected using Illumina (blue) and Sanger (purple) seqeunce, (6) GC content heat map (dark blue to light green = low to high values) and (7) GC skew.
General features of the Candidatus Sulcia muelleri DMIN genome.
|
| |
| Genome size [bp] | 243,933 |
| DNA copding bases [bp] | 237,536 |
| GC content [%] | 22.5 |
|
| |
| Total genes | 261 |
| Protein coding genes | 226 |
| with function prediction | 210 |
| w/o function prediction | 16 |
| Number of rRNA operons | 1 |
| Number of tRNA genes | 31 |
Figure 2Distribution of protein similarities between the two available genomes of Candidatus Sulcia muelleri from sharpshooters.
Pairs of proteins that have low sequence similarity between Candidatus S. muelleri DMIN and Candidatus S. muelleri GWSS.
|
|
| Protein | Sequence similarity [%] |
| DMIN_01680 | SMGWSS_171 | 30S ribosomal subunit protein S18 | 75.34 |
| DMIN_01600 | SMGWSS_162 | putative outer membrane protein | 81.23 |
| DMIN_01330 | SMGWSS_137 | chaperone protein DnaJ | 87.14 |
| DMIN_02500 | SMGWSS_255 | dihydrolipoamide acyltransferase E2 component | 89.05 |
| DMIN_00160 | SMGWSS_017 | putative ATP synthase F1, epsilon subunit | 89.11 |
| DMIN_01860 | SMGWSS_191 | hypothetical protein | 89.71 |
| DMIN_00320 | SMGWSS_033 | highly suspect ATP synthase F1, delta subunit | 92.73 |
| DMIN_02450 | SMGWSS_249 | putative delta-1-pyrroline-5-carboxylate dehydrogenase | 92.74 |
| DMIN_00970 | SMGWSS_101 | chaperone protein DnaJ | 94.13 |
| DMIN_01120 | SMGWSS_116 | putative acetylornithine deacetylase (ArgE)/succinyl-diaminopimelate desuccinylase (DapE) | 94.52 |
| DMIN_00050 | SMGWSS_006 | phenylalanyl-tRNA synthetase beta subunit | 95.04 |
| DMIN_01320 | SMGWSS_136 | 50S ribosomal subunit protein L19 | 95.29 |
| DMIN_01570 | SMGWSS_160 | tryptophanyl-tRNA synthetase | 95.3 |
| DMIN_01250 | SMGWSS_129 | ATP-dependent protease | 95.32 |
| DMIN_01340 | SMGWSS_138 | 1,4-dihydroxy-2-naphthoate octaprenyltransferase | 95.35 |
| DMIN_00300 | SMGWSS_031 | ATP synthase F1, gamma subunit | 95.44 |
| DMIN_02510 | SMGWSS_256 | integral membrane protein TerC family | 95.74 |
| DMIN_01940 | SMGWSS_199 | translation initiation factor IF-2 | 95.79 |
Sequence variation detected between the single cell genome, the single bacteriome derived metagenome (SH, single host) and the pooled bacteriome derived metagenome (MH, multiple hosts).
| Genomic locus | Sequence variation | |
|
| Single cell genome/Metagenome (SH) |
|
| Metagenome (MH) |
| |
|
| Single cell genome |
|
| Metagenome (SH) |
| |
| Metagenome (MH) |
| |
|
| Single cell genome/Metagenome (SH) |
|
| Metagenome (MH) |
| |
|
| Single cell genome/Metagenome (SH) |
|
| Metagenome (MH) |
|
Identified SNPs within the host pool metagenomic sequence reads.
| Genomic locus | Sequence variation | |
|
| Single cell genome |
|
| Metagenome (MH) |
| |
|
| Single cell genome |
|
| Metagenome (MH) |
|