| Literature DB >> 35573199 |
Yuguang Liu1,2,3, Patricio Jeraldo1,2, William Herbert2,4, Samantha McDonough5, Bruce Eckloff5, Dirk Schulze-Makuch6,7,8,9, Jean-Pierre de Vera10, Charles Cockell11, Thomas Leya12, Mickael Baqué13, Jin Jen5, Marina Walther-Antonio1,2,14.
Abstract
The Nostoc sp. strain CCCryo 231-06 is a cyanobacterial strain capable of surviving under extreme conditions and thus is of great interest for the astrobiology community. The knowledge of its complete genome sequence would serve as a guide for further studies. However, a major concern has been placed on the effects of contamination on the quality of sequencing data without a reference genome. Here, we report the use of microfluidic technology combined with single cell sequencing and de novo assembly to minimize the contamination and recover the complete genome of the Nostoc strain CCCryo 231-06 with high quality. 100% of the whole genome was recovered with all contaminants removed and a strongly supported phylogenetic tree. The data reported can be useful for comparative genomics for phylogenetic and taxonomic studies. The method used in this work can be applied to studies that require high-quality assemblies of genomes of unknown microorganisms.Entities:
Keywords: Astrobiology; Microbial genomics; Microbiology; Space sciences
Year: 2022 PMID: 35573199 PMCID: PMC9095746 DOI: 10.1016/j.isci.2022.104291
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Figure 1Optofluidic platform overview
(A) An optofluidic platform consists of a microscope, laser tweezers and a microfluidic device consists of 4 identical reaction blocks for high-throughput SC-WGA.
(B) The use of laser tweezers to trap single cells of Nostoc strain CCCryo 231-06. Laser trap A_1 was turned on and trapped a single cell in the sample suspension; laser trap B_1 was turned off.
Figure 2An overview of the workflow of SC-WGS for Nostoc strain CCCryo 231-06
A list of samples of the Nostoc sp. strain CCCryo 231-06 exposed to different conditions and the number of single cells sequenced
| Sample No. | Sample name | Exposure condition | Number of single cells | Cultured at CCCryo |
|---|---|---|---|---|
| 1 | 231-06 Control | 4 | 231-06 | |
| 2 | 1-1-t-05: 231-06, BG11 | Flight, BIOMEX, UV, space | 4 | 231-06_C1 |
| 3 | 1-1-t-06: 231-06, Lunar | 4 | 231-06_D1 | |
| 4 | 1-1-b-05: 231-06, BG11 | Flight, BIOMEX, dark, space | 4 | 231-06_C2 |
| 5 | 1-1-b-06: 231-06, Lunar | 4 | 231-06_D2 | |
| 6 | 2-1-t-05: 231-06, S-MRS | Flight, BIOMEX, UV, Mars | 4 | 231-06_C4 |
| 7 | 2-1-t-06: 231-06, P-MRS | 3 | 231-06_D4 | |
| 8 | 2-1-b-05: 231-06, S-MRS | Flight, BIOMEX, dark, Mars | 4 | 231-06_C5 |
| 9 | 2-1-b-06: 231-06, P-MRS | 4 | 231-06_D5 | |
| 10 | 1-1-t-05: 231-06, BG11 | Ground, BIOMEX, UV, space | 4 | - |
| 11 | 1-1-t-06: 231-06, Lunar | 4 | - | |
| 12 | 2-1-t-05: 231-06, S-MRS | Ground, BIOMEX, UV, Mars | 4 | - |
| 13 | 2-1-t-06: 231-06, P-MRS | 1 | - |
CCCryo database is available at
Figure 3Quality control of the sequencing data
(A) Raw data reads R1-based sequence quality.
(B) Filtered reads R1-based sequence quality.
(C) Raw data reads R2-based sequence quality.
(D) Filtered reads R2-based sequence quality.
Sequencing data filtering quality control
| Before filtering | After filtering | |
|---|---|---|
| Mean length | 151 bp | 137 bp |
| Total reads | 454.38 M | 447.57 M |
| Total bases | 68.61 G | 61.38 G |
| Q20 bases | 96.53% | 97.05% |
| Q30 bases | 91.67% | 92.39% |
Nostoc sp. CCCryo 231-06 genome statistics
| Attribute | Value | % of total |
|---|---|---|
| Genome size (bp) | 7,079,046 | |
| DNA G+C (bp) | 41.75 | |
| Total contigs | 721 | |
| Contig N50 length (bp) | 22,738 | |
| Contig L50 count | 94 | |
| Estimated essential completeness (%) | 100 | |
| Estimated contig contamination (%) | 0 | |
| Total genes | 7,939 | |
| Protein encoding genes | 7,891 | 99.4% |
| rRNA gnes | 4 | 0.0005% |
| tRNA genes | 44 | 0.0055% |
| Genes with function prediction | 4,566 | 57.5% |
| Genes assigned to COGs | 4,911 | 62.9% |
Figure 4Representation of metagenome contigs for manual binning
Highlighted is the bin for the Nostoc strain CCCryo 231-06. Other organisms are present in the sequence data, being members of the culture or environmental contaminants.
Figure 5A COG usage bar chart shows the percent of predicted genes of Nostoc strain CCCryo 231-06 and four close relatives
No significant differences in gene category usage were apparent among the genomes of these species.
Figure 6A bar chart shows contaminants detected within amplified Nostoc sp. CCCryo 231-06 samples
Figure 7A protein-based phylogenetic tree highlighting the position of the Nostoc strain CCCryo 231-06 relative to the neighboring species and other members of Nostoc. Cylindrospermum spp., Leptolyngbya spp., and Phormidium spp. are the outgroup genomes
Bootstrap values are shown at each subtree.
Figure 8Two putative prophages identified in the genome of Nostoc strain CCCryo 231-06
(A) A prophage similar to a putative prophage belonging to N. punctiforme PCC 73102 (91% BLAST identity).
(B) A prophage similar to a putative prohage belonging to N. commune HK-02 (93% BLAST identity).
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| The antarctic strain CCCryo 231-06 (= UTEX EE21; CCMEE 391) of the cyanobacterium | Biobank CCCryo at Branch Bioanalytics and Bioprocesses of the Fraunhofer Institute for Cell Therapy and Immunology (IZI-BB), Potsdam, Germany | CCCryo 231-06 |
| Pluronic F127 | Sigma Aldrich | Cat# P2443; CAS: 9003-11-6 |
| PBS | Fisher Scientific | Cat# BP243820 |
| PDMS | Momentive | RTV615 |
| UltraPure™ 0.5M EDTA, pH 8.0 | ThermoFisher | Cat# 15575020 |
| Ready-Lyse™ Lysozyme Solution | Lucigen | Cat# R1804M |
| Dithiothreitol (DTT) | Bio-rad | Cat# 1610611 |
| Low-EDTA TE (1X), pH 8.0 | VWR | Cat# 10128-588 |
| NEBNext® Multiplex Oligos for Illumina® | New England Biolabs | Cat# E7335S |
| SPRIselect Beads | Beckman Coulter | Cat# B23318 |
| Genome sequence | This paper | NCBI, accession# |
| The antarctic strain CCCryo 231-06 (= UTEX EE21; CCMEE 391) of the cyanobacterium | Biobank CCCryo at Branch Bioanalytics and Bioprocesses of the Fraunhofer Institute for Cell Therapy and Immunology (IZI-BB), Potsdam, Germany | CCCryo 231-06 |
| Samtools | ||
| BioBloomTools | ||
| Atropos | ||
| BBTools | ||
| MEGAHIT | ||
| BusyBee | ||
| CheckM | ||
| GTDBtk tool | ||
| Prokka | ||
| eggNOG mapper | ||
| PATRIC | ||
| RAxML | ||
| Dendroscope 3 | ||
| PHASTER | ||
| REPLI-g Single Cell Kit | Qiagen | Cat# 150,343 |
| NEBNext Ultra II DNA Library Prep Kit for Illumina | New England Biolabs | Cat# E7103 |
| HiSeq 3000/4000 PE Cluster Kit | Illumina | Cat# PE-410-1001 |
| HiSeq 3000/4000 SBS Kit (300 cycles) | Illumina | Cat# FC-410-1003 |