| Literature DB >> 28798737 |
Charu Tripathi1, Harshita Mishra1, Himani Khurana1, Vatsala Dwivedi1, Komal Kamra2, Ram K Negi1, Rup Lal1.
Abstract
Thermophilic environments represent an interesting niche. Among thermophiles, the genus Thermus is among the most studied genera. In this study, we have sequenced the genome of Thermus parvatiensis strain RL, a thermophile isolated from Himalayan hot water springs (temperature >96°C) using PacBio RSII SMRT technique. The small genome (2.01 Mbp) comprises a chromosome (1.87 Mbp) and a plasmid (143 Kbp), designated in this study as pTP143. Annotation revealed a high number of repair genes, a squeezed genome but containing highly plastic plasmid with transposases, integrases, mobile elements and hypothetical proteins (44%). We performed a comparative genomic study of the group Thermus with an aim of analysing the phylogenetic relatedness as well as niche specific attributes prevalent among the group. We compared the reference genome RL with 16 Thermus genomes to assess their phylogenetic relationships based on 16S rRNA gene sequences, average nucleotide identity (ANI), conserved marker genes (31 and 400), pan genome and tetranucleotide frequency. The core genome of the analyzed genomes contained 1,177 core genes and many singleton genes were detected in individual genomes, reflecting a conserved core but adaptive pan repertoire. We demonstrated the presence of metagenomic islands (chromosome:5, plasmid:5) by recruiting raw metagenomic data (from the same niche) against the genomic replicons of T. parvatiensis. We also dissected the CRISPR loci wide all genomes and found widespread presence of this system across Thermus genomes. Additionally, we performed a comparative analysis of competence loci wide Thermus genomes and found evidence for recent horizontal acquisition of the locus and continued dispersal among members reflecting that natural competence is a beneficial survival trait among Thermus members and its acquisition depicts unending evolution in order to accomplish optimal fitness.Entities:
Keywords: CRISPR; Pilus genes; Thermus parvatiensis; natural transformation; phage resistance; thermophiles
Year: 2017 PMID: 28798737 PMCID: PMC5529391 DOI: 10.3389/fmicb.2017.01410
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
General genome features of organisms belonging to the genus Thermus.
| RL | Hot spring, India | pTP143 (143,277) | 68.5 | 2,383 | 54 | 2 | |||
| HB27 | Hot spring, Japan | 2,127,482 | pTT27 (232,605) | 2,244 | 47 | 6 | |||
| HB8 | Hot spring, Japan | 2,197,207 | pTT27 (256,992), pTT8 (9,322), pVV8 (81,151) | 2,268 | 48 | 6 | |||
| JL-18 | Great Boiling Spring, USA | 2,311,212 | pTTJL1801 (265,886), pTTJL1802 (142,731) | 69.0 | 2,424 | 52 | 6 | ||
| SG0.5JP17-16 | Hot Spring | 2,303,227 | pTHTHE1601 ( | 68.6 | 2,405 | 53 | 6 | ||
| SA-01 | Fissure water, South Africa | 2,355,186 | pTSC8 ( | 64.9 | 2,514 | 47 | 6 | ||
| JL-2 | Great Boiling Spring, USA | 2,401,329 | pTHEOS01 (271,713), pTHEOS02 (57,223) | 68.6 | 2,521 | 59 | 6 | ||
| CCB_US3_UF1 | Hot spring, Malaysia | 2,263,488 | pTCCB09 (19,716) | 68.6 | 2,228 | 48 | 6 | ||
| Y51MC23 | Hot spring, USA | 2,338,641 | pTA14 (14,448), pTA16 (16,597), pTA69 (69,906), pTA78 (78,727) | 68.0 | 2,436 | 55 | 3 | ||
| GE-1 | Kamchatka, Russia | 2,388,273 | pTB1 (342,792), pTB2 (10,299) | 66.9 | 2,789 | 47 | 2 | ||
| DSM 12462 (HN3-7) | Hot spring, Iceland | 2,163,625 | ND | 2,321 | 47 | 4 | |||
| ATCC 43280 (Wai33 A1) | Hot spring, New Zealand | 2,386,081 | ND | 69.0 | 2,338 | 47 | 6 | ||
| DSM 21543 (PRI 3838) | Hot spring, Iceland | 2,263,010 | ND | 68.4 | 2,470 | 47 | 6 | ||
| ATCC 700962 (RF-4) | Hot spring, Iceland | 2,225,983 | ND | 68.8 | 2,379 | 43 | 6 | ||
| YIM 77777 | Hot spring, China | 2,218,114 | ND | 67.2 | 2,327 | 50 | 3 | ||
| YIM 77409 | Hot spring, China | 2,160,855 | ND | 67.4 | 2,313 | 48 | 6 | ||
| YIM 77401 | Geothermally heated soil, China | ND | 66.4 | 2,750 | 47 | 2 |
Smallest genome/plasmid sequences and GC content are underlined whereas the largest are in bold.
Complete genome; ND-Not determined.
Figure 1Replicon maps of Thermus parvatiensis strain RL (A) ORFs on the chromosome have been mapped on both strands, the origin of replication is marked with a red arrow. From outside to inside: genes on the negative strand, genes on the positive strand, GC percentage and GC skew. (B) Detailed map of T. parvatiensis megaplasmid pTP143 marked with prominent categories of genes in different colors. Genes representing DNA repair genes (green), mobile element genes (red), transcriptional regulators (magenta) and gene clusters (purple; denitrification gene cluster, cobalamin biosynthetic gene cluster, carotenoid synthesis gene cluster) have been specifically highlighted.
Genomic features of the chromosome and plasmid of T. parvatiensis strain RL.
| Accession number | ||
| Size (bp) | 1,872,821 | 143,277 |
| G+C content (%) | 68.5 | 68.4 |
| CDS | 2326 | 57 |
| Coding density (%) | 94.12 | 87.65 |
| tRNAs | 54 | 0 |
| rRNA operons | 2 | 0 |
Figure 2Inference of evolutionary relationships among Thermus spp. based on phylogenetic and phylogenomic methods. Phylogenetic analysis based on (A) 16S rRNA gene sequences; (B) 31 single copy genes; and (C) 400 conserved bacterial marker genes, of the species under study using maximum likelihood method. Bars represent the number of substitutions per nucleotide position. Percentage bootstrap values (≥70%) are shown next to the nodes. Phylogenomic dendrograms showing hierarchical clustering of species under study constructed using (D) whole genome distance matrix based on ANI scores; (E) tetranucleotide frequencies; and (F) pan genes presence-absence matrix. Gradation of colors from black to yellow in (D) depicts increasing genome distance on the basis two-way ANI matrix. Black denotes minimum distance and yellow denotes maximum distance. Organisms have been grouped together into clades on the basis of minimum distance (black). Blue shade (in A,B) depicts clustering of T. parvatiensis within T. thermophilus group. Light brown shade (in C–F) depicts the position of T. parvatiensis separately from other closely clustered Thermus members (shaded gray in C–F).
Figure 3Organizational (synteny) comparisons of T. parvatiensis with nine representatives of Thermus. Only members with complete genomes have been considered for this analysis. Synteny maps showing (A) comparison of the chromosome of T. parvatiensis against the chromosomes of other genomes; and (B) comparison of plasmid pTP143 against plasmid sequences (concatenated wherever number of plasmids >1) of other Thermus members. Boxes of different colors represent locally collinear blocks (LCBs) (or locally conserved regions) connected via lines of the same color to their corresponding positions on other genomes. For each genome, the LCBs above and below the reference line (indicated by black triangle) denote the orientation of the LCBs with respect to the reference sequence (LCBs below the black reference line denote inversions). Black lines (in B) below LCBs represent the position of coding sequences. Red arrows on plasmid pTP143 (in B) mark the regions that could not be mapped on other Thermus plasmids.
Figure 4(A) Detailed map of plasmid pTP143 with genes assigned to COG categories and depicted in a color-coded manner. Rings from outside to inside represent positions of genomic islands (GI) (red); coding sequences (brown); GC content (black); GC skew (purple) and COG category assignment (multiple colors according to the color key) (B) GIs found on the megaplasmid pTP143 (linear view) have been highlighted. These regions contain hypothetical protein coding genes, some plasmid specific genes, a toxin-antitoxin system and repair genes. Most of the genes on the GIs were placed into COG category “L” (replication and repair).
Figure 5Core and pan genome analysis of Thermus genus (A) Plot of pan and core genome. The plot represents a stabilized core structure but an open pan-genome; (B) Graph showing the number of core genes, accessory genes, unique genes and exclusively absent genes in all genomes under study.
Figure 6Depiction of metagenomic islands recovered by recruitment of raw reads obtained from the metagenomic sequencing of hot spring water (Manikaran, India) onto T. parvatiensis plasmid (A) and chromosome (B). MGIs on the plasmid pTP143 (5 MGIs) are highlighted in gray; MGIs on the chromosome of T. parvatiensis (5 MGIs) are marked with arrows. Reads mapped to the reference (identity ≥ 80%) are represented as blue (MNW1) and red (MNW2) dots.
Summary of CRISPR elements found across all Thermus genomes under this study.
| 1 | 37 | 9 | Other | ||
| 9 | 32-37 | 6+3+15+7+7+6+6+13+9 = 72 | I, III, IV | ||
| 9 | 29-37 | 14+4+3+12+9+12+23+20+12 = 109 | I, III | ||
| 6 | 29-37 | 21+23+7+25+19+5 = 100 | I | ||
| 6 | 28-37 | 3+13+6+10+6+18 = 56 | I | ||
| 3 | 29-30 | 9+36+42 = 87 | I | ||
| 5 | 29-36 | 14+6+88+30+4 = 134 | I, III, IV | ||
| 7 | 28-36 | 3+17+14+23+18+9+12 = 96 | I, III | ||
| 7 | 26-36 | 7+5+22+3+6+12+3 = 58 | I, III | ||
| 8 | 29-37 | 9+14+7+10+12+8+25+27 = 112 | I, III | ||
| 0 | 0 | 0 | — | — | |
| 9 | 25-37 | 3+24+4+5+6+22+5+12+7 = 88 | I, III, IV | ||
| 0 | 0 | 0 | I, III | ||
| 9 | 28-36 | 5+23+3+11+3+4+15+17+28 = 109 | I, III | ||
| 4 | 35-36 | 21+18+9+10 = 58 | I, III | ||
| 5 | 29-36 | 15+5+9+6+18 = 53 | I, III | ||
| 4 | 30-37 | 67+3+12+9 = 91 | I, III |
Figure 7CRISPR repeat (A) and spacer (B) analyses. (A) Dendrogram depicting clustering of CRISPR array repeats from all genomes of Thermus spp. against CRISPR repeats database (4,719 consensus repeats). The classification of repeats is based upon the division of all repeats into 24 conserved sequence families and 18 conserved structural motifs. The position of branches representing repeats from Thermus spp. is marked by red arrows and depicted by a number which corresponds to a genome, as follows: 1. T. aquaticus, 2. T. amyloliquefaciens, 3. T. brockianus, 4. T. caliditerrae, 5. Thermus sp. CCB_US3_UF1, 6. T. filiformis, 7. T. thermophilus HB8, 8. T. thermophilus HB27, 9. T. igniterrae, 10. T. thermophilus JL-18, 11. T. oshimai, 12. T. scotoductus, 13. T. tengchongensis, 14. T. parvatiensis, 15. T. thermophilus SG0.5JP17-16. Concentric circles are color coded and represent the following from inside out respectively: Structure motifs, sequence families, CAS subtypes, taxonomic groups, superclasses. Black and blue color branches represent bacterial and archaeal sequences respectively. (B) Pie diagram representing the relative abundance of viral families infecting Thermus genomes, predicted on the basis of CRISPR spacer analysis.
Figure 8Organization of putatively horizontally transferred PilA-ComZ locus genes from all Thermus genomes in this study. Blue, Pilus family proteins; Gray, Nucleases; Tan, Hypothetical/Conserved proteins; Maroon, Transposase/Conjugative element; Rhomboid, unformed CDS; Navy blue, Toxin-antitoxin elements; Black triangles, tRNAs.