| Literature DB >> 31178855 |
Danyu Hu1,2, Yang Zang1,2, Yingjin Mao1,2, Beile Gao1.
Abstract
The class Thermoleophilia is one of the deep-rooting lineages within the Actinobacteria phylum and metagenomic investigation of microbial diversity suggested that species associated with the class Thermoleophilia are abundant in hot spring and soil samples. However, very few species of this class have been cultivated and characterized. Our understanding of the phylogeny and taxonomy of Thermoleophilia is solely based on 16S rRNA sequence analysis of limited cultivable representatives, but no other phenotypic or genotypic characteristics are known that can clearly discriminate members of this class from the other taxonomic units within the kingdom bacteria. This study reports phylogenomic analysis for 12 sequenced members of this class and clearly resolves the interrelationship of not yet cultivated species with reconstructed genomes and known type species. Comparative genome analysis discovered 12 CSIs in different proteins and 32 CSPs that are specific to all species of this class. In addition, a large number of CSIs or CSPs were identified to be unique to certain lineages within this class. This study represents the first and most comprehensive phylogenetic analysis of the class Thermoleophilia, and the identified CSIs and CSPs provide valuable molecular markers for the identification and delineation of species belonging to this class or its subordinate taxa.Entities:
Keywords: Thermoleophilia; conserved signature indels; conserved signature proteins; molecular signatures; phylogeny
Year: 2019 PMID: 31178855 PMCID: PMC6544083 DOI: 10.3389/fmicb.2019.01185
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Phylogenetic analysis of the class Thermoleophilia. (A) Maximum-likelihood tree for Thermoleophilia species based upon concatenated sequences of 54 conserved proteins. (B) Maximum-likelihood tree based on full length 16S rRNA gene sequences of all type species within the class Thermoleophilia. Bootstrap values (%) are shown at each node and different clusters that are consistently observed in both phylogenetic trees are marked.
Characteristic of Conserved Signature Indels specific to the class Thermoleophilia or its associated taxa.
| Protein name | GI no.a | Figure number | Indel size | Indel positionb | Specificity |
|---|---|---|---|---|---|
| Quinolinate synthase NadA | 1225101978 | 4aa insc | 138–180 | All | |
| 30S ribosomal protein S10 | 1093219170 | 1aa ins | 72–105 | All | |
| Glutamate-1-semialdehyde-2,1-aminomutase | 1225102988 | 2aa del | 172–209 | All | |
| D-tyrosyl-tRNA(Tyr) deacylase | 1225105696 | 6aa del | 100–135 | All | |
| Vitamin B12-dependent ribonucleotide reductase | 1225104123 | 1aa ins | 746–793 | All | |
| DNA-directed RNA polymerase subunit beta | 1225103324 | 2aa ins | 215–256 | All | |
| PspA/IM30 family protein | 654611971 | 3aa del | 184–227 | All | |
| Glutamine-hydrolyzing GMP synthase | 1225105599 | 1aa ins | 406–450 | All | |
| Elongation factor P | 1225104642 | 1aa ins | 127–176 | All | |
| Replicative DNA helicase | 1225103017 | 2aa ins | 15–55 | All | |
| Phenylalanine–tRNA ligase subunit alpha | 654610443 | 2–10aa ins | 244–285 | All | |
| DNA polymerase III alpha subunit | 1225105080 | 1aa ins | 84–128 | All | |
| Arginine–tRNA ligase | 1225101858 | 7aa ins | 314–367 | ||
| LytR family transcriptional regulator | 1225102507 | 2aa ins | 155–190 | ||
| DNA gyrase subunit A | 1225102941 | 8aa ins | 250–298 | ||
| Chaperonin GroEL | 1225103134 | 3aa ins | 459–497 | ||
| Short chain dehydrogenase | 1225103641 | 2aa ins | 222–264 | ||
| Type II secretion system F family protein | 1225104607 | 1aa ins | 299–342 | ||
| Leucyl-tRNA synthetase | 1093217654 | 1aa ins | 429–469 | ||
| NADH-quinone oxidoreductase subunit B | 551309834 | 1aa del | 137–181 | ||
| 4-hydroxy-3-methylbut-2-enyl diphosphate reductase | 739551922 | 1aa ins | 44–91 | ||
| Pyruvate kinase | 652636441 | 5aa del | 189–227 | ||
| tRNA guanosine (34) transglycosylase Tgt | 654594575 | 1aa ins | 312–357 | ||
| Excinuclease ABC subunit UvrB | 654612298 | 1aa ins | 215–263 | ||
| Transcription antitermination factor NusB | 494847549 | 6aa ins | 62–102 | ||
| Thioredoxin-disulfide reductase | 916615184 | 1aa ins | 40–82 | ||
| Trigger factor | 917589205 | 5aa ins | 169–217 | ||
| 1aa ins | 215–255 | ||||
| Glutamate-5-semialdehyde dehydrogenase | 652642436 | 5aa del | 150–196 | ||
| Glutamine amidotransferase | 654598081 | 3aa ins | 170–211 | ||
| 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase subunit CofH | 654594367 | 4aa del | 152–192 | ||
| methionine–tRNA ligase | 654600348 | 5aa ins | 267–310 | ||
| Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatC | 654597239 | 1aa ins | 20–65 | ||
| CTP synthase | 921290543 | 2aa ins | 264–308 | ||
| DNA-directed RNA polymerase subunit beta’ | 494853285 | 8aa ins | 376–420 | ||
| SDR family NAD(P)-dependent oxidoreductase | 494848053 | 2aa ins | 149–198 | ||
| Dihydrolipoyl dehydrogenase | 551307243 | 1aa del | 355–396 | ||
| Methylmalonyl-CoA epimerase | 551310266 | 2aa ins | 1–48 | ||
| Acetyl-CoA carboxylase biotin carboxylase subunit | 551309981 | 2aa ins | 224–268 | ||
| GTPase HflX | 1225104795 | 1aa ins | 282–322 | ||
| 1-deoxy-D-xylulose-5-phosphate reductoisomerase | 551310630 | 6–8aa ins | 146–188 | ||
| Tryptophan–tRNA ligase | 494851195 | 4–12aa ins | 152–191 | ||
| Endopeptidase La | 551309049 | 1aa ins | 228–266 | ||
| 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase subunit CofH | 494847285 | 4aa ins | 481–522 | ||
| NADH-quinone oxidoreductase subunit I | 1113228917 | 1aa ins | 72–125 | New cluster | |
| Adenylosuccinate synthase | 1113229450 | 17–23aa ins | 154–204 | S.67-14 and S.70-9d | |
| GTPase Era | 1113226493 | 1–2aa ins | 38–88 | S.67-14 and S.70-9 | |
| Heme-copper oxidase subunit III | 1113215223 | 1–4aa ins | 121–167 | S.67-14 and S.70-9 |
FIGURE 2CSI specific to all Thermoleophilia species. Partial sequence alignment of the protein quinolinate synthase NadA showing a 4 amino acid insertion in a conserved region that is specific for members of the class Thermoleophilia. The dashes in this alignment as well as all other alignments indicate identity with the amino acid on the top line. The GenBank identification numbers of the protein sequences are shown, and the topmost numbers indicate the position of this sequence in the species shown on the top line.
Conserved Signature Proteins that are uniquely found in the Thermoleophilia class.
| Protein product | Length | Specificity | Function |
|---|---|---|---|
| WP_093115104.1 | 242 | All | Unknown |
| WP_093115134.1 | 90 | Unknown | |
| WP_093115673.1 | 127 | All | Unknown |
| WP_093115681.1 | 103 | All | Unknown |
| WP_093115745.1 | 166 | All | Unknown |
| WP_093115827.1 | 993 | All | Unknown |
| WP_093116216.1 | 151 | All | Unknown |
| WP_093116230.1 | 213 | All | Unknown |
| WP_093116634.1 | 159 | All | Unknown |
| WP_093116636.1b | 64 | All | Unknown |
| WP_093116642.1 | 114 | All | Unknown |
| WP_093116769.1 | 130 | All | Unknown |
| WP_093116819.1 | 167 | All | Unknown |
| WP_093116917.1 | 120 | All | Unknown |
| WP_093116997.1 | 185 | All | Unknown |
| WP_093117023.1 | 151 | All | Unknown |
| WP_093117047.1 | 572 | All | Unknown |
| WP_093117060.1 | 247 | All | Unknown |
| WP_093117260.1 | 72 | All | Unknown |
| WP_093117458.1b | 142 | All | Unknown |
| WP_093117523.1 | 269 | All | Unknown |
| WP_093118104.1 | 79 | All | Unknown |
| WP_093118304.1b | 132 | All | Unknown |
| WP_093118364.1b | 257 | All | Unknown |
| WP_093118537.1 | 154 | All | Unknown |
| WP_093118589.1 | 178 | All | Unknown |
| WP_093118635.1 | 120 | All | Unknown |
| WP_093118833.1 | 82 | All | Unknown |
| WP_093119001.1 | 187 | All | Unknown |
| WP_093116803.1 | 141 | Unknown | |
| WP_093118036.1 | 211 | Unknown | |
| WP_093116745.1 | 226 | Unknown | |
Conserved Signature Proteins that are uniquely found in the subgroups of Thermoleophilia class.
| Accession no. | Length | Specificity |
|---|---|---|
| WP_093115090.1 | 197 | |
| WP_093115144.1 | 179 | |
| WP_093115294.1 | 164 | |
| WP_093115296.1 | 180 | |
| WP_093115479.1 | 319 | |
| WP_093115661.1 | 93 | |
| WP_093115901.1 | 156 | |
| WP_093115943.1 | 202 | |
| WP_093116532.1 | 154 | |
| WP_093116727.1 | 429 | |
| WP_093116780.1 | 110 | |
| WP_093116825.1 | 68 | |
| WP_093116919.1 | 83 | |
| WP_093117092.1 | 264 | |
| WP_093117483.1 | 93 | |
| WP_093117587.1 | 83 | |
| WP_093117642.1 | 114 | |
| WP_093117817.1 | 157 | |
| WP_093117827.1 | 199 | |
| WP_093117877.1 | 136 | |
| WP_093118281.1 | 403 | |
| WP_093118340.1 | 146 | |
| WP_093118436.1 | 119 | |
| WP_093118524.1 | 170 | |
| WP_093118569.1 | 80 | |
| WP_093118679.1 | 148 | |
| WP_093118731.1 | 93 | |
| WP_093118750.1 | 573 | |
| WP_093118752.1 | 195 | |
| WP_022926981.1 | 246 | |
| WP_022926986.1 | 115 | |
| WP_022927172.1 | 216 | |
| WP_022927347.1 | 417 | |
| WP_022927380.1 | 114 | |
| WP_022927389.1 | 468 | |
| WP_022927525.1 | 461 | |
| WP_022927538.1 | 181 | |
| WP_022927665.1 | 153 | |
| WP_022927703.1 | 253 | |
| WP_022927703.1 | 253 | |
| WP_022927792.1 | 224 | |
| WP_022927799.1 | 564 | |
| WP_022927801.1 | 265 | |
| WP_022928134.1 | 160 | |
| WP_022928438.1 | 136 | |
| WP_022928438.1 | 136 | |
| WP_022929183.1 | 133 | |
| WP_022929536.1 | 104 | |
| WP_022929558.1 | 227 | |
| WP_022930026.1 | 369 | |
| WP_022930484.1 | 604 | |
| WP_028721853.1 | 100 | |
| WP_051160538.1 | 289 | |
| WP_022926969.1 | 211 | |
| WP_022926970.1 | 304 | |
| WP_022927005.1 | 421 | |
| WP_022927132.1 | 338 | |
| WP_022927548.1 | 105 | |
| WP_022927557.1 | 100 | |
| WP_022927572.1 | 162 | |
| WP_022928009.1 | 773 | |
| WP_022928045.1 | 170 | |
| WP_022928129.1 | 165 | |
| WP_022928139.1 | 176 | |
| WP_022928142.1 | 174 | |
| WP_022928143.1 | 155 | |
| WP_022928333.1 | 248 | |
| WP_022928557.1 | 67 | |
| WP_022928588.1 | 110 | |
| WP_022928655.1 | 62 | |
| WP_022928967.1 | 242 | |
| WP_022929153.1 | 236 | |
| WP_022929154.1 | 209 | |
| WP_022929593.1 | 417 | |
| WP_022929618.1 | 66 | |
| WP_022929735.1 | 411 | |
| WP_022929823.1 | 153 | |
| WP_022929914.1 | 269 | |
| WP_022929990.1 | 171 | |
| WP_022930081.1 | 281 | |
| WP_022930294.1 | 190 | |
| WP_022930374.1 | 124 | |
| WP_022930538.1 | 472 | |
| WP_022930714.1 | 206 | |
FIGURE 3CSI specific to T. album and MAG HR41. Partial sequence alignment of arginine–tRNA ligase showing a 7 amino acid insertion that is uniquely shared by T. album and MAG HR41.
FIGURE 4CSI specific to the families Conexibacteraceae, Solirubrobacteraceae and Patulibacteraceae. Partial alignment of the protein NADH-quinone oxidoreductase subunit B showing a 1 amino acid deletion that is uniquely shared by 3 families Conexibacteraceae, Solirubrobacteraceae and Patulibacteraceae.
FIGURE 5CSI specific to Conexibacteraceae. A 1 amino acid insertion in the protein thioredoxin-disulfide reductase that is uniquely shared by C. woesei and associated MAG.
FIGURE 6CSI specific to Solirubrobacteraceae. A 3 amino acid CSI in the protein glutamine amidotransferase that is specific for S. soli and associated MAG.
FIGURE 7CSI specific to Patulibacteraceae. Partial sequence alignment of DNA-directed RNA polymerase subunit beta’ showing an 8 amino acid insertion that is specific for Patulibacteraceae.