| Literature DB >> 35707396 |
Philippe Normand1, Petar Pujic1, Danis Abrouk1, Spandana Vemulapally2, Trina Guerra2, Camila Carlos-Shanley2, Dittmar Hahn2.
Abstract
The genomes of two nitrogen-fixing Frankia strains, Ag45/Mut15 and AgPM24, isolated from root nodules of Alnus glutinosa are described as representatives of a novel candidate species. Phylogenomic and ANI analyses confirmed that both strains are related to cluster 1 frankiae, and that both strains belong to a novel species. At 6.4 - 6.7 Mb, their genomes were smaller than those of other cultivated Alnus-infective cluster 1 strains but larger than that of the non-cultivated Alnus-infective cluster 1 Sp+ strain AgTrS that was their closest neighbor as assessed by ANI. Comparative genomic analyses identified genes essential for nitrogen-fixation, gene composition as regards COGs, secondary metabolites clusters and transcriptional regulators typical of those from Alnus-infective cluster 1 cultivated strains in both genomes. There were 459 genes present in other cultivated Alnus-infective strains lost in the two genomes, spread over the whole of the genome, which indicates genome erosion is taking place in these two strains. © The author(s).Entities:
Keywords: Actinorhizal symbiosis; Frankia; biosynthetic gene clusters; genome; nitrogen-fixing frankiae
Year: 2022 PMID: 35707396 PMCID: PMC9194555 DOI: 10.7150/jgen.74788
Source DB: PubMed Journal: J Genomics
Basic genome characteristics of Frankia strains Ag45/Mut15 and AgPM24 compared to those of type strains of Frankia species in clusters 1 to 4
| Cluster 1 | Cluster 2 | Cluster 3 | Cluster 4 | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Strain | ACN14aT | ARgP5T | CpI1T | QA3 | Ag45/Mut15 | AgPM24 | CcI3T | BMG5.1T | BCU110501T | BMG5.12T | G2T | CjT | EAN1pec | EUN1f | M16386T | EuI1cT | Cn3T | DC12 |
| Collection | DSM 45986 | DSM 45898 | DSM 44263 | DSM 45818 | DSM 100624 | DSM | DSM 46783 | DSM 45899 | DSM 100623 | DSM 100626 | DSM 45817 | DSM 105290 | ||||||
|
|
|
|
|
|
|
|
|
|
|
|
| |||||||
| Genomic G+C content (mol%) | 72.8 | 72.4 | 72.4 | 72.6 | 71.37 | 71.35 | 70.1 | 71.0 | 72.3 | 71.7 | 70.9 | 71.1 | 70.94 | 70.82 | 71.93 | 72.3 | 71.8 | 71.93 |
| Genome length (nt) | 7497934 | 7730285 | 7624758 | 7590853 | 6443382 | 6672691 | 5433628 | 5795263 | 7891711 | 7589313 | 9537992 | 8061539 | 9035218 | 9322173 | 9435764 | 8815781 | 9978592 | 6884336 |
| # CDS | 6,714 | 7,500 | 7,201 | 7,307 | 6,088 | 6,370 | 5,593 | 6,487 | 7,567 | 6,977 | 8,663 | 8,108 | 9,063 | 9,428 | 8,884 | 8,099 | 9,262 | 6,630 |
| # secondary metabolite clusters* | 27 | 33 | 28 | 33 | 29 | 38 | 26 | 22 | 36 | 35 | 37 | 30 | 27 | 33 | 29 | 23 | 28 | 15 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 |
|
| 2 | 2 | 2 | 2 | 2 | 2 | 1 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 |
|
| 2 | 2 | 2 | 2 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
|
| 2 | 2 | 2 | 0 | 2 | 2 | 0 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 |
|
| 1 | 1 | 1 | 0 | 1 | 1 | 0 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
|
| 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 0 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 0 | 0 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 |
|
| 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 1 | 2 | 2 | 0 | 1 | 0 | 0 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 |
|
| 1 | 1 | 2 | 2 | 2 | 2 | 1 | 1 | 2 | 2 | 2 | 1 | 2 | 2 | 0 | 0 | 0 | 0 |
|
| 1 | 2 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 |
|
| 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 1 |
| # contigs | 1 | 568 | 153 | 120 | 113 | 181 | 1 | 116 | 207 | 139 | 83 | 289 | 1 | 396 | 174 | 1 | 2 | 1 |
| Accession | NC_008278.1 | OESX01000001 | JYFN00000000 | WGS NZ_AJWA.1 | JALKFT000000000 | JALKFW000000000 | CP000249.1 | JWIO00000000 | ARDT00000000 | ARFH00000000 | FAOZ00000000 | MAXA00000000.1 | AAII00000000 | ADGX00000000 | MOMC00000000 | CP002299.1 | AGJN00000000 | LANG01000000 |
| Reference | (30) | (39) | (40) | (41) | this study | (30) | (38) | (42) | (43) | (44) | (45) | (30) | (46) | (47) | (46) | (30) | (46) | |
* indicates the number of clusters identified by AntiSMASH
** indicates the number of hits (>50%) following a BlastP. nif is nitrogenase, shc is squalene hopene cyclase, hup is hydrogenase uptake, suf is sulfur-iron cluster, cel is cellulase, glx is glucose oxidase, bcs is cellulose synthase,
gvp is gas vesicle cluster, sodF is superoxide dismutase iron, geoA is geosmine synthase, arG is arginine, acc is acetate carboxylase, can is carbonic anhydrase, rhb is rhizobactin, lac is laccase, phd is a phytoene desaturase,
dct is a dicarboxylate transporter, tgs is diacylglycerol O-acyltransferase. ddn is F420H(2)-dependent quinone nitroreductase, mop is molybdenum transport, qor is quinone oxydoreductase, glb is hemoglobin.
Figure 1Phylogenetic tree of complete genomes using Micromonospora lupini (NZ_CAIE00000000.1) as outgroup. Clusters are indicated on the right.
Figure 2Heatmap matrix of Average Nucleotide Identity (ANI) comparisons (in percent) for the Frankia genomes of type strains of described species using the pyani platform with the b (Blast) setting (23); https://pyani.readthedocs.io). The two genomes described in the present study are highlighted in grey.
COG characteristics of Frankia strains Ag45/Mut15 and AgPM24 compared to those of type strains of Frankia species in clusters 1 to 4
| Strain | Cluster 1 | Cluster 2 | Cluster 3 | Cluster 4 | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ACN14aT | ARgP5T | CpI1T | QA3 | Ag45/Mut15 | AgPM24 | CcI3T | BMG5.1T | BCU110501T | BMG5.12T | G2T | CjT | EAN1pec | EUN1f | M16386T | EuI1cT | Cn3T | DC12 | |
| species |
|
|
|
|
|
|
|
|
|
|
|
| ||||||
| Class1 | ||||||||||||||||||
| D | 56 | 66 | 75 | 64 | 56 | 61 | 57 | 80 | 65 | 63 | 80 | 66 | 65 | 78 | 63 | 62 | 64 | 67 |
| M | 241 | 189 | 253 | 236 | 225 | 241 | 207 | 203 | 292 | 259 | 297 | 248 | 292 | 311 | 299 | 258 | 266 | 255 |
| N | 19 | 15 | 26 | 22 | 12 | 17 | 12 | 30 | 20 | 16 | 28 | 21 | 20 | 29 | 16 | 20 | 11 | 17 |
| O | 181 | 134 | 181 | 190 | 133 | 140 | 147 | 149 | 200 | 165 | 176 | 200 | 200 | 195 | 177 | 173 | 200 | 149 |
| T | 325 | 226 | 320 | 326 | 291 | 290 | 232 | 253 | 405 | 336 | 436 | 400 | 405 | 418 | 415 | 405 | 494 | 282 |
| U | 42 | 38 | 50 | 38 | 45 | 50 | 48 | 50 | 54 | 53 | 66 | 56 | 54 | 64 | 53 | 52 | 52 | 50 |
| V | 94 | 74 | 86 | 102 | 77 | 81 | 60 | 78 | 107 | 84 | 117 | 126 | 107 | 110 | 130 | 113 | 153 | 113 |
| J | 212 | 226 | 212 | 257 | 209 | 212 | 202 | 243 | 207 | 197 | 203 | 219 | 207 | 226 | 243 | 241 | 247 | 232 |
| K | 565 | 402 | 594 | 646 | 509 | 525 | 369 | 409 | 739 | 577 | 778 | 688 | 739 | 755 | 785 | 809 | 945 | 520 |
| L | 270 | 254 | 351 | 356 | 308 | 319 | 433 | 289 | 613 | 398 | 398 | 518 | 613 | 468 | 380 | 286 | 409 | 399 |
| C | 435 | 323 | 455 | 472 | 346 | 347 | 256 | 362 | 492 | 394 | 527 | 451 | 492 | 530 | 555 | 507 | 589 | 332 |
| E | 523 | 386 | 482 | 534 | 452 | 451 | 335 | 396 | 577 | 461 | 630 | 516 | 577 | 623 | 670 | 661 | 704 | 447 |
| F | 111 | 82 | 104 | 108 | 96 | 94 | 94 | 92 | 107 | 94 | 103 | 101 | 107 | 97 | 129 | 116 | 114 | 107 |
| G | 326 | 274 | 321 | 342 | 289 | 297 | 233 | 249 | 418 | 326 | 372 | 360 | 418 | 428 | 450 | 426 | 488 | 302 |
| H | 192 | 149 | 186 | 187 | 170 | 184 | 174 | 173 | 187 | 177 | 192 | 181 | 187 | 182 | 188 | 186 | 208 | 163 |
| I | 432 | 258 | 400 | 460 | 296 | 303 | 191 | 297 | 513 | 412 | 643 | 405 | 513 | 619 | 586 | 624 | 619 | 313 |
| P | 311 | 243 | 323 | 332 | 307 | 313 | 210 | 293 | 381 | 298 | 408 | 343 | 381 | 387 | 402 | 394 | 427 | 278 |
| Q | 376 | 226 | 368 | 371 | 304 | 339 | 197 | 320 | 488 | 369 | 565 | 417 | 488 | 550 | 531 | 534 | 569 | 256 |
| R | 1009 | 704 | 1005 | 1059 | 814 | 836 | 619 | 682 | 1216 | 969 | 1323 | 1064 | 1216 | 1280 | 1343 | 1332 | 1508 | 865 |
| S | 301 | 226 | 315 | 286 | 258 | 278 | 223 | 243 | 323 | 297 | 336 | 328 | 323 | 338 | 341 | 334 | 375 | 284 |
1class: D: Cell cycle control, cell division, chromosome partitioning; M: Cell wall/membrane/envelope biogenesis; N: Cell motility; O: Posttranslational modification, protein turnover, chaperones; T: Signal transduction mechanisms; U: Intracellular trafficking, secretion, and vesicular transport; V: Defense mechanisms; J: Translation, ribosomal structure and biogenesis; K: Transcription; L: Replication, recombination and repair; C: Energy production and conversion; E: Amino acid transport and metabolism; F: Nucleotide transport and metabolism; G: Carbohydrate transport and metabolism; H: Coenzyme transport and metabolism; I: Lipid transport and metabolism; P: Inorganic ion transport and metabolism; Q: Secondary metabolites biosynthesis, transport and catabolism; R: General function prediction only; S: Function unknown.
Number of secondary metabolites clusters (antiSMASH) of Frankia strains Ag45/Mut15 and AgPM24 compared to those of cultivated type strains of Frankia species in clusters 1 to 4
| Strain | Cluster 1 | Cluster 2 | Cluster 3 | Cluster 4 | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ACN14aT | ARgP5T | CpI1T | QA3 | Ag45/Mut15 | AgPM24 | CcI3T | BMG5.1T | BCU110501T | BMG5.12T | G2T | CjT | EAN1pec | EUN1f | M16386T | EuI1cT | Cn3T | DC12 | |
| species |
|
|
|
|
|
|
|
|
|
|
|
| ||||||
| t1PKS1 | 6 | 9 | 8 | 8 | 9 | 11 | 1 | 6 | 16 | 13 | 6 | 9 | 5 | 9 | 6 | 5 | 2 | 1 |
| t2PKS | 1 | 3 | 1 | 3 | 1 | 1 | 2 | 2 | 1 | 1 | 2 | 1 | 1 | 2 | 1 | 2 | 1 | 1 |
| t3PKS | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 2 | 0 | 1 | 1 | 1 | 1 | 1 | 3 | 1 | 1 | 2 |
| otherKS | 4 | 4 | 3 | 3 | 3 | 5 | 4 | 1 | 4 | 3 | 6 | 4 | 4 | 6 | 2 | 1 | 2 | 1 |
| t1pks-NRPS | 1 | 0 | 1 | 0 | 1 | 2 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 |
| NRPS | 3 | 6 | 2 | 2 | 6 | 6 | 0 | 1 | 1 | 2 | 9 | 5 | 3 | 5 | 4 | 2 | 7 | 1 |
| terpene | 5 | 3 | 5 | 5 | 4 | 4 | 4 | 3 | 4 | 4 | 3 | 5 | 4 | 4 | 5 | 4 | 4 | 3 |
| lanthipeptide | 1 | 1 | 1 | 3 | 0 | 3 | 6 | 2 | 4 | 3 | 2 | 1 | 4 | 3 | 1 | 2 | 1 | 2 |
| bacteriocin | 2 | 1 | 2 | 2 | 1 | 2 | 1 | 1 | 2 | 2 | 2 | 2 | 3 | 0 | 3 | 1 | 2 | 0 |
| siderophore | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 |
| lassopeptide | 1 | 1 | 1 | 2 | 1 | 1 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 2 | 1 | 2 |
| betalactone | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 1 | 2 | 0 | 1 |
| thiopeptide | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| butyrolactone | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| phosphonate | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| arylpolyene | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| nucleoside | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| ladderane | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| oligosaccharide | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
| resorcinol | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
| LAP | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| other | 0 | 2 | 2 | 3 | 1 | 0 | 4 | 1 | 1 | 5 | 1 | 1 | 0 | 1 | 2 | 1 | 5 | 1 |
|
| 27 | 33 | 28 | 33 | 29 | 38 | 26 | 22 | 36 | 35 | 37 | 30 | 27 | 33 | 29 | 23 | 28 | 15 |
1-tnPKS is type “n” PolyKetide Synthase, NRPS is Non Ribosomal Peptide Synthase, LAP is Linear Azole/azoline-containing Peptide.
Number of transcriptional regulators of Frankia strains Ag45/Mut15 and AgPM24 compared to those of type strains of Frankia species in clusters 1 to 4
| Strain | Cluster 1 | Cluster 2 | Cluster 3 | Cluster 4 | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ACN14aT | ARgP5T | CpI1T | QA3 | Ag45/Mut15 | AgPM24 | CcI3T | BMG5.1T | BCU110501T | BMG5.12T | G2T | CjT | EAN1pec | EUN1f | M16386T | EuI1cT | Cn3T | DC12 | |
| species |
|
|
|
|
|
|
|
|
|
|
|
| ||||||
|
| ||||||||||||||||||
| AraC | 9 | 9 | 10 | 16 | 6 | 6 | 2 | 5 | 15 | 13 | 17 | 16 | 28 | 17 | 20 | 22 | 21 | 6 |
| ArsR | 9 | 6 | 5 | 1 | 7 | 6 | 6 | 5 | 4 | 4 | 11 | 6 | 11 | 9 | 9 | 16 | 8 | 8 |
| AsnC | 3 | 2 | 2 | 4 | 4 | 3 | 3 | 2 | 3 | 3 | 3 | 3 | 5 | 4 | 5 | 5 | 5 | 3 |
| CRP | 4 | 2 | 1 | 1 | 4 | 4 | 2 | 3 | 3 | 3 | 5 | 2 | 3 | 5 | 3 | 5 | 2 | 3 |
| DeoR | 4 | 1 | 0 | 0 | 1 | 2 | 0 | 0 | 2 | 1 | 0 | 2 | 4 | 0 | 2 | 2 | 2 | 1 |
| DtxR | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| FurC | 2 | 3 | 3 | 4 | 3 | 3 | 2 | 2 | 5 | 4 | 5 | 5 | 4 | 4 | 5 | 4 | 4 | 5 |
| GntR | 25 | 19 | 10 | 20 | 7 | 5 | 6 | 8 | 21 | 12 | 19 | 19 | 20 | 24 | 27 | 35 | 30 | 11 |
| IclR | 3 | 6 | 4 | 9 | 4 | 3 | 2 | 1 | 4 | 7 | 12 | 6 | 12 | 12 | 6 | 13 | 11 | 7 |
| LuxR | 10 | 19 | 19 | 36 | 10 | 14 | 20 | 15 | 22 | 9 | 18 | 15 | 58 | 40 | 18 | 64 | 29 | 14 |
| LysR | 18 | 16 | 12 | 22 | 11 | 10 | 5 | 5 | 14 | 10 | 20 | 13 | 13 | 17 | 24 | 20 | 22 | 13 |
| MarR | 21 | 19 | 13 | 33 | 16 | 15 | 15 | 23 | 18 | 20 | 31 | 25 | 27 | 30 | 32 | 33 | 35 | 19 |
| MerR | 8 | 17 | 9 | 22 | 10 | 10 | 12 | 4 | 13 | 12 | 15 | 13 | 15 | 17 | 16 | 19 | 18 | 7 |
| TetR | 92 | 78 | 117 | 127 | 61 | 65 | 30 | 47 | 113 | 77 | 39 | 126 | 147 | 156 | 156 | 191 | 18 | 59 |
| WhiB | 7 | 7 | 8 | 7 | 5 | 6 | 6 | 6 | 10 | 5 | 6 | 9 | 6 | 5 | 8 | 10 | 7 | 8 |
1class: AraC: arabinose regulator; ArsR: arsenic resistance; AsnC: asparagine synthase regulator; CRP: cyclic AMP receptor protein (catabolite repression); DeoR: deoxyribonucleoside synthesis operon regulator; DtxR: diphtheria toxin repressor; FurC: ferric uptake regulator; GntR: gluconate regulator; IclR: isocitrate lyase regulator; LuxR: quorum-sensing luminescence regulator; LysR: lysine regulator; MarR: Multiple antibiotic resistance regulator; MerR: mercury resistance regulator; TetR: Tetracycline repressor; WhiB: regulation of morphological differentiation.