| Literature DB >> 30515147 |
Alex Ranieri Jerônimo Lima1, Andrei Santos Siqueira1, Janaina Mota de Vasconcelos1, James Siqueira Pereira2, Juliana Simão Nina de Azevedo2, Pablo Henrique Gonçalves Moraes1, Délia Cristina Figueira Aguiar1, Clayton Pereira Silva de Lima3, João Lídio Silva Gonçalves Vianez-Júnior3, Márcio Roberto Teixeira Nunes3, Luciana Pereira Xavier4, Leonardo Teixeira Dall'Agnol5, Evonnildo Costa Goncalves1.
Abstract
Currently only four genome sequences for Limnothrix spp. are publicly available, and information on the genetic properties of cyanobacteria belonging to this genus is limited. In this study, we report the draft genome of Limnothrix sp. CACIAM 69d, isolated from the reservoir of a hydroelectric dam located in the Amazon ecosystem, from where cyanobacterial genomic data are still scarce. Comparative genomic analysis of Limnothrix revealed the presence of key enzymes in the cyanobacterial central carbon metabolism and how it is well equipped for environmental sulfur and nitrogen acquisition. Additionally, this work covered the analysis of Limnothrix CRISPR-Cas systems, pathways related to biosynthesis of secondary metabolites and assembly of extracellular polymeric substances and their exportation. A trans-AT PKS gene cluster was identified in two strains, possibly related to the novel toxin Limnothrixin biosynthesis. Overall, the draft genome of Limnothrix sp. CACIAM 69d adds new data to the small Limnothrix genome library and contributes to a growing representativeness of cyanobacterial genomes from the Amazon region. The comparative genomic analysis of Limnothrix made it possible to highlight unique genes for each strain and understand the overall features of their metabolism.Entities:
Keywords: Amazonian; Limnothrix; comparative genomics; cyanobacteria; genome mining
Year: 2018 PMID: 30515147 PMCID: PMC6256058 DOI: 10.3389/fmicb.2018.02811
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Map of Tucuruí Hydroelectric Dam, showing the M1 sampling point.
NaPDoS results for KS domain containing proteins present in trans-AT PKS clusters found in Limnothrix sp. CACIAM 69d and Limnothrix sp. P13C2.
| Query ID (locus tag) | NaPDoS database match ID | Percent identity | Align length | E-value | Pathway product | Organism |
|---|---|---|---|---|---|---|
| BJG00_012525 | KirAIV_CAN89634_7T | 55 | 417 | 7 E-117 | Kirromycin | |
| BJG00_012510 | VirA_BAF50727_4T | 59 | 438 | 2 E-138 | Virginiamycin | |
| BJG00_012505 | VirA_BAF50727_4T | 57 | 431 | 5 E-133 | Virginiamycin | |
| BCR12_15970 | KirAIV_CAN89634_7T | 55 | 426 | 5 E-120 | Kirromycin | |
| BCR12_15975 | LnmJ_AF484556_1T | 59 | 432 | 6 E-137 | Leinamycin | |
| BCR12_15980 | VirA_BAF50727_4T | 57 | 431 | 1 E-133 | Virginiamycin | |
FIGURE 2Bayesian phylogenetic tree based on 16S rRNA sequences from Limnothrix strains and other Oscillatorian genera. Limnothrix strains with sequenced genomes are shown in bold. Gloeobacter violaceus PCC 7421 was used as outgroup. GenBank accession numbers are in parentheses.
CheckM results for each assembler and binning tool.
| Tool | MyCC | MaxBin 2.2 | MetaBAT2 | |||
|---|---|---|---|---|---|---|
| Assembler | SPAdes | Newbler | SPAdes | Newbler | SPAdes | Newbler |
| Bin01 | Cyanobacteria (99.64%, 0.34%, 22x) | Cyanobacteria (99.64%, 0.34%, 21x) | Cyanobacteria (99.64%, 0.34%, 23x) | Cyanobacteria (99.64%, 0.34%, 27x) | Cyanobacteria (99.64%, 0.34%, 22x) | Cyanobacteria (99.64%, 0.34%, 26x) |
| Bin02 | Sphingomonadales (94.28%, 31.3%, 2.2x) | Sphingomonadales (98.44%, 21.1%, 2.7x) | Sphingomonadales (96.77%, 8.08%, 4x) | Sphingomonadales (98.34%, 5.13%, 4.6x) | Sphingomonadales (96.77%, 3.1%, 7.2%) | Sphingomonadales (98.27%, 0.97%, 8x) |
| Bin03 | Bacteroidetes (91.98%, 4.3%, 4x) | Bacteroidetes (97.2%, 2.8%, 5.7x) | Bacteroidetes (95%, 16.35%, 3.6x) | Bacteroidetes (99.5%, 8.38%, 4.4x) | Bacteroidetes (92.65%. 0.99%, 6.2x) | Bacteroidetes (97.16%, 0.99%, 7x) |
| Bin04 | Alphaproteobacteria (89.94%, 43.53%, 2x) | Alphaproteobacteria (92%, 7.81%, 4.2x) | – | Alphaproteobacteria (99.73%, 17.54%, 3.4x) | Alphaproteobacteria (98.91%, 0.65%, 6.7x) | Alphaproteobacteria (98.63%, 1.88%, 7.3x) |
| Bin05 | Sphingomonadales (89.2%, 8.22%, 3.1x) | Sphingomonadales (92.68%, 18.29%, 3.3x) | Sphingomonadales (90.75%, 20.7%, 2.7x) | Sphingomonadales (91.53%, 23.25%, 3.2x) | Sphingomonadales (87.07%, 3.05%, 4.4x) | Sphingomonadales (87.08%, 2.02%, 4.8x) |
| Bin06 | Burkholderiales (70.72%, 9.47%, 2.5x) | Burkholderiales (84.2%, 4.89%, 3x) | – | Burkholderiales (85.77%, 25.64%, 2.7x) | Burkholderiales (65.03%, 3.24%, 2.9x) | Burkholderiales (74.79%, 2.16%, 3.3x) |
| Bin07 | Gammaproteobacteria (24.08%, 2.27%, 1.9x) | – | – | – | – | – |
| Bin08 | Gammaproteobacteria (14.2%, 0.07%, 2x) | – | – | – | – | – |
| Bin09 | Rhizobiales (15.61%, 1.85%, 2.1x) | Rhizobiales (9.08%, 0.72%, 2.4x) | – | – | – | – |
| Bin10 | – | Xanthomonadaceae (11.3%, 0.08%, 2.3x) | – | – | Xanthomonadaceae (26.11%, 1.59%, 2.2x) | Xanthomonadaceae (25.4%, 0.11%, 2.5x) |
| Bin11 | – | – | Sphingomonadales (7.31%, 1.05%, 2.1x) | – | – | – |
| Bin12 | – | – | Pseudomonadales (7.31%, 1.05%, 2.1x) | – | – | – |
| Bin13 | – | – | – | Burkholderiales (6.8%, 0.75%, 2.4x) | – | – |
FIGURE 3Phylogenetic Species Tree based on genomic sequences. Limnothrix strains are highlighted with a yellow background. The numbers presented are confidence values used by FastTree 2 (Price et al., 2010) to estimate maximum likelihood. GenBank and RefSeq assembly accessions are provided within the brackets.
FIGURE 4Classification of CRISPR-Cas systems found in all four analyzed Limnothrix genome (A–D). The colors indicate the following categories: repeat-associated mysterious protein (RAMP) family RNases involved in crRNA process (gray), CRISPR-Cas protein with no major category (light blue), large CRISPR-associated complex for antiviral defense (Cascade) subunits (yellow), RAMP Cascade subunits (dark blue), fused Cas protein (purple), transcriptional regulator (light pink), protein containing a CRISPR-associated domain (green), CRISPR repeat region (orange) and domain of unknown function (DUF) or hypothetical protein (HP) or other (red).
FIGURE 5Number of hits per COG category for all four Limnothrix genomes and Synechocystis sp. PCC 6803.
Number of protein sequences for representatives of core, accessory and unique orthologous clusters present in all four Limnothrix genomes.
| Organism | No. of core genes | No. of accessory genes | No. of unique genes | No. of coding CDS |
|---|---|---|---|---|
| 1,088 | 81 | 2,296 | 3,540 | |
| 1,088 | 2,008 | 451 | 3,718 | |
| 1,088 | 2,417 | 59 | 3,672 | |
| 1,088 | 2,155 | 68 | 3,723 | |
FIGURE 6COG distribution of core, accessory and unique genes present in all four analyzed Limnothrix genomes generated by Bacterial Pan Genome Analysis Tool (BPGA).
Number of protein sequences for representatives of core, accessory and unique orthologous clusters present in Limnothrix genomes, excluding Limnothrix rosea IAM M-220.
| Organism | No. of core genes | No. of accessory genes | No. of unique genes | No. of coding CDS |
|---|---|---|---|---|
| 2,741 | 303 | 496 | 3,718 | |
| 2,741 | 759 | 59 | 3,672 | |
| 2,741 | 498 | 69 | 3,723 | |
FIGURE 7Schematic showing TCA cycle and nitrogen acquisition pathways in Limnothrix. The dotted boxes delimit carbon (black) and nitrogen (green) metabolisms. Compounds are represented by circles and their respective names indicated in black. Arrows indicate reactions; enzyme reactions are shown in red, transport reactions in orange and a spontaneous KEGG reaction is shown in purple. The green reaction occurs exclusively in Limnothrix rosea IAM M-220 NIES-208. ACO, aconitase; ArgD, γ-aminobutyric acid aminotransferase; CA, carbonic anhydrase; CL, cyanate lyase; CS, citrate synthase; FUM, fumarase; GadA, glutamate decarboxylase; GdhA, glutamate dehydrogenase; GlnA, glutamine synthase; GS, glutamine synthase; IDH, isocitrate dehydrogenase; MDH, malate dehydrogenase; ME, malic enzyme; 2-OGDC, 2-oxoglutarate decarboxylase; PDC, pyruvate dehydrogenase complex; PEP, phosphoenolpyruvate; PEPC, phosphoenolpyruvate carboxylase; PK, pyruvate kinase; SDH, succinate dehydrogenase; SSADH, succinic semialdehyde dehydrogenase.
FIGURE 8Piricyclamide gene cluster schematics. (A) Cluster found in Limnothrix sp. CACIAM 69d contig MKGP02000097 and (B) cluster previously described in Microcystis aeruginosa PCC7005. The precursor genes are shown in red, proteases are blue, prenyltransferase is purple, genes with no prediction function are brown, hypothetical proteins (HP) are light green, a transposase is dark green, and R is for repeat region.
FIGURE 9Domains present in trans-AT PKS cluster found in Limnothrix sp. CACIAM 69d and Limnothrix sp. P13C2. The results are presented according to antiSMASH 4.0 output. The red circle indicates a domain present only in P13C2 strain and the purple circle a domain found only in CACIAM 69d strain. The order of the CDS for CACIAM 69d and P13C2 is shown at the bottom. ACP, acyl carrier protein; AT, acyltransferase domain; C, condensation domain; DH, dehydratase domain; ER, enoyl reductase domain; KR, keto reductase domain; KS, keto synthase domain; MT, methyl transferase domain; TE, thioesterase domain.
FIGURE 10Subtree generated by NaPDoS for Limnothrix sp. CACIAM 69d and Limnothrix sp. P13C2 trans-AT PKS cluster analysis. Limnothrix domain sequences are in red and named based on Table 4; the initials BCR12 and BJG00 are, respectively, from Limnothrix sp. P13C2 and Limnothrix sp. CACIAM 69d. Other domain sequences are named according to the NaPDoS database: Vir, virginiamycin; Tet, tetronomycin; Lnm, leinamycin; Kir, kirromycin. The numbers presented are confidence values generated in the NaPDoS tree reconstruction process, which uses the FastTree (Guindon and Gascuel, 2003) to estimate maximum likelihood. The complete tree is available in Supplementary File 3.