| Literature DB >> 31681190 |
Na L Gao1,2, Jingchao Chen3, Teng Wang1, Martin J Lercher2, Wei-Hua Chen1,3,4.
Abstract
Viruses and plasmids can introduce novel DNA into bacterial cells, thereby creating an opportunity for genome expansion; conversely, CRISPR, the prokaryotic adaptive immune system, which targets and eliminates foreign DNAs, may impair genome expansions. Recent studies presented conflicting results over the impact of CRISPR on genome expansion. In this study, we constructed a comprehensive dataset of prokaryotic genomes and identified their associations with viruses and plasmids. We found that genomes associated with viruses and/or plasmids were significantly larger than those without, indicating that both viruses and plasmids contribute to genome expansion. Genomes were increasingly larger with increasing numbers of associated viruses or plasmids. Conversely, genomes with CRISPR systems were significantly smaller than those without, indicating that CRISPR has a negative impact on genome size. These results confirmed that on evolutionary timescales, viruses and plasmids facilitate genome expansion, while CRISPR impairs such a process in prokaryotes. Furthermore, our results also revealed that CRISPR systems show a preference for targeting viruses over plasmids.Entities:
Keywords: CRISPR; horizontal gene transfer; plasmids; prokaryotic genome expansion; viruses
Year: 2019 PMID: 31681190 PMCID: PMC6805729 DOI: 10.3389/fmicb.2019.02254
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1A total of 7,085 prokaryotic genomes and their associations with viruses (A), plasmids (B), and CRISPRs (C). The Venn diagram (D) shows the overlap of their distributions in prokaryotes. 1,962 genomes (27.69%) were not found to be associated with viruses, plasmids, or CRISPRs; 439 (6.2%) genomes were associated with all three elements.
Estimated enrichment of CRISPR in virus-associated and plasmid-associated genomes compared to other genomes, and enrichment of CRISPR in virus-associated compared to plasmid-associated genomes.
| Virus-associated vs. others | 1.18 | 1.07 × 10–3 |
| Plasmid-associated vs. others | 1.04 | 0.43 |
| Virus- vs. plasmid-associated | 1.08 | 0.21 |
| Virus-associated vs. othersa | 1.17 | 7.48 × 10–3 |
| Plasmid-associated vs. othersa | 0.97 | 0.67 |
| Virus- vs. Plasmid-associateda | 1.15 | 0.08 |
Relative importance of various factors for genome size in a linear model (LM).
| All | GC% | 0.086 | <2 × 10–16 | 91.84% |
| Plasmid | 0.714 | <2 × 10–16 | 5.91% | |
| Virus | 0.454 | <2 × 10–16 | 2.22% | |
| CRISPR | −0.043 | 0.248 | 0.03% | |
| Virus∗plasmid | −0.130 | 0.104 | – | |
| No plasmids | GC% | 0.087 | <2 × 10–16 | 96.62% |
| Virus | 0.454 | <2 × 10–16 | 3.18% | |
| CRISPR | −0.108 | 0.017 | 0.20% | |
| No viruses | GC% | 0.087 | <2 × 10–16 | 93.16% |
| Plasmid | 0.713 | <2 × 10–16 | 6.77% | |
| CRISPR | 0.066 | 0.168 | 0.07% |
FIGURE 2Increasing numbers of viruses and plasmids are associated with increased genome sizes, while virus-associated genomes with CRISPR systems are smaller than those without CRISPR systems. (A) Boxplot of genomes size as a function of the number of associated viruses. Genome sizes are larger with increasing numbers of associated viruses, when genomes without CRISPR systems. (B) Boxplot of genomes size as a function of the number of associated plasmids. The impact of plasmids on genome size is similar to that of viruses. (C) Boxplot of genome size as a function of the presence/absence of CRISPRs in genomes associated with viruses. Virus-associated genomes with CRISPR systems are significantly smaller in size than those without CRISPR, regardless of the number of viruses they are associated with. (D) Boxplots of genome sizes in genomes associated with plasmids as a function of the presence/absence of CRISPRs. CRISPRs have no significant impact on genome sizes in genomes associated with plasmids. Wilcoxon rank sum tests were used to compare between groups. Level of significance: ∗∗∗P < 0.001; ∗∗P < 0.01; ∗P < 0.05; NS. P ≥ 0.05.
Relative importance of various factors for GC-content (GC%) in a linear model (LM).
| All | Size | 4.081 | <2 × 10–16 | 99.12% |
| Plasmid | –1.423 | 5.5 × 10–5 | 0.85% | |
| Virus | –0.089 | 0.788 | 0.02% | |
| CRISPR | 0.115 | 0.656 | 0.01% | |
| Virus∗plasmid | –0.434 | 0.438 | – | |
| No plasmids | Size | 4.132 | <2 × 10–16 | 99.85% |
| Virus | –0.139 | 0.678 | 0.01% | |
| CRISPR | 0.618 | 0.048 | 0.14% | |
| No viruses | Size | 4.107 | <2 × 10–16 | 99.30% |
| Plasmid | –1.442 | 7.7 × 10–5 | 0.60% | |
| CRISPR | 0.528 | 0.109 | 0.10% |