| Literature DB >> 25437804 |
Akifumi Yamashita1, Tsuyoshi Sekizuka2, Makoto Kuroda3.
Abstract
The global clustering of gene families through network analysis has been demonstrated in whole genome, plasmid, and microbiome analyses. In this study, we carried out a plasmidome network analysis of all available complete bacterial plasmids to determine plasmid associations. A blastp clustering search at 100% aa identity cut-off and sharing at least one gene between plasmids, followed by a multilevel community network analysis revealed that a surprisingly large number of the plasmids were connected by one largest connected component (LCC), with dozens of community sub-groupings. The LCC consisted mainly of Bacilli and Gammaproteobacteria plasmids. Intriguingly, horizontal gene transfer (HGT) was noted between different phyla (i.e., Staphylococcus and Pasteurellaceae), suggesting that Pasteurellaceae can acquire antimicrobial resistance (AMR) genes from closely contacting Staphylococcus spp., which produce the external supplement of V-factor (NAD). Such community network analysis facilitate displaying possible recent HGTs like a class 1 integron, str and tet resistance markers between communities. Furthermore, the distribution of the Inc replicon type and AMR genes, such as the extended-spectrum ß-lactamase (ESBL) CTX-M or the carbapenemases KPC NDM-1, implies that such genes generally circulate within limited communities belonging to typical bacterial genera. Thus, plasmidome network analysis provides a remarkable discriminatory power for plasmid-related HGT and evolution.Entities:
Year: 2014 PMID: 25437804 PMCID: PMC4243450 DOI: 10.3390/pathogens3020356
Source DB: PubMed Journal: Pathogens ISSN: 2076-0817
Number of connected component and community according to identity cutoff values.
| Clustering program | Identity cutoff (%) | Edges | Connected nodes | Connected components | Nodes in LCC | LCC % in total | Number of communities in the LCC using multilevel method |
|---|---|---|---|---|---|---|---|
| BLASTP | 50 | 251,961 | 3529 | 57 | 3265 | 86.1% | 19 |
| BLASTP | 60 | 177,932 | 3444 | 70 | 2793 | 73.6% | 25 |
| BLASTP | 70 | 129,545 | 3358 | 90 | 2613 | 68.9% | 31 |
| BLASTP | 80 | 104,815 | 3229 | 127 | 2265 | 59.7% | 26 |
| BLASTP | 90 | 81,752 | 3060 | 162 | 1995 | 52.6% | 34 |
| BLASTP | 99 | 52,029 | 2633 | 236 | 1389 | 36.6% | 36 |
| BLASTP | 100 | 41,956 | 2496 | 259 | 1241 | 32.7% | 26 |
| UCLUST | 50 | 188,844 | 3524 | 71 | 3233 | 85.2% | ND |
| UCLUST | 60 | 147,899 | 3435 | 75 | 2762 | 72.8% | ND |
| UCLUST | 70 | 121,841 | 3349 | 98 | 2558 | 67.4% | ND |
| UCLUST | 80 | 101,902 | 3213 | 133 | 2241 | 59.1% | ND |
| UCLUST | 90 | 82,046 | 3057 | 174 | 1949 | 51.4% | ND |
| UCLUST | 99 | 52,578 | 2626 | 240 | 1369 | 36.1% | ND |
| UCLUST | 100 | 43,275 | 2501 | 263 | 1232 | 32.5% | ND |
CC: connected component; LCC: largest connected component; ND: not determined.
Figure 1Plasmidome network analysis. (A) The filled circles (nodes) represent the respective plasmids. The grey lines (edges) represent the connections among the plasmids. The thickness of the edges represents the number of genes shared among the plasmids. The edges denoting large numbers of shared genes appear as grey ellipses. The colours of the nodes are automatically set according to the plasmid communities inferred from the multilevel method using the R igraph package. The nodes within the dotted circle represent the largest connected component (LCC). (B) Close-up of the connecting points between Gammaproteobacteria and Bacilli.
Figure 2Evaluation of community connections using several methods. The Shannon index values for the LCC using different identity cut-off values and community estimation methods at the phylum (A), class (B), and order (C) levels. (D) The numbers of communities in the LCC using different identity cut-off values.
Figure 3A schematic representation of the communities from the whole image shown in Figure 1. The communities mainly included Alphaproteobacteria, Betaproteobacteria, Gammaproteobacteria, Bacilli, and Borrelia, which are indicated by purple, green, pink, grey, and black circles, respectively. The other communities are indicated by blue circles. The number of the top three species in the community is shown beside the circle. The circle with the light-blue rim represents the communities without AMR genes. The red lines represent the AMR gene related connections among the communities.
Figure 4The host bacterial components of the communities. The upper and lower graphs show the numbers and ratios of the host bacterial components, respectively, for the class (A) and order (B) levels.
Figure 5COG annotation for genes on the edges between communities. (A) COG categories on all of the edges in the LCC. (B) The COG ID for the category [L] of replication, recombination and repair. (C) The COG ID for the category [V] of defence mechanisms.
Figure 6Frequently hit sequence on the 66 edges in LCC. Heatmap of sequence pairs of 1506 sequences on the 66 edges in LCC. Close-up of top 9 frequently hit sequences. Class 1 integron (21 edges), str (15 edges) and tet (15 edges) genes on the edges were shown in bold red line.
The class 1 integron mediated en bloc transmission of AMR genes among communities.
| Order of gene groups | Community | Total | |||||||
|---|---|---|---|---|---|---|---|---|---|
| co18 | co22 | co29 | co44 | co64 | co68 | co193 | |||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 2 | 1 | 1 | 4 | ||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 2 | 2 | ||||||||
| 1 | 1 | 2 | |||||||
| 1 | 1 | ||||||||
| 2 | 2 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 3 | 3 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| 1 | 1 | ||||||||
| Total | 3 | 9 | 2 | 2 | 1 | 22 | 1 | 40 | |
| 01 | integron integrase IntI1. | class 1 integron component | |||||||
| 02, 03 | quaternary ammonium compound-resistance protein QacEdelta1. | ||||||||
| 04 | sulphonamide resistant dihydropteroate synthase Sul1. | ||||||||
| 05 | protein YbaA. | ||||||||
| 06 | aminoglycoside 3'-
| aminoglycoside resistance | |||||||
| 07 | aminoglycoside 3'-
| ||||||||
| 08 | aminoglycoside acetyltransferase. | ||||||||
| 09 | aminoglycoside adenylyltransferase. | ||||||||
| 10 | aminoglycoside
| ||||||||
| 11–14 | aminoglycoside resistance protein
| ||||||||
| 15 | aminoglycoside-2'-adenylyltransferase. | ||||||||
| 16 | acetyltransferases
| ||||||||
| 17 | putative aminoglycoside 6'-
| ||||||||
| 18 | putative aminoglycoside adenyltransferase. | ||||||||
| 19 | class D β-lactamase OXA-10. | β-lactamase | |||||||
| 20 | class D β-lactamase OXA-21. | ||||||||
| 21 | class D β-lactamase OXA-2. | ||||||||
| 22 | class D β-lactamase OXA. | ||||||||
| 23 | metallo-β-lactamase GIM-1. | ||||||||
| 24 | metallo-β-lactamase IMP-6. | ||||||||
| 25 | metallo-β-lactamase VIM-1. | ||||||||
| 26 | chloramphenicol acetyltransferase catB2. | chloramphenicol resistance | |||||||
| 27 | chloramphenicol acetyltransferase catB8. | ||||||||
| 28, 29 | chloramphenicol acetyltransferase. | ||||||||
| 30 | chloramphenicol aminotransferase. | ||||||||
| 31 | chloramphenicol resistance protein CmlA1. | ||||||||
| 32 | dihydrofolate reductase DfrA5. | dihydrofolate reductase | |||||||
| 33-35 | dihydrofolate reductase. | ||||||||
| 36 | trimethoprim-resistant dihydrofolate reductase type I DhfrA1. | ||||||||
| 37–40 | transposase. | ||||||||
| 41 | molecular chaperone GroEL. | ||||||||
| 42 | molecular chaperone GroES. | ||||||||
| 43 | fluoroquinolone resistance protein QnrB2. | ||||||||
| 44 | quinolone-resistance determinant Qnr. | ||||||||
| 45–52 | Others | ||||||||
| 53-60 | hypothetical protein. | ||||||||
Figure 7Pairwise comparisons of the (A) Haemophilus parasuis and (B) Pasteurella multocida plasmids carrying the aacA-aphD and aphA3 genes compared to Staphylococcus aureus plasmids. The light-blue boxes represent the genes on the plasmids. The red and blue bars represent the homologous regions in the forward and inverted directions, respectively. The plasmids carrying the homologous genes are shown below the pairwise comparison. (C ) Average GC-content of the host chromosomes.
Distribution of Inc types and communities.
| Inc type | Community | |||||||
|---|---|---|---|---|---|---|---|---|
| co11 | co18 | co22 | co29 | co64 | co68 | co192 | co193 | |
| A/C | - | - | 25 | - | - | 2 | - | - |
| B/O | - | 3 | - | - | 1 | 1 | - | - |
| FIA | - | 5 | - | - | - | 3 | - | - |
| FIB | - | 5 | - | - | - | - | - | - |
| FIB-M | - | - | - | - | - | 1 | - | - |
| FII | - | 60 | - | - | 1 | 3 | - | - |
| FIIk | - | - | - | - | - | 17 | - | - |
| FIIS | - | 23 | - | 27 | - | 3 | 1 | - |
| HI1 | - | 1 | - | - | - | 1 | - | - |
| HI2 | - | - | - | - | - | 5 | - | - |
| HIB-M | 3 | - | - | - | - | - | - | - |
| I1 | - | - | - | - | 28 | - | - | - |
| K | - | 1 | - | - | 3 | - | - | - |
| L/M | - | - | - | - | - | 7 | - | - |
| N | - | - | - | - | - | 26 | - | - |
| P | - | - | - | - | - | - | - | 7 |
| R | - | - | 2 | - | - | 3 | - | - |
| T | - | 1 | - | - | - | 1 | - | - |
| U | - | - | - | - | - | 3 | - | - |
| W | - | - | 2 | - | - | 3 | - | - |
| X1 | - | 1 | - | 7 | - | - | - | - |
| X2 | - | - | - | - | - | 1 | - | - |
| Not Assigned | 1 | 13 | 83 | 37 | 3 | 144 | 2 | 10 |
| Total | 4 | 113 | 112 | 71 | 36 | 224 | 3 | 17 |
Shaded communities showed significant over-representation with p-value < 0.001 by chi-square test.
Figure 8Components of the major Inc-type plasmids. The components of the communities of the major Inc-type plasmids and the bacterial components for each community are shown (Table 3).
Figure 9Components of the communities with AMR genes. (A) The CTX-M-1, -2 and -9 subgroups and the community and bacterial components. (B) KPC carbapenemase and the bacterial components. The plasmids carrying KPC are only found in co68. (C) The communities and bacterial components of the plasmids carrying the NDM-1 gene (Table 4).
Distribution of Inc types, antimicrobial resistance genes, and communities.
| Inc type | CTX-M | KPC | NDM-1 | ||||||
|---|---|---|---|---|---|---|---|---|---|
| M-1 | M-2 | M-9 | 2 | 3 | 4 | 10 | co22 | co68 | |
| FII | 7 | 8 | 1 | ||||||
| FIIk | 4 | 5 | 2 | 1 | |||||
| N | 2 | 1 | 3 | 1 | 1 | 1 | |||
| A/C | 1 | 6 | |||||||
| L/M | 2 | 1 | 2 | ||||||
| I1 | 3 | ||||||||
| FIB-M | 1 | 1 | |||||||
| FIA | 1 | ||||||||
| FIIS | 1 | ||||||||
| HI1 | 1 | ||||||||
| HI2 | 1 | ||||||||
| K | 1 | ||||||||
| R | 1 | ||||||||
| Not Assigned | 5 | 4 | 4 | 4 | 1 | 1 | 12 | ||
| Total | 26 | 5 | 18 | 11 | 4 | 2 | 1 | 6 | 18 |