| Literature DB >> 23990995 |
Kei-ichi Uchiya1, Hiroyasu Takahashi, Tetsuya Yagi, Makoto Moriyama, Takayuki Inagaki, Kazuya Ichikawa, Taku Nakagawa, Toshiaki Nikai, Kenji Ogawa.
Abstract
Mycobacterium avium complex (MAC) infection causes disseminated disease in immunocompromised hosts, such as human immunodeficiency virus (HIV)-positive patients, and pulmonary disease in persons without systemic immunosuppression, which has been increasing in many countries. In Japan, the incidence of pulmonary MAC disease caused by M. avium is about 7 times higher than that caused by M. intracellulare. To explore the bacterial factors that affect the pathological state of MAC disease caused by M. avium, we determined the complete genome sequence of the previously unreported M. avium subsp. hominissuis strain TH135 isolated from a HIV-negative patient with pulmonary MAC disease and compared it with the known genomic sequence of M. avium strain 104 derived from an acquired immunodeficiency syndrome patient with MAC disease. The genome of strain TH135 consists of a 4,951,217-bp circular chromosome with 4,636 coding sequences. Comparative analysis revealed that 4,012 genes are shared between the two strains, and strains TH135 and 104 have 624 and 1,108 unique genes, respectively. Many strain-specific regions including virulence-associated genes were found in genomes of both strains, and except for some regions, the G+C content in the specific regions was low compared with the mean G+C content of the corresponding chromosome. Screening of clinical isolates for genes located in the strain-specific regions revealed that the detection rates of strain TH135-specific genes were relatively high in specimens isolated from pulmonary MAC disease patients, while, those of strain 104-specific genes were relatively high in those from HIV-positive patients. Collectively, M. avium strains that cause pulmonary and disseminated disease possess genetically distinct features, and it suggests that the acquisition of specific genes during strain evolution has played an important role in the pathological manifestations of MAC disease.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23990995 PMCID: PMC3749206 DOI: 10.1371/journal.pone.0071831
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
General features of the M. avium strain TH135 genome and comparison with M. avium strain 104.
| Property | Strain TH135 | Strain 104 |
| Genome size, bp | 4,951,217 | 5,475,491 |
| G+C content, % | 69.3 | 69.0 |
| Protein coding, % | 92.5 | 88.6 |
| ORFs | 4,636 | 5,120 |
| Average gene length, bp | 984 | 948 |
| tRNAs | 46 | 46 |
| rRNA operons (16S-23S-5S) | 1 | 1 |
| Prophage elements, no. | 0 | 1 |
| IS, total no. of copies | 30 | 129 |
Figure 1Circular representation of the M. avium strain TH135 genome.
The scale is shown in base pairs (Mb), with zero representing the location of the dnaA gene. From the outside to the inside, the outer two circles show forward- and reverse-strand coding sequences (CDS), respectively. The third and fourth circles show rRNA operons and tRNA genes, respectively. The fifth circle shows the percentage of G+C in relation to the mean G+C of the chromosome. The sixth circle shows the GC skew (G – C)/(G+C). The color of each CDS was assigned according to the cluster of orthologous groups (COG) functional classification system [31]. The color of each COG family is shown in the figure.
Figure 2Whole genome alignment and G+C content of M. avium strain TH135 (upper) and 104 (lower).
Representation of five local collinear blocks (LCBs) between chromosomal sequences of each strain generated by Mauve alignment software. The connecting lines between blocks indicate the location of each block in each genome. Each colored block represents a homologous region. LCBs drawn horizontally represent homology in the reverse strand of the chromosome. Gaps or white spaces within LCBs indicate the presence of strain-specific sequences. Specific region (SR)-1 to SR-10 in strain TH135 and SR-11 to SR-21 in strain 104 show strain-specific regions of over 10,000 bp in length.
Figure 3Venn diagram of M. avium strains TH135 and 104 genomes and comparison of COG functional categories between strain TH135- and 104-specific genes.
(A) Venn diagram showing the distribution of shared orthologues and strain-specific genes between M. avium strains TH135 and 104. The Venn diagram was created with IMCGE software. Comparative analysis revealed that 4,012 genes are shared between the two strains, and 624 genes and 1,108 genes are unique to strains TH135 and 104, respectively. (B) Classification of specific genes of each strain based on COG functional categories. Each colored segment indicates the relative contribution of a functional category as a percentage of total COGs. The color of each COG family is shown in Figure 1.
Candidates of strain-specific virulence factors in the strain TH135 genome.
| Region | Location (position) | Locus tag | Description | Species with most similar sequence | aa sequence identity (%) |
| SR-1 | 20756–23854 | MAH _0016 | MmpL family protein |
| 97 |
| SR-2 | 779143–782040 | MAH _0778 | MmpL5 protein |
| 96 |
| 801455–802618 | MAH _0796 | virulence factor Mce family protein |
| 68 | |
| 802615–803649 | MAH _0797 | virulence factor Mce family protein |
| 74 | |
| 803642–804766 | MAH _0798 | virulence factor Mce family protein |
| 73 | |
| 804763–806271 | MAH _0799 | virulence factor Mce family protein |
| 70 | |
| 806268–807449 | MAH _0800 | virulence factor Mce family protein |
| 68 | |
| 807449–809221 | MAH _0801 | virulence factor Mce family protein |
| 59 | |
| SR-3 | 1011402–1012598 | MAH _1006 | PPE family protein |
| 100 |
| SR-10 | 4792912–4793808 | MAH _4495 | Mn2+ dependent Catalase |
| 99 |
| 4799776–4800207 | MAH _4505 | MmpS4_1 protein |
| 99 | |
| 4800204–4803122 | MAH _4506 | MmpL5_5 protein |
| 99 | |
| Other | 167053–168243 | MAH _0168 | PPE family protein |
| 100 |
| 597293–599017 | MAH _0587 | virulence factor Mce4F protein |
| 100 | |
| 1754780–1756012 | MAH _1657 | PPE family protein |
| 100 | |
| 2061187–2062653 | MAH _1946 | PPE family protein |
| 99 | |
| 3644765–3647596 | MAH _3375 | MmpL6 protein |
| 67 | |
| 4561515–4565480 | MAH _4283 | Type VII system protein EccC3 |
| 99 | |
| 4914393–4918604 | MAH _4605 | Type VII system protein EccC2 |
| 99 |
Regions SR-1 to SR-10 indicate strain TH135-specific regions, as shown in Figure 2.
Candidates of strain-specific virulence factors in the strain 104 genome.
| Region | Location (position) | Locus tag | Description | Species with most similar sequence | aa sequence identity (%) |
| SR-14 | 753228–754160 | MAV_0790 | PPE family protein |
| 52 |
| 786286–787950 | MAV_0828 | 17 kDa surface antigen family protein |
| 97 | |
| SR-16 | 1802421–1803848 | MAV_1807 | virulence factor Mce family protein |
| 73 |
| SR-19 | 2566171–2567511 | MAV_2532 | virulence factor Mce family protein |
| 97 |
| 2567550–2568578 | MAV_2533 | virulence factor Mce family protein |
| 99 | |
| 2568575–2569900 | MAV_2534 | virulence factor Mce family protein |
| 99 | |
| 2569924–2571243 | MAV_2535 | virulence factor Mce family protein |
| 99 | |
| 2571240–2572400 | MAV_2536 | virulence factor Mce family protein |
| 99 | |
| 2572402–2573877 | MAV_2537 | virulence factor Mce family protein |
| 99 | |
| SR-21 | 5193791–5194750 | MAV_5047 | putative Mce family protein |
| 70 |
| 5194747–5195673 | MAV_5048 | putative Mce family protein |
| 72 | |
| 5195745–5196863 | MAV_5049 | putative Mce family protein |
| 67 | |
| 5196863–5197834 | MAV_5050 | putative Mce family protein |
| 77 | |
| 5197822–5198829 | MAV_5051 | putative Mce family protein |
| 76 | |
| 5198826–5199770 | MAV_5052 | putative Mce family protein |
| 73 | |
| Other | 889775–891037 | MAV_0948 | virulence factor Mce family protein |
| 99 |
| 891034–892056 | MAV_0949 | virulence factor Mce family protein |
| 99 | |
| 892056–893174 | MAV_0950 | virulence factor Mce family protein |
| 100 | |
| 893171–894598 | MAV_0951 | virulence factor Mce family protein |
| 99 | |
| 895904–897631 | MAV_0953 | virulence factor Mce family protein |
| 99 | |
| 114858–115163 | MAV_0117 | PE family protein |
| 99 | |
| 115166–116839 | MAV_0118 | PPE family protein |
| 97 | |
| 1304443–1305594 | MAV_1347 | PPE family protein |
| 99 |
Regions SR-14 to SR-21 indicate strain 104-specific regions, as shown in Figure 2.
The presence of strain TH135- and 104-specific genes in M. avium isolates.
| Strain | Region | GC% | Strain-specific gene (Locus tag) | TH | HIV | P value |
| TH135 | SR-1 | 66.0 | MAH_0016 | 1 (2.9%) | 0 (0%) | 1.000 |
| SR-2 | 63.3 | MAH_0798 | 13 (37.1%) | 5 (17.2%) | 0.099 | |
| SR-3 | 70.2 | MAH_1001 | 33 (94.3%) | 18 (62.1%) | 0.002 | |
| SR-5 | 64.1 | MAH_2592 | 10 (28.6%) | 0 (0%) | 0.001 | |
| SR-9 | 64.6 | MAH_3208 | 13 (37.1%) | 6 (20.7%) | 0.174 | |
| SR-10 | 68.2 | MAH_4506 | 18 (51.4%) | 7 (24.1%) | 0.039 | |
| 104 | SR-11 | 68.9 | MAV_0264 | 0 (0%) | 1 (3.4%) | 0.453 |
| SR-12 | 65.9 | MAV_0482 | 0 (0%) | 1 (3.4%) | 0.453 | |
| SR-14 | 66.8 | MAV_0828 | 0 (0%) | 2 (6.9%) | 0.201 | |
| SR-16 | 65.2 | MAV_1807 | 0 (0%) | 6 (20.7%) | 0.006 | |
| SR-19 | 65.0 | MAV_2532 | 2 (5.7%) | 4 (13.8%) | 0.398 | |
| SR-21 | 64.1 | MAV_5049 | 8 (22.9%) | 9 (31.0%) | 0.573 |
Lanes SR-1 to SR-10 in strain TH135 and lanes SR-11 to SR-21 in strain 104 indicate the.
strain-specific regions shown in Figure 2.
GC% in each region.
Strains from the sputa of patients with pulmonary MAC disease.
Strains from the blood of HIV-positive patients with disseminated MAC disease.