| Literature DB >> 27114869 |
Olabisi Oluwabukola Coker1, Angkana Chaiprasert1, Chumpol Ngamphiw2, Sissades Tongsima2, Sanjib Mani Regmi3, Taane G Clark4, Rick Twee Hee Ong5, Yik-Ying Teo5, Therdsak Prammananan6, Prasit Palittapongarnpim7.
Abstract
Genome sequencing plays a key role in understanding the genetic diversity of Mycobacterium tuberculosis (M.tb). The genotype-specific character of M. tb contributes to tuberculosis severity and emergence of drug resistance. Strains of M. tb complex can be classified into seven lineages. The Nonthaburi (NB) genotype, belonging to the Indo-Oceanic lineage (lineage 1), has a unique spoligotype and IS6110-RFLP pattern but has not previously undergone a detailed whole genome analysis. In addition, there is not much information available on the whole genome analysis of M. tb isolates from tuberculous meningitis (TBM) patients in public databases. Isolates CSF3053, 46-5069 and 43-13838 of NB genotype were obtained from the cerebrospinal fluids of TBM Thai patients in Siriraj Hospital, Bangkok. The whole genomes were subjected to high throughput sequencing. The sequence data of each isolate were assembled into draft genome. The sequences were also aligned to reference genome, to determine genomic variations. Single nucleotide polymorphisms (SNPs) were obtained and grouped according to the functions of the genes containing them. They were compared with SNPs from 1,601 genomes, representing the seven lineages of M. tb complex, to determine the uniqueness of NB genotype. Susceptibility to first-line, second-line and other antituberculosis drugs were determined and related to the SNPs previously reported in drug-resistant related genes. The assembled genomes have an average size of 4,364,461 bp, 4,154 genes, 48 RNAs and 64 pseudogenes. A 500 base pairs deletion, which includes ppe50, was found in all isolates. RD239, specific for members of Indo Oceanic lineage, and RD147c were identified. A total of 2,202 SNPs were common to the isolates and used to classify the NB strains as members of sublineage 1.2.1. Compared with 1,601 genomes from the seven lineages of M. tb complex, mutation G2342203C was found novel to the isolates in this study. Three mutations (T28910C, C1180580T and C152178T) were found only in Thai NB isolates, including isolates from previous study. Although drug susceptibility tests indicated pan-susceptibility, non-synonymous SNPs previously reported to be associated with resistance to anti-tuberculous drugs; isoniazid, ethambutol, and ethionamide were identified in all the isolates. Non-synonymous SNPs were found in virulence genes such as the genes playing roles in apoptosis inhibition and phagosome arrest. We also report polymorphisms in essential genes, efflux pumps associated genes and genes with known epitopes. The analysis of the TBM isolates and the availability of the variations obtained will provide additional resources for global comparison of isolates from pulmonary tuberculosis and TBM. It will also contribute to the richness of genomic databases towards the prediction of antibiotic resistance, level of virulence and of origin of infection.Entities:
Keywords: Draft genome; Genetic signatures; M. tuberculosis Nonthaburi genotype; Meningitis; Thailand; Whole genome sequence
Year: 2016 PMID: 27114869 PMCID: PMC4841212 DOI: 10.7717/peerj.1905
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Statistics of whole genome sequencing, genome assembly and annotation.
Gross statistics of the whole genome sequence data, mapping of reads, assembly of draft genome and annotation for isolates CSF-3053, 46-5069 and 43-13838. Length of reference genome (M. tuberculosis H37Rv, NC_000962.3) is 4,411,532 base pairs.
| Isolate | Total reads | % of reads mapped to reference | % of Reference covered | Number of contigs | Fold coverage of positions in the genome | GC content (%) | Number of predicted Genes | No. of predicted RNA genes | No. of predicted pseudo genes | |
|---|---|---|---|---|---|---|---|---|---|---|
| CSF-3053 | 50,004,564 | 99.96 | 99.78 | 159 | 69,028 | 1329.0 | 65.5 | 4153 | 48 | 62 |
| 46-5069 | 44,478,206 | 98.67 | 99.82 | 173 | 63,852 | 920 | 65.5 | 4159 | 48 | 63 |
| 43-13838 | 40,767,970 | 98.69 | 99.80 | 177 | 63,019 | 920 | 65.5 | 4150 | 48 | 67 |
Notes.
guanine/cytocine
Figure 1Distribution of single nucleotide polymorphisms in isolates CSF3053, 46-5069 and 43-13838.
Venn diagram showing the distribution of the single nucleotide polymorphisms (SNPs) observed in isolates CSF-3053 (blue), 46-5069 (red) and 43-13838 (yellow). CSF3053, 46-5069 and 43-13838 have 10, 7 and 49 unique SNPs respectively. 43-13838 and CSF3053 have 23 SNPs in common, CSF3053 and 46-5069 have 99 SNPs in common, while 43-13838 and 46-5069 have 14 SNPs in common. 2,202 SNPs are common to all isolates.
Regions of deletion and affected open reading frames found in isolates CSF-3053, 46-5069 and 43-13838.
All regions were confirmed by PCR reaction as described in methods.
| Region in reference genome (H37Rv, NC_000962.3) | Length | Region of difference | Open reading frame (ORF) affected |
|---|---|---|---|
| 1718912–1721213 | 2302 | RD147c | |
| 3501225–3501723 | 499 | This study | |
| 4092082–4092921 | 840 | RD239 |
Common SNPs found in drug resistance related genes in isolates CSF-3053, 46-5069 and 43-13838.
The reference genome positions, nucleotide change, amino acid change and effect of single nucleotide polymorphisms in drug resistance related genes that are common to isolates CSF3053, 46-5069 and 43-13838. The protein variation was determined by Protein Variation Effect Analyzer (PROVEAN), a web based protein variation analysis tool (Choi et al., 2012).
| Position in reference genome (H37Rv, NC_000962.3) | Nucleotide change | Amino acid change | Protein variation effect | Gene | Associated drug | References |
|---|---|---|---|---|---|---|
| 6112 | G>C | Met291Ile | Deleterious | Quinolones | ||
| 7362 | G>C | Glu21Gln | Neutral | Quinolones | ||
| 7585 | G>C | Ser95Thr | Neutral | Quinolones | ||
| 8452 | C>T | Ala384Val | Deleterious | Quinolones | ||
| 9143 | T>C | Ile614Ile | Quinolones | |||
| 9260 | G>C | Leu653Leu | Quinolones | |||
| 9304 | G>A | Gly668Asp (N) | Neutral | Quinolones | ||
| 412280 | T>G | His481Gln | Neutral | Ethambutol | ||
| 575368 | T>C | Asp7Asp | Isoniazid/Ethionamide | |||
| 763031 | T>C | Ala1081Ala | Rifampicin | |||
| 763531 | G>C | Pro54Pro | Rifampicin | |||
| 763884 | C>T | Ala172Val | Neutral | Rifampicin | ||
| 763886 | C>A | Arg173Arg | Rifampicin | |||
| 1406312 | A>G | His343His | Streptomycin | |||
| 1417019 | C>T | Cys110Tyr | Deleterious | Ethambutol | ||
| 1674162 | C>T | Gly241Gly | Isoniazid | |||
| 1792777 | T>C | Ile322Val | Neutral | Isoniazid | ||
| 1792778 | T>C | Glu321Glu | Isoniazid | |||
| 2154724 | C>A | Arg463Leu | Neutral | Isoniazid | ||
| 2518132 | C>T | Thr6Thr | Isoniazid | |||
| 2519048 | G>A | Gly312Ser | Neutral | Isoniazid | ||
| 2521342 | T>C | Asp200Asp | Isoniazid | |||
| 3154414 | A>G | Ile73Thr | Neutral | Isoniazid | ||
| 3571834 | T>G | Gln237Pro | Neutral | Isoniazid/Ethionamide | ||
| 3647041 | A>G | Ser257Pro | Neutral | Ethambutol | ||
| 3647591 | A>G | Asn73Asn | Ethambutol | |||
| 4049254 | G>A | Leu243Leu | Para-aminosalicylic acid | |||
| 4240671 | C>T | Thr270Ile | Neutral | Ethambutol | ||
| 4241042 | A>G | Asn394Asp | Deleterious | Ethambutol | ||
| 4242643 | C>T | Arg927Arg | Ethambutol | |||
| 4243580 | G>A | Val116Val | Ethambutol | |||
| 4244420 | G>C | Val396Val | Ethambutol | |||
| 4245969 | C>T | Pro913Ser | Deleterious | Ethambutol | ||
| 4247578 | G>A | Leu355Leu | Ethambutol | |||
| 4247646 | A>C | Glu378Ala | Neutral | Ethambutol | ||
| 4407588 | T>C | Ala205Ala | Streptomycin | |||
| 4407873 | C>A | Val110Val | Streptomycin |