| Literature DB >> 23773324 |
Sarbashis Das1, Tanmoy Roychowdhury, Parameet Kumar, Anil Kumar, Priya Kalra, Jitendra Singh, Sarman Singh, H K Prasad, Alok Bhattacharya.
Abstract
BACKGROUND: Tuberculosis remains a major public health problem. Clinical tuberculosis manifests often as pulmonary and occasionally as extra-pulmonary tuberculosis. The emergence of drug resistant tubercle bacilli and its association with HIV is a formidable challenge to curb the spread of tuberculosis. There have been concerted efforts by whole genome sequencing and bioinformatics analysis to identify genomic patterns and to establish a relationship between the genotype of the organism and clinical manifestation of tuberculosis. Extra-pulmonary TB constitutes 15-20 percent of the total clinical cases of tuberculosis reported among immunocompetent patients, whereas among HIV patients the incidence is more than 50 percent. Genomic analysis of M. tuberculosis isolates from extra pulmonary patients has not been explored.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23773324 PMCID: PMC3699378 DOI: 10.1186/1471-2164-14-404
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Description of clinical isolates used in the study and basic statistics of the sequencing data
| Isolation sites | CSFa | CSFa | FNACb | FNACb | Lymph node from biopsy |
| Total reads | 3611330*2 | 3045593*2 | 13609758*2 | 7100266*2 | 3072728*2 |
| Average read length | 72 | 72 | 72 | 72 | 72 |
| Total reads aligned after filtering | 5572900 | 4656246 | 21378146 | 11116118 | 4672838 |
| Total reference length | 4411532 | 4411532 | 4411532 | 4411532 | 4411532 |
| % Total reference covered | 88.42 | 91.51 | 90.17 | 81.87 | 86.36 |
| % Reads aligned with reference | 96.32 | 83.78 | 95.81 | 94.82 | 96.72 |
| Optimized average read depth | 72.97 | 53.03 | 287.71 | 148.08 | 60.41 |
a Cerebro-spinal fluid.
bLymph node using Fine-Needle Aspiration Cytology.
Spoligotype patterns of the five extra-pulmonary clinical isolates
| 1 | LN 8 | □□□■■■□■■■■■■■■■■■■■■■■■■■■■□□□□■□■■□□□■■■■ | 73777777413071 | Unidentified | Unidentified |
| 2 | AC 74 | ■□□□□□□■■■■■■■■■■■■■□□□■■■■■□■■■□□□□■■■■■■■ | 403777617560771 | Unidentified | Unidentified |
| 3 | AC 544 | ■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■□□□□■■■■■■■ | 777777777760771 | 53 | T1 |
| 4 | F 85 | □□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□■■■■■■■■■ | 3771 | 1 | BEIJING |
| 5 | F99 | ■■■■■■■■■■■■■■■■■■■■■■■■■□■□□□□■□□□□■■■■■■■ | 777777775020771 | Unidentified | Unidentified |
Figure 1Common and unique SNVs present in the coding regions of the five isolates. LN8 has highest number of unique SNVs.
Figure 2Plot showing RSCU (Relative Synonymous Codon Usages) of isolate AC544 versus background (H37Rv). If RSCU values in an isolate and background are the same then it will lie above the diagonal (red line).
Figure 3Examples of major and rare variations are shown from isolates F85. (A) Major variation changes C to T (B) rare variation changes C to T.
Figure 4Plot showing percent variation versus read depth in isolate F85. Variations below 15% are likely to be due to sequencing error.
Figure 5Distribution of consensus base ratio in the five isolates and in simulated data. Blue line and red line represent data from isolates and simulation respectively.
Figure 6COG classification of the genes carrying SNVs in the five isolates. Different COG categories are represented by X-axis. Y-axis represents fraction of SNVs in each category after normalization.
Primer sequences and standardized PCR parameters
| A | 3480373, + strand, in AC74 | 5’-CAGCAGGGATGGATTCACC-3’5’-GCGAGTGGGATTCAGAGAG-3’ | 62°C/1mM | 1836 | 481 | |||
| B | 1527976, + strand, in AC544 | 5’-GTGGTGCCGTCGTTGTCTC-3’5’-ATGCCCGTAATGTCTGCTGG-3’ | 60.2°C/1.5mM | 1632 | 277 | |||
| C | 1356494, + strand, in AC741356495, + strand, in F85 | 5’-TTCGTCTCAATGGGCAACTAC-3’5’-CACCAGGCACTTCGTTATCG-3’ | 60.2°C/2.5mM | 1552 | 197 | |||
| D | 2550014-2551368, + strand, in H37Rv | 5’-CTGCGACTGCGTTGGTAATC-3’5’-ACGGTGGGGAAAGCCTGAAG-3’ | 61.3°C/1mM | 1563 | 205 | |||
| E | 1541952-1543306, - strand, in H37Rv | 5’-AGCAGGAGGAGCGGGACG-3’5’-GGATAACAGGCGCGAACCG-3’ | 72°C/1mM | 1622 | 264 | |||
| F | 1657017, + strand, in F85 | 5’-TCCCTACACTCGGTTCATCC-3’5’-ACAGCAGCAGCGCCACGG-3’ | 61.3°C/1mM | 1552 | 197 | |||
| G | 2229658, - strand, in F99 | 5’-TGGTGGTCAGGGAAAAGCC-3’5’-CCCTCCCGTAGCAGCCGC-3’ | 61.3°C/1.5mM | 1617 | 262 | |||
| H (IS6110ref) | Multiple copies present in all | 5’-CGTGAGGGCATCGAGGTGGC-3’5’-GCGTAGGCGTCGGTGACAAA-3’ | 65°C/1mM | 245 | 0 | |||
[Ref: Hermans P.W.M., van Soolingen D., Dale J.W. et al., Insertion element IS986 from Mycobacterium tuberculosis: a Useful tool in Diagnosis and Epidemiology of tuberculosis. 1990 J. Clin. Microbiol. 28(9):2051-2058].
Figure 7Analysis of PCR products derived from Primer pairs (Panels A-H) detailed in Table3using agarose gel electrophoresis. Panels A, B and C represent specific coordinates of IS element insertion in the isolates, whereas panels D and E represent those from where IS element has been translocated in the isolates in comparison to the reference strain H37Rv. Panel H (IS6110 internal primers) confirms the presence of IS6110 element in all the strains. Co-ordinates for IS element insertion/deletion and expected amplicon size in the respective strains are detailed in Table 3. Lane 1: Negative control; Lane 2: 100-3000 bp DNA Ladder; Lane 3–8: H37Rv DNA (Lane 3) and Triton extracts of isolates (Lane 4:AC74; Lane 5:AC544; Lane 6:LN8; Lane 7: F85; Lane 8: F99). [Lane 2: DNA Ladder; with IS element; : without IS element].
Figure 8Phylogenetic relationship. SNVs were used to derive the tree using “Neighbour Joining” method.
Figure 9Cartoon representation showing partitioning of the H37Rv and CDC1551 genomes used in generating simulated reads.
Figure 10Cartoon representation of reads aligned to the 5’ and 3’ ends of IS6110. (A) Read aligned to 5’ and 3’ of IS6110 as expected. (B) BLAT alignment of fragmented reads in the ends.