| Literature DB >> 24408216 |
Kandarp Rakeshkumar Joshi1, Heena Dhiman, Vinod Scaria.
Abstract
Mycobacterium tuberculosis, along with closely related species, commonly known as M. tuberculosis complex (MTBC), causes tuberculosis in humans and other organisms. Tuberculosis is a disease with high morbidity and mortality, especially in the third world. The genetic variability between clinical isolates of MTBC has been poorly understood, although recent years have seen the re-sequencing of a large number of clinical isolates of MTBC from around the world. The availability of genomic data of multiple isolates in public domain would potentially offer a unique opportunity toward understanding the variome of the organism and the functional consequences of the variations. This nevertheless has been limited by the lack of systematic curation and analysis of data sets available in public domain. In this report, we have re-analyzed re-sequencing data sets corresponding to >450 isolates of MTBC available in public domain to create a comprehensive variome map of MTBC comprising >29 000 single nucleotide variations. Using a systematic computational pipeline, we have annotated potential functional variants and drug-resistance-associated variants from the variome. We have made available this data set as a searchable database. Apart from a user-friendly interface, the database also has a novel option to annotate variants from clinical re-sequencing data sets of MTBC. To the best of our knowledge, tbvar is the largest and most comprehensive genome variation resources for MTBC. Database URL: http://genome.igib.res.in/tbvar/Entities:
Mesh:
Year: 2014 PMID: 24408216 PMCID: PMC3885892 DOI: 10.1093/database/bat083
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Workflow illustrating data sets and methodologies applied in building tbvar.
Figure 2.Comparison of variations with other resources and their mapping to different regions of the genome. (A) Comparison of the variations in M. tuberculosis with respect to other variation resources. (B) Graphical representation showing distribution of SNPs in various loci of the M. tuberculosis reference genome.
Figure 3.Variations plotted across subset of the genomes. Each bin of five samples was randomly chosen for 1000 iterations, and novel SNVs were identified. This number for each bin was averaged, and a box plot was plotted. The 95 percentile and 5 percentile form upper and lower boxes, while upper and lower error bars indicate maximum and minimum, respectively. The blue line passing through the boxes indicates average per bin of samples.
Figure 4.Screenshot showing result table and information or each section of database and the respective tabs for each of the sections.
‘annoTB’ variants annotation report
| Total variants | Known variants | Novel variants | Drug-resistant variants | Genic variants |
|---|---|---|---|---|
| 1454 | 1304 | 150 | 4 | 1144 |
aTotal drug-resistant variants present in the OSDD271 strain. The resistant variants are against ethambutol (3) and fluoroquinolone (1) antibiotics.