| Literature DB >> 24731071 |
Jean-Pierre Flandrois1, Gérard Lina, Oana Dumitrescu.
Abstract
BACKGROUND: Tuberculosis is an infectious bacterial disease caused by Mycobacterium tuberculosis. It remains a major health threat, killing over one million people every year worldwide. An early antibiotic therapy is the basis of the treatment, and the emergence and spread of multidrug and extensively drug-resistant mutant strains raise significant challenges. As these bacteria grow very slowly, drug resistance mutations are currently detected using molecular biology techniques. Resistance mutations are identified by sequencing the resistance-linked genes followed by a comparison with the literature data. The only online database is the TB Drug Resistance Mutation database (TBDReaM database); however, it requires mutation detection before use, and its interrogation is complex due to its loose syntax and grammar. DESCRIPTION: The MUBII-TB-DB database is a simple, highly structured text-based database that contains a set of Mycobacterium tuberculosis mutations (DNA and proteins) occurring at seven loci: rpoB, pncA, katG; mabA(fabG1)-inhA, gyrA, gyrB, and rrs. Resistance mutation data were extracted after the systematic review of MEDLINE referenced publications before March 2013. MUBII analyzes the query sequence obtained by PCR-sequencing using two parallel strategies: i) a BLAST search against a set of previously reconstructed mutated sequences and ii) the alignment of the query sequences (DNA and its protein translation) with the wild-type sequences. The post-treatment includes the extraction of the aligned sequences together with their descriptors (position and nature of mutations). The whole procedure is performed using the internet. The results are graphs (alignments) and text (description of the mutation, therapeutic significance). The system is quick and easy to use, even for technicians without bioinformatics training.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24731071 PMCID: PMC4021062 DOI: 10.1186/1471-2105-15-107
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Study selection process and reasons for the exclusion of studies.
List of the reference gene sequences of
| [AL123456] REGION: 759808 to 763325 | |
| [AL123456] REGION: 2288681 to 2289280 | |
| promoter | [AL123456] REGION: 1673325 to 1673439 |
| [AL123456] REGION: 1674202 to 1675011 | |
| [AL123456] REGION: 2153889 to 2156111 identical to [×68081.1] | |
| [AL123456] REGION: 7302 to 9818 | |
| [AL123456] REGION: 5123 to 7267 | |
| [AL123456] REGION: 1471846 to 1473762 identical to [NR_102810] |
Example of a mutation database: a part of the B mutation database for
| A1291G ~ S431G ~ ~ | Nucleotide chain: A at position 1291 is replaced by G; Protein chain: S at position 431 is replaced by G |
| +1300TTC ~ +434 F ~ ~ | Insertion of TTC at position 1300 (F at position 434 of the protein) |
| CAG1306- ~ Q436- ~ ~ | Deletion of CAG at position 1306 (deletion of Q at position 436 of the protein) |
| C1331T ~ T4441 ~ High Confidence ~ See mutDB2012 | Example with a note and a remark |
| C1294T ~ Q432- ~ Rare ~ RULE::STOPCODON | Example with a note and a rule |
| CATGGACCAGAA1299- ~ MDQN434- ~ ~RULE::AMBIGUOUS= > N_ATGGACCAGAAC1300- | Example of a deletion occurring within a repeated zone. The position 1299 deduced from the nucleotide-level sequence alignment is corrected in the results |
Figure 2MUBII general organization. The MUBII-TB-DB database and the de novo constructed mutation database (orange frame) are built before the query session. The mutation database is used for the results interpretation and for the construction of the BLAST mutation database. Ovals indicate the use of external software (dark green) or of MUBII routines (pale green). The query analysis process (green frame) combines the BLAST result and the expertise of the alignment using the mutation database. Outputs are in blue.
Figure 3Screen capture of an B query: DNA alignment and detected mutations. The figure shows an example of the detection of a mutation. The whole nucleotide alignment can be observed by horizontally scrolling the alignment window. In the case of the rpoB gene mutation, the nucleotide positions in M. tuberculosis and E. coli are indicated. The protein scheme compares the wild-type and query sequences for a given position. This scheme describes the changes along the protein sequence and emphasizes the effect of the mutation on the protein chain.
Figure 4Protein alignment and detected mutations. A shows the detection of a mutation. B shows frameshift creation. C shows stop-codon creation. The whole alignment can be observed by horizontally scrolling the alignment window. These representations emphasize the effect of the mutation on the protein chain to provide solid understanding in terms of establishing therapy and monitoring.