| Literature DB >> 27363525 |
Solomon A Yimer1,2, Amine Namouchi3, Ephrem Debebe Zegeye4, Carol Holm-Hansen5, Gunnstein Norheim5, Markos Abebe6, Abraham Aseffa6, Tone Tønjum3,7.
Abstract
BACKGROUND: A deeply rooted phylogenetic lineage of Mycobacterium tuberculosis (M. tuberculosis) termed lineage 7 was discovered in Ethiopia. Whole genome sequencing of 30 lineage 7 strains from patients in Ethiopia was performed. Intra-lineage genome variation was defined and unique characteristics identified with a focus on genes involved in DNA repair, recombination and replication (3R genes).Entities:
Keywords: 3R genes; Amhara Region; Ethiopia; Lineage 7; Mutations; Mycobacterium tuberculosis; Single nucleotide polymorphism; Whole genome sequencing
Mesh:
Year: 2016 PMID: 27363525 PMCID: PMC4929747 DOI: 10.1186/s12862-016-0715-z
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Fig. 1a Maximum likelihood (ML) tree including strains belonging to all MTBC lineages. The ML tree is based on all identified polymorphic sites. b ML tree of lineage 7 isolates included in this study
Fig. 2a Distribution of all identified SNPs in lineage 7 strains based on the Cluster of Orthologous Classification (COG). The green color defines the synonymous SNPs and the red color the non-synonymous SNPs. b Similarity matrix based on whole genome analysis. The percentages indicated in each box correspond to the identity between two isolates at the genomic level. Letter codings are described as follows: [C] Energy production and conversion, [D] Cell cycle control, cell division, chromosome partitioning, [E] Amino acid transport and metabolism, [F] Nucleotide transport and metabolism, [G] Carbohydrate transport and metabolism, [H] Coenzyme transport and metabolism, [I] Lipid transport and metabolism, [K] Transcription, [L] Replication, recombination and repair, [M] Cell wall/membrane/envelope biogenesis, [N] Cell motility, [O] Post-translational modification, protein turnover, and chaperones, [Q] Secondary metabolites biosynthesis, transport, and catabolism, [T] Signal transduction mechanisms, [U] Intracellular trafficking, secretion, and vesicular transport, [V] Defense mechanisms
Distribution of SNPs according to the Clusters of Orthologous Groups (COG) classification. The binomial test was calculated using Excel as follows: BINOMIAL DIST (Number of SNPs, Total number of SNPs, Gene length/Total gene length, 1)
| Cluster of Orthologous Categories | COG codes | Nb of nsSNPs | Nb of sSNPs | Total nb of SNP | Nb of genes | Total gene length | BINOMIAL TEST |
|---|---|---|---|---|---|---|---|
| Secondary metabolites biosynthesis, transport and catabolism | Q | 169 | 111 | 280 | 85 | 186303 | 0.016339104 |
| Energy production and conversion | C | 128 | 77 | 205 | 104 | 141315 | 0.00956263 |
| Lipid transport and metabolism | I | 110 | 75 | 185 | 86 | 108423 | 0.551376551 |
| Amino acid transport and metabolism | E | 112 | 58 | 170 | 85 | 111717 | 0.075317667 |
| Transcription | K | 100 | 54 | 154 | 68 | 60555 | 0.999999395 |
| Signal transduction mechanisms | T | 102 | 41 | 143 | 43 | 55596 | 0.99999919 |
| Inorganic ion transport and metabolism | P | 92 | 40 | 132 | 70 | 100593 | 0.000946095 |
| Replication, recombination and repair | L | 80 | 46 | 126 | 51 | 75246 | 0.463792377 |
| Coenzyme transport and metabolism | H | 83 | 42 | 125 | 60 | 63582 | 0.955338675 |
| Carbohydrate transport and metabolism | G | 83 | 33 | 116 | 57 | 82752 | 0.017486939 |
| Cell wall/membrane/envelope biogenesis | M | 70 | 39 | 109 | 57 | 70737 | 0.164200506 |
| Post-translational modification, protein turnover, and chaperones | O | 41 | 31 | 72 | 42 | 54294 | 0.016391663 |
| Cell cycle control, cell division, chromosome partitioning | D | 34 | 18 | 52 | 22 | 38130 | 0.059214452 |
| Nucleotide transport and metabolism | F | 28 | 22 | 50 | 33 | 36285 | 0.073929704 |
| Defense mechanisms | V | 29 | 14 | 43 | 26 | 34749 | 0.017540819 |
| Intracellular trafficking, secretion, and vesicular transport | U | 6 | 6 | 12 | 7 | 8544 | 0.310996283 |
| Cell motility | N | 1 | 0 | 1 | 0 | 0 | 1 |
| Not in COGs | R | 441 | 228 | 669 | 386 | 325323 | 0.999999959 |
| General function prediction only | R | 250 | 136 | 386 | 189 | 238986 | 0.156980544 |
| Function unknown | S | 116 | 69 | 185 | 106 | 101895 | 0.838428538 |
Fig. 3Comparison of the ratio of nsSNP/sSNP. Observed differences in the ratio of nsSNP/sSNP before and after the expansion of lineage 7
Fig. 4List of specific deletions identified in all lineage 7 isolates included in this study. Deletions were identified by calculating the coverage rate throughout the genome using bedtools on the alignment files generated by samtools. Each line corresponds to the coverage rate for each strain. The gray line corresponds to a control strain that does not include any of the identified deletions specific for the lineage isolates. For each deletion, the information is: genomic coordinates, size of the deleted region and the genes concerned.