| Literature DB >> 25176035 |
Francesc Coll1, Ruth McNerney1, José Afonso Guerra-Assunção2, Judith R Glynn2, João Perdigão3, Miguel Viveiros4, Isabel Portugal3, Arnab Pain5, Nigel Martin6, Taane G Clark7.
Abstract
Strain-specific genomic diversity in the Mycobacterium tuberculosis complex (MTBC) is an important factor in pathogenesis that may affect virulence, transmissibility, host response and emergence of drug resistance. Several systems have been proposed to classify MTBC strains into distinct lineages and families. Here, we investigate single-nucleotide polymorphisms (SNPs) as robust (stable) markers of genetic variation for phylogenetic analysis. We identify ~92 k SNP across a global collection of 1,601 genomes. The SNP-based phylogeny is consistent with the gold-standard regions of difference (RD) classification system. Of the ~7 k strain-specific SNPs identified, 62 markers are proposed to discriminate known circulating strains. This SNP-based barcode is the first to cover all main lineages, and classifies a greater number of sublineages than current alternatives. It may be used to classify clinical isolates to evaluate tools to control the disease, including therapeutics and vaccines whose effectiveness may vary by strain type.Entities:
Mesh:
Year: 2014 PMID: 25176035 PMCID: PMC4166679 DOI: 10.1038/ncomms5812
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1Global phylogeny of 1,601 MTBC isolates.
A total of 91,648 SNPs spanning the whole genome were used to reconstruct the phylogeny of 1,601 MTBC isolates. All seven main MTBC lineages are indicated at the inner area of the tree. The main sublineages are annotated at the outer arc along with lineage-specific RDs. Identified clades are colour coded.
A comparison of SNP-typing systems.
| This study62 SNPs | Yes (8 | Yes (6 | Yes (5 | Yes (36 | Yes (0) | Yes (0) | Yes (0) | Yes (0) | 7 (55) | 19 |
| Homolka | Yes (1 | Yes (0) | Yes (0) | Yes (7 | Yes (2 | Yes (0) | No (−) | Yes (0) | 6 (10) | NA |
| Comas | Yes (1 | Yes (2 | Yes (0) | Yes (5 | Yes (0) | Yes (0) | No (−) | Yes (0) | 6 (7) | 9 |
| Filliol | No (−) | No (−) | No (−) | No (−) | No (−) | No (−) | No (−) | No (−) | 6 (−) | NA |
NA, not applicable; RD, regions of difference; SNP, single-nucleotide polymorphism.
*See Supplementary Data 1 for a complete description of lineages and sublineages.
†RD239, 105, 207, 181, 150, 142, 750, 182, 183, 193, 122, 726, 219, 761, 115, 174, 724, 711, 702; 239.
‡Lineage 1.2.1.
§Lineages 4.6.2.2, 4.1.2.1, 4.3, 4.4.1.1, 4.2.2.1, 4.2.1 and an ambiguous ‘Ghana’.
||West Africanum Ia and Ib.
¶Lineages 2.1 and 2.2.
#Lineages 4.6.2.2, 4.6.1, 4.1.1, 4.1.2.1 and 4.3.
**RD105, 207, 750, 726, 724, 182, 711, 702 and 7.