| Literature DB >> 23995134 |
Iñaki Comas1, Mireia Coscolla, Tao Luo, Sonia Borrell, Kathryn E Holt, Midori Kato-Maeda, Julian Parkhill, Bijaya Malla, Stefan Berg, Guy Thwaites, Dorothy Yeboah-Manu, Graham Bothamley, Jian Mei, Lanhai Wei, Stephen Bentley, Simon R Harris, Stefan Niemann, Roland Diel, Abraham Aseffa, Qian Gao, Douglas Young, Sebastien Gagneux.
Abstract
Tuberculosis caused 20% of all human deaths in the Western world between the seventeenth and nineteenth centuries and remains a cause of high mortality in developing countries. In analogy to other crowd diseases, the origin of human tuberculosis has been associated with the Neolithic Demographic Transition, but recent studies point to a much earlier origin. We analyzed the whole genomes of 259 M. tuberculosis complex (MTBC) strains and used this data set to characterize global diversity and to reconstruct the evolutionary history of this pathogen. Coalescent analyses indicate that MTBC emerged about 70,000 years ago, accompanied migrations of anatomically modern humans out of Africa and expanded as a consequence of increases in human population density during the Neolithic period. This long coevolutionary history is consistent with MTBC displaying characteristics indicative of adaptation to both low and high host densities.Entities:
Mesh:
Year: 2013 PMID: 23995134 PMCID: PMC3800747 DOI: 10.1038/ng.2744
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Figure 1Genome-based phylogeny of MTBC mirrors that of human mitochondrial genomes
A, Whole-genome phylogeny of 220 strains of Mycobacterium tuberculosis complex (MTBC). Support values for the main branches after inference with Neighbour-joining (left) and Maximum-likelihood (right) are shown. B, Principal Component Analysis (PCA) of the 34,167 SNPs. The first three PCA axes are shown; these discriminate between evolutionarily “modern” (highlighted in grey) and “ancient” (all other) strains. Individual lineages are shown following the colour coding of Fig. 1A. C and D, Comparison of MTBC phylogeny (C) and a phylogeny derived from 4,955 mitochondrial genomes representative of the main human haplogroups (D). The colour coding highlights the similarities in tree topology and geographic distribution of MTBC strains and main human mitochondrial macro-haplogroups (black - African clades: MTBC Lineage 5 and 6, human mitochondrial macro-haplogroups L0-L3; pink – South-East Asian and Oceanian clades: MTBC Lineage 1, human mitochondrial macro-haplogroup M; blue – Eurasian clades: MTBC Lineage 2, 3, and 4, human mitochondrial macro-haplogroup N). The MTBC Lineage 7 has only been found in Ethiopia and its correlation with any of the three main human haplogroups remains unclear.
Comparison of different dating scenarios of MTBC evolution.
| Dating scenario | MTBC-70 | MTBC-185 | MTBC-10 | MTBC-65 | |
|---|---|---|---|---|---|
| Emergence of MTBC with human mtDNA haplogroup L3 | Emergence of MTBC with anatomically modern humans | Emergence of MTBC during Neolithic Demographic Transition | Emergence of Out-of-Africa MTBC with human mtDNA haplogroup M | ||
| most recent common ancestor of MTBC | 73 (50–96) | 198 (170–229) | 11 (9–14) | 67 (44–91) | |
| coalescent time for Lineage 5/6 | 70 (48–88) | 184 (164–203) | 10 (8–12) | 61 (40–81) | |
| coalescent time for Lineage 1 | 67 (46–88) | 183 (160–207) | 10 (8–12) | 62 (42–82) | |
| coalescent time for Lineage 2/3/4 | 46 (31–61) | 126 (104–148) | 7 (6–10) | 41 (26–55) | |
| period of maximum logistic growth | 4–7 | 31–34 | 1 | 4–7 | |
| 3.37E-4 (2.38E-4-4.65E-4) | 1.23E-4 (1.04E-4-1.46E-4) | 2.17E-3 (1.71E-3-2.68E-3) | 3.78E-4 (2.62E-4-5.36E-4) | ||
dates are shown as the median value and 95% highest posterior density interval predicted in the corresponding Bayesian analysis
value provided as prior input in Bayesian analysis
BEAST predicted rate of SNP accumulation (per polymorphic position and thousand year). In the main text we use the estimated genomic substitution rate (per position and year) for comparative purposes with published estimations from other bacterial species.
Figure 2Out-of-Africa and Neolithic expansion of MTBC
A, Map summarizing the results of the phylogeographic and dating analyses of MTBC. The colour codes used for lineages are according to Fig. 1A. Major splits are annotated with the median value (in kya) of the dating of the relevant node. Lineage 7 (yellow) has so far been isolated exclusively from patients with known country of origin in the Horn of Africa[14]. Lineage 7 diverged subsequent to the proposed Out-of-Africa migration of MTBC; it may have arisen amongst a human population that remained in Africa, or a population that returned to Africa. B, Bayesian skyline plots illustrating changes in population diversity of MTBC (red line) and humans based on mitochondrial DNA (blue lines) during the last 60 ky. Dashed lines represent the 95% highest probability density (HPD) intervals for the estimated population sizes.
Figure 3Neolithic expansion and spread of MTBC Lineage 2 “Beijing” in East Asia
A, Bayesian skyline indicating changes in Lineage 2 diversity over time (red line) as compared to human mtDNA haplogroups from East Asia (blue line). 95% HPD intervals for the population size estimations are shown in dashed lines. B, Dated Bayesian phylogeny of the MTBC Lineage 2 based on coalescent analysis. C, Map of the parallel origin and migration of MTBC and humans in East Asia indicating the first archaeological evidence of modern human in the region 32–42 kya, coinciding with the migration of MTBC from Central to East Asia, the start of the Neolithic in the region indicated by the first evidence of domesticated crop in China coinciding with the origin of the MTBC “Beijing family” 8 kya (6–11 kya), and the co-expansion of agriculture and MTBC “Beijing family” into neighbouring countries 3–5 kya.
Figure 4Time-dependent decay of substitution rates in bacteria based on whole-genome datasets
Scatter plot graph representing the relationship between substitution rate and time span between the most recent ancestor and the last sampling date for each studied pathogen. Values were extracted from relevant publications that use whole-genome representative datasets and coalescent analysis of substitution rates (for a complete list of references see Supplementary Table 9).