| Literature DB >> 12453367 |
Austin L Hughes1, Robert Friedman, Megan Murray.
Abstract
Comparison of the pattern of synonymous nucleotide substitution between two complete genomes of Mycobacterium tuberculosis at 3298 putatively orthologous loci showed a mean percent difference per synonymous site of 0.000328 0.000022. Although 80.5% of loci showed no synonymous or nonsynonymous nucleotide differences, the level of polymorphism observed at other loci was greater than suggested by previous studies of a small number of loci. This level of nucleotide difference leads to the conservative estimate that the common ancestor of these two genotypes occurred approximately 35000 ago, which is twice as high as some recent estimates of the time of origin of this species. Our results suggest that a large number of loci should be examined for an accurate assessment of the level of nucleotide diversity in natural populations of pathogenic microorganisms.Entities:
Mesh:
Year: 2002 PMID: 12453367 PMCID: PMC2738538 DOI: 10.3201/eid0811.020064
Source DB: PubMed Journal: Emerg Infect Dis ISSN: 1080-6040 Impact factor: 6.883
FigureA plot of p(A|p), the probability of assignment of a locus to group A given the observed p value, as a function of p at 3,309 loci compared between the H37Rv and CDC1551 genotypes of Mycobacterium tuberculosis. The plot shows the bimodal nature of the distribution of p values, with overall higher values of p at the 11 loci having p(A|p) >50%.
Estimatesa of the divergence time of the H37Rv and CDC1551 genotypes of Mycobacterium tuberculosis
| Reference | No. loci | Synomymous substitutions/site/yr | Divergence time (H37Rv and CDC1551) |
|---|---|---|---|
|
| 67 | 4.7 ± 0.2 X 10-9 | 34,900 ± 2,300 b (33,500–36,400) c |
|
| 128 | 4.4 X 10-9 | 37,300 ± 2,500 b |
aBased on synonymous substitutions between Escherichia coli and Salmonella typhimurium, assumed to have diverged 100 million years ago (13,14). bEstimates are shown ± standard error, based on standard error of mean p. cRange based on standard error of rate estimate.
Proteins for which the nearest homologous comparison between the H37Rv and CDC1551 genotypes of Mycobacterium tuberculosis has a high p value (Group B)
| Accession nos. | Protein function |
|
| |
|---|---|---|---|---|
| Probable differential deletion | ||||
| NP_216309, NP_335079 | unknown | 0.0470 | 0.0000 | |
| NP_215713, NP_335504 | unknown | 0.0628 | 0.0043 | |
| NP_216319, NP_336310 | PE repeat family | 0.0115 | 0.0094 | |
| NP_215965, NP_335949 | PE repeat family | 0.0448 | 0.0185 | |
| Possible horizontal gene transfer | ||||
| NP_214910, NP_334815 | unknown | 0.0226 | 0.0105 | |
| NP_216104, NP_336077 | unknown | 0.0265 | 0.0084 | |
| NP_216281, NP_336535 | unknown | 0.0210 | 0.0161 | |
| NP_215835, NP_335809 | adenylate cyclase | 0.0229 | 0.0068 | |
| NP_216564, NP_336573 | polyketide synthase | 0.0093 | 0.0036 | |
| NP_217029, NP_337080 | unknown | 0.0148 | 0.0156 | |
| NP_216862, NP_335679 | unknown | 0.0313 | 0.0000 | |
| Mean ± S.E. | 0.0286 ± 0.0050 | 0.0085 ± 0.0019a | ||
aPaired sample t-test of the hypothesis that p = p , p<0.01. The quantities p and p are the proportion of nucleotide difference per synonymous site and per nonsynonymous site, respectively.