| Literature DB >> 22666131 |
Takahiro Yonezawa1, Masami Hasegawa.
Abstract
Although overwhelming circumstantial evidence supports the existence of the universal common ancestor of all extant life on Earth, it is still an open question whether the universal common ancestor existed or not. Theobald (Nature 465, 219-222 (2010)) recently challenged this problem with a formal statistical test applied to aligned sequences of conservative proteins sampled from all domains of life and concluded that the universal common ancestor hypothesis holds. However, we point out that there is a fundamental flaw in Theobald's method which used aligned sequences. We show that the alignment gives a strong bias for the common ancestor hypothesis, and we provide an example that Theobald's method supports a common ancestor hypothesis for two apparently unrelated families of protein-encoding sequences (cytb and nd2 of mitochondria). This arouses suspicion about the effectiveness of the "formal" test.Entities:
Mesh:
Year: 2012 PMID: 22666131 PMCID: PMC3361263 DOI: 10.1100/2012/479824
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Figure 1Independent origins hypothesis versus common origin hypotheses of cytb and nd2. No branch exists connecting the two genes in the independent origins hypothesis, while the common ancestor of the two genes exists in the common origin hypothesis.
Formal tests of the common ancestry between cytb and nd2 based on the amino acid sequence data sets aligned with various values of gap penalties (GOP and GEP).
| Model | No alignment (346 aa) | (GOP, GEP) = (100, 100) (338 aa) | (GOP, GEP) = (15, 6.66) (342 aa) | (GOP, GEP) = (10, 0.1) (330 aa) | (GOP, GEP) = (1, 0.1) (313 aa) | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Independent | Common | Independent | Common | Independent | Common | Independent | Common | Independent | Common | |
| Poisson | 5934.3 | 5933.5† | 5748.6 | 5745.8† | 5856.9 | 5838.6† |
|
| 5403.1 | 5288.6† |
| Poisson+Γ | 5922.0† | 5933.5 | 5735.9† | 5740.6 | 5843.9 | 5832.3† |
|
| 5392.7 | 5288.5† |
| JTT | 5591.5 | 5586.1† | 5420.3 | 5414.0† | 5515.8 | 5495.6† |
|
| 5080.2 | 4879.8† |
| mtmam | 5247.4† | 5252.5 | 5083.1† | 5090.8 | 5174.7† | 5176.0 |
|
| 4754.3 | 4688.6† |
| JTT+F+Γ | 5304.3† | 5325.8 | 5133.7† | 5152.8 | 5226.8† | 5231.7 |
|
| 4809.5 | 4682.4† |
| mtmam+F+Γ |
| 5272.3 |
| 5107.7 |
| 5185.4 |
|
| 4759.7 |
|
|
| ||||||||||
| Homology* | 0.077 | 0.083 | 0.107 |
| 0.216 | |||||
AICs of each model comparing the independent and common origin hypotheses were shown. In the comparison between the two hypotheses, the hypothesis with lower AIC was indicated by †. The substitution model with the minimal AIC in each data set was indicated by an underline. Default values of GOP and GEP were indicated in bold fonts.
*Homology between cytb and nd2 alignments, which is defined by 1–(average p-distance between cytb and nd2).
(a)
| Model | No alignment (1038 bp) | (GOP, GEP) = (100, 100) (1026 bp) | (GOP, GEP) = (50, 6.66) (1029 bp) | |||
|---|---|---|---|---|---|---|
| Independent | Common | Independent | Common | Independent | Common | |
| JC | 11043.8 | 11005.5† | 10876.9 | 10844.5† | 10935.0 | 10862.9† |
| K80 | 10820.8† | 10821.2 | 10669.3 | 10662.2† | 10727.6 | 10684.4† |
| HKY | 10398.6† | 10414.7 | 10255.3† | 10266.6 | 10309.7 | 10294.4† |
| GTR | 10307.5† | 10320.4 | 10186.5† | 10192.1 | 10242.4 | 10224.3† |
| K80+Γ | 10789.5 | 10723.4† | 10637.5 | 10562.7† | 10695.7 | 10650.4† |
| HKY+Γ | 10329.8 | 10274.8† | 10186.4 | 10119.4† | 10239.7 | 10228.4† |
| GTR+Γ | 10271.9 |
| 10129.5 |
| 10184.1 |
|
|
| ||||||
| Homology* | 0.314 | 0.317 | 0.349 | |||
(b)
| Model | (GOP, GEP) = (30, 6.66) (1025 bp) | (GOP, GEP) = (15, 6.66) (999 bp) | (GOP, GEP) = (3, 6.66) (974 bp) | |||
|---|---|---|---|---|---|---|
| Independent | Common | Independent | Common | Independent | Common | |
| JC | 10890.6 | 10802.2† |
|
| 10262.1 | 9865.7† |
| K80 | 10684.6 | 10623.3† |
|
| 10056.9 | 9613.1† |
| HKY | 10271.8 | 10241.0† |
|
| 9645.8 | 9283.2† |
| GTR | 10204.9 | 10170.3† |
|
| 9585.0 | 9234.3† |
| K80+Γ | 10652.5 | 10577.5† |
|
| 10028.1 | 9595.4† |
| HKY+Γ | 10202.4 | 10162.0† |
|
| 9580.9 | 9249.5† |
| GTR+Γ | 10146.3 |
|
|
| 9531.1 |
|
|
| ||||||
| Homology* | 0.360 |
| 0.504 | |||
AICs of each model comparing the independent and common origin hypotheses were shown. In the comparison between the two hypotheses, the hypothesis with lower AIC was indicated by †. The substitution model with the minimal AIC in each data set was indicated by an underline. Default values of GOP and GEP were indicated in bold fonts.
*Homology between cytb and nd2 alignments, which is defined by 1–(average p-distance between cytb and nd2).