| Literature DB >> 20535231 |
Véronique Campbell1, François-Joseph Lapointe.
Abstract
TWO DIFFERENT APPROACHES CAN BE USED IN PHYLOGENOMICS: combined or separate analysis. In the first approach, different datasets are combined in a concatenated supermatrix. In the second, datasets are analyzed separately and the phylogenetic trees are then combined in a supertree. The supertree method is an interesting alternative to avoid missing data, since datasets that are analyzed separately do not need to represent identical taxa. However, the supertree approach and the corresponding consensus methods have been highly criticized for not providing valid phylogenetic hypotheses. In this study, congruence of trees estimated by consensus and supertree approaches were compared to model trees obtained from a combined analysis of complete mitochondrial sequences of 102 species representing 93 mammal families. The consensus methods produced poorly resolved consensus trees and did not perform well, except for the majority rule consensus with compatible groupings. The weighted supertree and matrix representation with parsimony methods performed equally well and were highly congruent with the model trees. The most similar supertree method was the least congruent with the model trees. We conclude that some of the methods tested are worth considering in a phylogenomic context.Entities:
Keywords: DNA sequences; combined analysis; consensus; phylogenomics; separate analysis; supermatrix
Year: 2010 PMID: 20535231 PMCID: PMC2880846 DOI: 10.4137/ebo.s4527
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
Figure 1.First model tree (MT1) representing mitogenomic relationships among 93 mammalian families. Bootstrap values (BS) and Bayesian posterior probabilities (BPP) are indicated on branches (BS/BPP). Branches without values correspond to BS/BPP = 100/100.
Figure 2.Second model tree (MT2) representing mitogenomic relationships among mammalian families with eight extra polytomies added to MT1 to obtain a tree compatible with recent molecular studies. Bootstrap values (BS) and Bayesian posterior probabilities (BPP) are indicated on branches (BS/BPP). Branches without values correspond to BS/BPP = 100/100.
Statistical description of individual datasets (the twelve genes on the mitochondrial H-strand) and of concatenated datasets (AL). L (bp): length of the gene in base pairs. No cst: number of constant sites in the alignment. No info: number of informative sites in the alignment. AIC: Model selected according to AIC criterion in Modeltest, which always included parameters G (a gamma distribution of substitution rates) and I (a proportion of invariable sites). χ2 (1, 2): chi square test for homogeneity of base frequencies across species on datasets with third codon position removed (P = 1.0 in every case). χ2 (1, 2, 3): chi square test on datasets with codon positions 1, 2 and 3 included.
| ATP6 | 452 | 200 (44.2) | 204 (45.1) | GTR | 78.25 | 419.99 |
| ATP8 | 60 | 10 (16.7) | 47 (78.3) | TrN | 145.87 | 198.83 |
| COX1 | 1022 | 780 (76.3) | 157 (15.4) | GTR | 15.85 | 573.75 |
| COX2 | 440 | 247 (56.1) | 150 (34.1) | TVM | 28.26 | 290.95 |
| COX3 | 522 | 339 (64.9) | 137 (26.2) | TVM | 33.42 | 320.19 |
| CYTB | 754 | 402 (53.3) | 276 (36.6) | TIM | 80.04 | 525.82 |
| NAD1 | 618 | 312 (50.5) | 239 (38.7) | GTR | 69.18 | 484.12 |
| NAD2 | 690 | 173 (25.1) | 463 (67.1) | GTR | 129.20 | 623.00 |
| NAD3 | 230 | 109 (47.4) | 100 (43.5) | TrN | 78.92 | 251.49 |
| NAD4 | 918 | 380 (41.4) | 458 (49.9) | GTR | 96.29 | 681.88 |
| NAD4 L | 192 | 69 (35.9) | 105 (54.7) | TVM | 65.08 | 251.49 |
| NAD5 | 1202 | 463 (38.5) | 619 (51.5) | GTR | 136.64 | 919.59 |
| ALL | 7100 | 3484 (49.1) | 2955 (41.6) | GTR | 267.78 | 4107.63 |
Identifies significant values after a Bonferroni correction, P < 0.004 (i.e. 0.05/12).
GTR: General time reversible mode.l70–72
TrN: Tamura-Nei model.111
TVM: Tranversional model.69
TIM: Transitional model.69
Congruence of phylogenetic trees inferred from consensus and supertree methods (that ignore or consider branch lengths). CI: Rohlf’s consensus information index (ranges from 0 to 1; 0 being a bush and 1 a fully resolved tree), PM: partition metric, D1: maximum agreement subtrees. Indices range from 0 to 1. MR: majority rule consensus, MRC: majority rule consensus with compatible groupings, MRP: matrix representation with parsimony, MSS: most similar supertree, SFIT: maximum splits fit, AC: average consensus, SDM: unweighted super-distance matrix, SDMw: weighted super-distance matrix.
| Topological consensus methods | Strict | 0.02 | 0.46 | 0.94 | 0.39 | 0.94 |
| MR | 0.10 | 0.30 | 0.79 | 0.25 | 0.73 | |
| MRC | 0.94 | 0.22 | 0.39 | 0.23 | 0.46 | |
| Adams | 0.18 | 0.42 | 0.78 | 0.36 | 0.75 | |
| Topological supertree methods | MRP | 0.98 | 0.23 | 0.47 | 0.23 | 0.43 |
| MSS | 0.91 | 0.54 | 0.62 | 0.56 | 0.60 | |
| SFIT | 0.53 | 0.24 | 0.51 | 0.22 | 0.50 | |
| Branch-length supertree methods | AC | 1.00 | 0.25 | 0.43 | 0.22 | 0.49 |
| SDM | 1.00 | 0.27 | 0.45 | 0.24 | 0.50 | |
| SDMw | 1.00 | 0.30 | 0.45 | 0.27 | 0.50 | |
Strict consensus of five most parsimonious trees.
Strict consensus of two equally optimal supertrees.
Strict consensus of 184 equally optimal supertrees.
GenBank accession numbers of complete mitochondrial DNA sequences from 102 species representing 93 mammalian families. Family and species taxonomy based on Wilson and Reeder.1
| MONOTREMATA | Tachyglossidae | NC_003321 | |
| Ornithorhynchidae | NC_000891 | ||
| DIDELPHIMORPHIA | Didelphidae | NC_001610 | |
| PAUCITUBERCULATA | Caenolestidae | NC_005828 | |
| MICROBIOTHERIA | Microbiotheriidae | NC_005826 | |
| DASYUROMORPHIA | Thylacinidae | NC_011944 | |
| Myrmecobiidae | NC_011949 | ||
| Dasyuridae | NC_006523 | ||
| PERAMELEMORPHIA | Thylacomyidae | NC_006520 | |
| Peramelidae | NC_002746 | ||
| NOTORYCTEMORPHIA | Notoryctidae | NC_006522 | |
| DIPROTODONTIA | Phascolarctidae | NC_008133 | |
| Vombatidae | NC_003322 | ||
| Phalangeridae | NC_003039 | ||
| Potoroidae | NC_006524 | ||
| Macropodidae | NC_001794 | ||
| Pseudocheiridae | NC_006519 | ||
| Petauridae | NC_008135 | ||
| Tarsipedidae | NC_006518 | ||
| Acrobatidae | NC_008145 | ||
| XENARTHRA | Dasypodidae | NC_001821 | |
| Bradypodidae | NC_006923 | ||
| Megalonychidae | NC_006924 | ||
| Myrmecophagidae | NC_004032 | ||
| PROBOSCIDEA | Elephantidae | NC_005129 | |
| NC_000934 | |||
| SIRENIA | Dugongidae | NC_003314 | |
| Trichechidae | NC_010302 | ||
| HYRACOIDEA | Procaviidae | NC_004919 | |
| NC_010301 | |||
| TUBULIDENTATA | Orycteropodidae | NC_002078 | |
| MACROSCELIDEA | Macroscelididae | NC_004026 | |
| NC_004921 | |||
| AFROSORICIDA | Tenrecidae | NC_002631 | |
| Chrysochloridae | NC_004920 | ||
| NC_010304 | |||
| CETACARTIODACTYLA | Balaenidae | NC_005268 | |
| Balaenopteridae | NC_006927 | ||
| Eschrichtiidae | NC_005270 | ||
| Neobalaenidae | NC-005269 | ||
| Delphinidae | NC_005278 | ||
| Monodontidae | NC_005279 | ||
| Phocoenidae | NC_005280 | ||
| Physeteridae | NC_002503 | ||
| Iniidae | NC_005276 | ||
| Platanistidae | NC_005275 | ||
| Ziphiidae | NC_005274 | ||
| Suidae | NC_000845 | ||
| Tayassuidae | NC_012103 | ||
| Hippopotamidae | NC_000889 | ||
| Camelidae | NC_002504 | ||
| Giraffidae | NC_012100 | ||
| Cervidae | NC_007704 | ||
| Bovidae | NC_001567 | ||
| PERISSODACTYLA | Equidae | NC_001640 | |
| Tapiridae | NC_005130 | ||
| Rhinocerotidae | NC_001808 | ||
| CARNIVORA | Ailuridae | NC_011124 | |
| Ursidae | NC_003426 | ||
| Canidae | NC_008434 | ||
| Felidae | NC_001700 | ||
| Herpestidae | NC_006835 | ||
| Mustelidae | NC_009685 | ||
| Otariidae | NC_001050 | ||
| Odobenidae | NC_004029 | ||
| Phocidae | NC_001325 | ||
| Procyonidae | NC_009126 | ||
| Mephitidae | AY598529–AY598539, X94927 | ||
| EULIPOTYPHLA | Soricidae | NC_006893 | |
| NC_005435 | |||
| NC_003040 | |||
| Talpidae | NC_002391 | ||
| NC_008156 | |||
| NC_005035 | |||
| NC_005034 | |||
| CHIROPTERA | Pteropodidae | NC_002612 | |
| Vespertilionidae | NC_002626 | ||
| Mystacinidae | NC_006925 | ||
| Rhinolophidae | NC_005433 | ||
| Phyllostomidae | NC_002009 | ||
| RODENTIA | Thryonomyidae | NC_002658 | |
| Caviidae | NC_000884 | ||
| Gliridae | NC_001892 | ||
| Sciuridae | NC_002369 | ||
| Dipodidae | NC_005314 | ||
| Spalacidae | NC_005315 | ||
| Cricetidae | NC_007936 | ||
| Muridae | NC_005089 | ||
| LAGOMORPHA | Ochotonidae | NC_005358 | |
| Leporidae | NC_001913 | ||
| PRIMATES | Lemuridae | NC_004025 | |
| Indriidae | NC_011053 | ||
| Daubentoniidae | NC_010299 | ||
| Lorisidae | NC_002765 | ||
| Tarsiidae | NC_002811 | ||
| Cebidae | NC_002763 | ||
| Aotidae | AY250707 | ||
| Cercopithecidae | NC_005943 | ||
| Hylobatidae | NC_002082 | ||
| Hominidae | NC_001643 | ||
| DERMOPTERA | Cynocephalidae | NC_004031 | |
| SCANDENTIA | Tupaiidae | NC_002521 |
Wilson DE, Reeder DM. Mammal Species of the World: A Taxonomic and Geographic Reference. 3rd ed. Baltimore, Maryland: The Johns Hopkins University Press; 2005.