| Literature DB >> 19036132 |
Claudine Montgelard1, Ellen Forty, Véronique Arnal, Conrad A Matthee.
Abstract
BACKGROUND: The number of rodent clades identified above the family level is contentious, and to date, no consensus has been reached on the basal evolutionary relationships among all rodent families. Rodent suprafamilial phylogenetic relationships are investigated in the present study using approximately 7600 nucleotide characters derived from two mitochondrial genes (Cytochrome b and 12S rRNA), two nuclear exons (IRBP and vWF) and four nuclear introns (MGF, PRKC, SPTBN, THY). Because increasing the number of nucleotides does not necessarily increase phylogenetic signal (especially if the data is saturated), we assess the potential impact of saturation for each dataset by removing the fastest-evolving positions that have been recognized as sources of inconsistencies in phylogenetics.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19036132 PMCID: PMC2613922 DOI: 10.1186/1471-2148-8-321
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Intron sequences
| Total alignment | Conserved positions | Exon | Intron | Mean Intron Length | Standard deviation | |
|---|---|---|---|---|---|---|
| MGF | 1330 | 820 | 35 | 785 | 684 | 82 |
| PRKC | 2355 | 533 | 77 | 456 | 553 | 182 |
| SPTBN | 2578 | 833 | 77 | 756 | 706 | 159 |
| THY | 1790 | 711 | 227 | 484 | 481 | 91 |
| TOTAL | 8053 | 2897 | 416 | 2481 | - | - |
Number of positions: in the full alignment (column 1), after elimination of poorly aligned positions by Gblocks (column 2), in the remaining exonic parts (column 3) and in the intronic regions (column 4). For each gene, columns 5 and 6 give the mean intron length before alignment and its standard deviation.
Gamma rate distribution for the mitochondrial (mito), exon and intron genes
| Rate | MITO | EXON | INTRON | |||
|---|---|---|---|---|---|---|
| Category | 2126 sites | Rate | 2571 sites | Rate | 2897 sites | Rate |
| 1 | 826 (38.8%) | 0.000 | 944 (36.7%) | 0.004 | 708 (24.4%) | 0.275 |
| 2 | 0 (0%) | 0.0006 | 0 (0%) | 0.045 | 231 (8%) | 0.475 |
| 3 | 0 (0%) | 0.009 | 31 (1.2%) | 0.144 | 228 (7.9%) | 0.641 |
| 4 | 280 (13.2%) | 0.052 | 371 (14.4%) | 0.321 | 266 (9.2%) | 0.806 |
| 5 | 228 (10.7%) | 0.200 | 201 (7.8%) | 0.607 | 241 (8.3%) | 0.988 |
| 6 | 340 (16%) | 0.614 | 347 (13.5%) | 1.070 | 313 (10.8%) | 1.207 |
| 7 | 187 (8.9%) | 1.706 | 364 (14.2%) | 1.890 | 374 (12.9%) | 1.510 |
| 8 | 265 (12.4%) | 5.420 | 313 (12.2%) | 3.919 | 536 (18.5%) | 2.10 |
For each dataset and the eight gamma categories, the number of sites (percentages in parentheses) is given in the left column and relative rates in the right column.
Slope of saturation for each gene partition before and after (in italics) removal of fast-evolving positions
| Slopes of saturation (number of position considered) | ||||
|---|---|---|---|---|
| Gene | Total number of position | Partition 1 | Partition 2 | Partition 3 |
| Cytochrome | 1140 | 0.13 (380) | 0.42 (380) | 0.009 (380) |
| 12S rRNA | 986 | 0.29 (458) | 0.12 (528) | |
| IRBP | 1272 | 0.55 (433) | 0.59 (433) | 0.25 (433) |
| vWF | 1299 | 0.52 (424) | 0.55 (424) | 0.11 (424) |
| MGF | 785 | 0.70 | ||
| PRKC | 456 | 0.56 | ||
| SPTBN | 756 | 0.65 | ||
| THY | 484 | 0.58 | ||
| Flanking-Exons of introns | 416 | 0.31 | ||
Protein coding genes (cytochrome b, vWF and IRBP) have been partitioned according to codon positions, the 12S rRNA is partitioned in stems and loops and there is one partition for each intron and one partition for combined exons. In each case, the number of positions is given in parentheses.
Figure 1Bayesian phylogram of Rodentia. Phylogenetic relationships are inferred from the reduced-concatenated dataset. Numbers at nodes refer, from left to right respectively, to bootstrap percentages in ML analysis with RAxML (100 replications), to posterior probabilities in Bayesian analysis and to bootstrap percentages in MP analysis with PAUP (1000 replications). Only nodes not supported by posterior probabilities of 1.00 or 100% BP are indicated. In both probabilistic analyses, dataset was analysed with the GTR + I + G model applied to 13 independent partitions (see text for details). Rodent family names are indicated on the right and grey upper case letters at nodes correspond to suprafamilial groupings as defined in Additional file 4.
Supports for two suprafamilial groupings according to various datasets: the three separate (mitochondrial, exon and intron genes), the three combinations of two datasets and all genes concatenated (conc).
| Myomorpha + Castorimorpha + Anomaluromorpha | Sciuromorpha + Hystricomorpha + "Mouse-related" clade | |||||
|---|---|---|---|---|---|---|
| Myo + Ano | Myo + Casto | Casto + Ano | Sciuro + Hystrico | Hystrico basal | Sciuro basal | |
| MITO (2126) | 10/0.17 | 14/0.37 | 15/0.39 | 5/- | -/- | 3/0.48 |
| R-MITO (1861) | -/- | 2/- | 9/- | 29/- | 4/0.33 | 2/- |
| EXON (2571) | 38/- | 29/0.70 | -/0.05 | 5/0.08 | 37/0.74 | 16/- |
| R-EXON (2258) | 31/0.7 | 16/0.2 | 1/0.09 | 17/0.09 | 57/0.81 | 11/0.08 |
| INTRON (2897) | 100/1.00 | -/- | -/- | 54/0.69 | 30/0.28 | 16/- |
| R-INTRON (2361) | 77/0.79 | 20/0.14 | 3/0.08 | 95/1.00 | -/- | 5/- |
| MITO + EXON (4697) | 32/0.17 | 56/0.74 | 7/0.09 | 4/- | 26/- | 54/0.99 |
| R-MITO + R-EXON (4119) | 5/- | -/- | -/- | 30/0.16 | 46/0.57 | 14/0.27 |
| MITO + INTRON (5023) | 95/0.99 | 3/- | 2/- | 37/0.43 | 26/0.13 | 37/0.45 |
| R-MITO + R-INTRON (4222) | 74/0.70 | 26/0.21 | -/0.08 | 98/1.00 | -/- | 2/- |
| EXON + INTRON (5468) | 90/0.97 | 10/- | -/- | 44/0.38 | 46/0.57 | 10/- |
| R-EXON + R-INTRON (4619) | 84/0.92 | 15/0.06 | 1/- | 90/1.00 | 7/- | 5/- |
| CONC (7594) | 88/0.93 | 12/0.06 | 27/0.07 | 37/0.23 | 36/0.32 | 27/0.45 |
| R-CONC (6480) | 73/0.87 | -/- | -/- | 91/1.00 | 5/- | 4/- |
Each dataset is analysed with and without (noted R for reduced) fast-evolving sites and the numbers of characters analysed is indicated in parenthesis. In each case, the bootstrap support resulting from 100 replications in partitioned maximum likelihood analysis with RaxML and the posterior probability in Bayesian analysis with MrBayes are indicated from right to left, respectively for the three possible relationships among the "mouse-related clade" (Myomorpha, Castorimorpha and Anomaluromorpha) and between the three main rodent lineages (Sciuromorpha, Hystricomorpha and "Mouse-related" clade).
Three tests of nine a priori topologies
| Topology tested | Whole Dataset 7594 nucleotides | Reduced Dataset 6480 nucleotides | ||||
|---|---|---|---|---|---|---|
| 2 (((((Casto, Ano), Myo), (Hyst, Sciu)) | 0.126 | 0.580 | 0.018 | 0.432 | 0.662 | 0.196 |
| 3 ((((Myo, Casto), Ano), (Hyst, Sciu)) | 0.315 | 0.615 | 0.028 | 0.211 | 0.591 | 0.111 |
| 4 ((((Myo, Ano), Casto), Sciu), Hyst) | 0.544 | 0.799 | 0.215 | 0.176 | 0.149 | 0.001 |
| 5 ((((Casto, Ano), Myo), Sciu), Hyst) | 0.056 | 0.463 | 0.005 | 0.066 | 0.083 | <10-3 |
| 6 ((((Myo, Casto), Ano), Sciu), Hyst) | 0.297 | 0.561 | 0.014 | 0.049 | 0.059 | <10-5 |
| 7 ((((Myo, Casto), Ano), Hyst,) Sciu) | 0.296 | 0.540 | 0.011 | 0.051 | 0.079 | 10-4 |
| 8 ((((Myo, Ano), Casto), Hyst,) Sciu) | 0.540 | 0.778 | 0.165 | 0.044 | 0.128 | 0.001 |
| 9 ((((Casto, Ano), Myo), Hyst,) Sciu) | 0.093 | 0.464 | 0.005 | 0.035 | 0.066 | <10-6 |
P-values of the approximately unbiased test (AU), Shimodaira-Hasegawa test (SH) and approximate Bayesian posterior probability (PP) for nine topologies tested before (whole) and after (reduced) removal of fast-evolving positions. The best topology is indicated in bold with the following abbreviations: Myo = Myomorpha; Ano = Anomaluromorpha; Casto = Castorimorpha; Hyst = Hystricomorpha; Sciu = Sciuromorpha.
Figure 2Chronogram of rodent diversification. Divergence times are calculated from the reduced-concatenated dataset using a relaxed molecular clock with four constrained nodes numbered 1 to 4 (see text for details) and represented by stars. Estimated dates with the 95% credibility intervals are represented by grey boxes at nodes. Time scale is indicated in Million of years below the figure. The zygomasseteric condition (H: Hystricomorphy; M: Myomorphy; S: Sciuromorphy; P: Protogomorphy) of each rodent family is specified on the right.