| Literature DB >> 17712423 |
Morgan Kullberg1, Björn Hallström, Ulfur Arnason, Axel Janke.
Abstract
BACKGROUND: We investigate the usefulness of expressed sequence tags, ESTs, for establishing divergences within the tree of placental mammals. This is done on the example of the established relationships among primates (human), lagomorphs (rabbit), rodents (rat and mouse), artiodactyls (cow), carnivorans (dog) and proboscideans (elephant). METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2007 PMID: 17712423 PMCID: PMC1942079 DOI: 10.1371/journal.pone.0000775
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Classification of the human homologues
| Function | Number of genes |
| Protein biosynthesis | 71 |
| Transport related | 35 |
| mRNA Transcription | 27 |
| Cell structure | 22 |
| Proteolysis | 21 |
| Cell cycle | 17 |
| Protein folding | 16 |
| mRNA splicing | 13 |
| Protein phosphorylation | 12 |
| Nucleoside, nt and na metabolism | 10 |
| DNA replication | 10 |
| Stress response | 8 |
| Cell motility | 8 |
| rRNA metabolism | 8 |
| Mitosis | 8 |
| Protein modification | 7 |
| Oxidative phosphorylation | 7 |
| Tricarboxylic acid pathway | 6 |
| Cell adhesion | 6 |
| Immunity and defense | 6 |
| Protein complex assembly | 6 |
| Protein glycosylation | 6 |
| Cell communication | 6 |
| Developmental processes | 6 |
| Intracellular protein traffic | 6 |
| Unclassified | 109 |
NOTE–Only the most common classes according to the PANTHER classification with >5 identified homologues are shown.
General statistics of the concatenated data sets
| Data set | Length nt | Gaps | Distance human-mouse | Constant sites | ||||
| 123cdp | 12cdp | aa | 123cdp | 12cdp | aa | |||
| maxgen | 164466 | 54% | 0.111 | 0.033 | 0.050 | 27.2% | 34.2% | 32.1% |
| maxspe | 77328 | 6% | 0.113 | 0.036 | 0.055 | 55.1% | 69.4% | 65.1% |
NOTE–The observed percentages and distances (substitutions per site) are shown.
Figure 1Distribution of the alignment lengths after trimming.
Figure 2Distribution of the pair wise aa distances between human and mouse.
Pairwise aa distances between species.
| Chicken |
| Elephant | Dog | Cow | Human | Rabbit | Mouse | Rat | |
| Chicken | 0.108 | 0.126 | 0.113 | 0.116 | 0.111 | 0.125 | 0.117 | 0.118 | |
| 0.135 | 0.162 | 0.142 | 0.146 | 0.139 | 0.161 | 0.147 | 0.149 | ||
|
| 0.125 | 0.103 | 0.092 | 0.093 | 0.090 | 0.103 | 0.100 | 0.100 | |
| 0.160 | 0.126 | 0.110 | 0.112 | 0.108 | 0.126 | 0.122 | 0.123 | ||
| Elephant | 0.141 | 0.108 | 0.051 | 0.056 | 0.050 | 0.064 | 0.067 | 0.068 | |
| 0.184 | 0.133 | 0.056 | 0.062 | 0.054 | 0.073 | 0.075 | 0.077 | ||
| Dog | 0.125 | 0.096 | 0.054 | 0.037 | 0.036 | 0.047 | 0.053 | 0.054 | |
| 0.158 | 0.114 | 0.059 | 0.039 | 0.038 | 0.051 | 0.058 | 0.059 | ||
| Cow | 0.128 | 0.100 | 0.059 | 0.039 | 0.040 | 0.052 | 0.057 | 0.058 | |
| 0.162 | 0.120 | 0.65 | 0.042 | 0.042 | 0.057 | 0.063 | 0.064 | ||
| Human | 0.124 | 0.096 | 0.053 | 0.038 | 0.043 | 0.046 | 0.050 | 0.050 | |
| 0.157 | 0.113 | 0.058 | 0.040 | 0.046 | 0.050 | 0.064 | 0.055 | ||
| Rabbit | 0.135 | 0.108 | 0.066 | 0.050 | 0.056 | 0.048 | 0.060 | 0.062 | |
| 0.176 | 0.132 | 0.074 | 0.054 | 0.061 | 0.052 | 0.066 | 0.070 | ||
| Mouse | 0.132 | 0.106 | 0.072 | 0.058 | 0.063 | 0.055 | 0.048 | 0.020 | |
| 0.170 | 0.129 | 0.082 | 0.064 | 0.069 | 0.060 | 0.071 | 0.021 | ||
| Rat | 0.133 | 0.108 | 0.074 | 0.060 | 0.064 | 0.056 | 0.066 | 0.023 | |
| 0.171 | 0.132 | 0.084 | 0.066 | 0.071 | 0.062 | 0.071 | 0.024 |
Note—The values show the observed and JTT+4Γ+Ι distances (top to bottom). Above diagonal the maxgen and below maxspe alignment.
Figure 3Phylogenetic relationships based on partitioned Bayesian inference of the maxgen alignment (164466 nt) and a GTR+4Γ+I model of sequence evolution.
The partitioning are according to four evolutionary rate classes and codon positions. The model parameters are estimated separate for every partitioning.
Figure 4Relationships between species in the tested alternative topologies.
ML support for the different phylogenetic hypothesis.
| tree-1 | tree-2 | tree-3 | tree-4 | tree-5 | tree-6 | |||
| aa |
| JTT+4Γ+I | [250468.7] | 2.11 | 1.63 | 3.03 | 2.20 | 2.21 |
| pSH | 1.000 | 0.313 | 0.417 | 0.010 | 0.020 | 0.019 | ||
|
| JTT+4Γ+I | [127791.9] | 0.96 | 0.67 | 2.15 | 1.81 | 1.68 | |
| pSH | 1.000 | 0.670 | 0.673 | 0.090 | 0.052 | 0.060 | ||
| 12cdp |
| GTR+4Γ+I | [331163.9] | 2.59 | 2.53 | 3.77 | 1.86 | 1.63 |
| pSH | 1.000 | 0.216 | 0.248 | 0.000 | 0.033 | 0.067 | ||
|
| GTR+4Γ+I | [152577.9] | 1.43 | 1.79 | 2.23 | 1.08 | 0.89 | |
| pSH | 1.000 | 0.482 | 0.404 | 0.072 | 0.203 | 0.259 | ||
| 123cdp |
| GTR+4Γ+I | [668754.0] | 6.84 | 6.03 | 5.63 | 1.18 | 2.65 |
| pSH | 1.000 | 0.001 | 0.006 | 0.000 | 0.151 | 0.007 | ||
|
| GTR+4Γ+I | [326172.5] | 4.33 | 5.35 | 4.47 | 1.93 | 3.39 | |
| pSH | 1.000 | 0.056 | 0.069 | 0.000 | 0.035 | 0.000 |
Note–The support is expressed as the ΔlogL/S.E. ratio and the pSH value. The -logL value of the best tree is shown in square brackets.
Support of three alternative topologies by ML analysis of single genes.
| Topology | Number of genes | ||
| aa+4Γ+I | cdp12+4Γ+I | cdp123+4Γ+I | |
| tree-1 | 25 | 31 | 42 |
| tree-2 | 3 | 0 | 4 |
| tree-3 | 14 | 16 | 23 |
| tree-4 | 4 | 8 | 2 |
| tree-5 | 12 | 7 | 11 |
| tree-6 | 11 | 20 | 17 |
| unresolved | 92 | 79 | 62 |
NOTE–The number of genes that support a fixed topology with a ΔlogL/S.E. ratio equal or better than 0.5 are shown. The resolution was regarded as unresolved when the ΔlogL/S.E. ratio was smaller than 0.5.
Numbers of genes that support a certain clade.
| Clade | Number of genes | ||
| aa+4Γ+I | 12cdp+4Γ+I | 123cdp+4Γ+I | |
| Boreotheria | 13/55 | 14/49 | 24/50 |
| Euarch.glir | 7/49 | 7/31 | 14/58 |
| Glires | 17/61 | 18/50 | 21/54 |
| Rodentia | 111/147 | 128/142 | 158/160 |
| Cetferung. | 23/74 | 22/60 | 39/82 |
NOTE–The trees have been reconstructed for single gene alignments with TREE-PUZZLE (first value) and PHYML (second value) respectively. HDC: human plus dog plus cow clade, Euarch.glir: Euarchontaglires clade (human plus rodents).
Sequences showing signs of positive selection on at least one branch under branch-site models. Significant LRT at 1%.
| Human accession number | Branch/Species | Codon positions under pos. selection/number of codons | Gene function/name |
| NM_001064 | elephant | 2/204 | transketolase |
| NM_003670 | human | 1/132 | basic helix-loop-helix |
| NM_005094 | cow | 1/86 | solute carrier |
| NM_005866 | rabbit | 1/126 | opioid receptor |
| NM_015150 | Sminthopsis | 2/76 | lipid raft linker |
| NM_012225 | rabbit | 2/163 | nucleotide binding protein |
| NM_000546 | rabbit | 3/123 | tumor protein p53 |
| NM_004559 | elephant | 5/243 | Y box binding protein |
| NM_001428 | rat | 1/284 | p53-binding protein |
| NM_006739 | rabbit | 1/216 | minichromosome maintenance complex |
| NM_018622 | rat | 11/207 | presenilin associated |
| NM_000989 | cow | 4/114 | ribosomal protein L30 |
| NM_024092 | elephant | 2/182 | transmembrane protein |
| NM_006449 | dog | 2/200 | CDC42 effector protein |
| NM_018285 | dog | 2/172 | ribonucleoprotein |
| NM_005530 | cow | 2/232 | isocitrate dehydrogenase |
| NM_015292 | cow | 1/265 | family with sequence similarity 62 |
| NM_004135 | cow | 1/199 | isocitrate dehydrogenase |
| NM_018361 | rodentia | 2/108 | lysophosphatidic acid acyltransferase |
| NM_006530 | rabbit | 2/218 | YEATS domain containing 4 |
| NM_002306 | rabbit | 2/182 | galactoside-binding |
| NM_002306 | mouse | 1/182 | galactoside-binding |
| NM_014300 | rat | 1/163 | SEC11 homolog A |
| Concatenated | Chicken | 55/28521 | |
| Concatenated |
| 8/28521 | |
| Concatenated | Elephant | 40/28521 | |
| Concatenated | Cow | 7/28521 | |
| Concatenated | Dog | 3/28521 | |
| Concatenated | Human | 1/28521 | |
| Concatenated | Rabbit | 5/28521 | |
| Concatenated | Mouse | 1/28521 | |
| Concatenated | Rat | 4/28521 | |
| Concatenated | Cetferungulata | 1/28521 | |
| Concatenated | Rodentia | 1/28521 | |
| Concatenated | Glires | 0/28521 | |
| Concatenated | Euarchontaglires | 0/28521 | |
| Concatenated | Boreotheria | 0/28521 | |
| Concatenated | Placentalia | 9/28521 |