| Literature DB >> 20194423 |
Feng-Chi Chen1, Chiuan-Jung Chen, Wen-Hsiung Li, Trees-Juen Chuang.
Abstract
The evolution of duplicate genes has been a topic of broad interest. Here, we propose that the conservation of gene family size is a good indicator of the rate of sequence evolution and some other biological properties. By comparing the human-chimpanzee-macaque orthologous gene families with and without family size conservation, we demonstrate that genes with family size conservation evolve more slowly than those without family size conservation. Our results further demonstrate that both family expansion and contraction events may accelerate gene evolution, resulting in elevated evolutionary rates in the genes without family size conservation. In addition, we show that the duplicate genes with family size conservation evolve significantly more slowly than those without family size conservation. Interestingly, the median evolutionary rate of singletons falls in between those of the above two types of duplicate gene families. Our results thus suggest that the controversy on whether duplicate genes evolve more slowly than singletons can be resolved when family size conservation is taken into consideration. Furthermore, we also observe that duplicate genes with family size conservation have the highest level of gene expression/expression breadth, the highest proportion of essential genes, and the lowest gene compactness, followed by singletons and then by duplicate genes without family size conservation. Such a trend accords well with our observations of evolutionary rates. Our results thus point to the importance of family size conservation in the evolution of duplicate genes.Entities:
Mesh:
Year: 2010 PMID: 20194423 PMCID: PMC2908708 DOI: 10.1093/molbev/msq055
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
FThe data collation processes.
The Human–Chimpanzee–Macaque Orthologous Gene Families Analyzed in this Study.
| Types of Gene Families | Number of Families | Number of Genes | ||
| H | C | M | ||
| H-C-M families (H, C, M>0) | 9,446 | 17,211 | 16,580 | 17,402 |
| H=C=M | 5,318 | 6,750 | 6,750 | 6,750 |
| H=C=M=1 | 4,343 | 4,343 | 4,343 | 4,343 |
| H=C=M>1 | 975 | 2,407 | 2,407 | 2,407 |
| Non-H=C=M | 1,639 | 5,985 | 5,354 | 6,176 |
| Dup-H≠C≠M (or H≠C≠M; H,C,M>1) | 143 | 1,359 | 1,102 | 1,251 |
The Evolutionary Rates (Ka, Ks, and the Ka/Ks ratio) of the Human–Chimpanzee and Human–Macaque 1:1 Orthologous Gene Pairs in H=C=M and non-H=C=M families.
| Human vs. Chimpanzee (median value) | Human vs. Macaque (median value) | |||||
| Types of Gene Families | ||||||
| H=C=M | 0.0027 | 0.0108 | 0.2516 | 0.0140 | 0.0662 | 0.2093 |
| Non-H=C=M | 0.0030 | 0.0119 | 0.2715 | 0.0169 | 0.0762 | 0.2301 |
| <10−4 | <10−6 | <0.05 | <10−11 | <10−15 | <0.001 | |
P values were estimated by using the two-tailed Wilcoxon rank sum test.
Potential Family Expansion/Contraction in the Human and Chimpanzee Genomes with Reference to the Family Sizes of the Corresponding Rhesus Macaque Families.
FComparisons of median Ka (filled circles), Ks (filled triangles), and Ka/Ks (open diamonds) values of human genes and their closest counterparts in chimpanzee in (A) HS/CS expansion, HS/CS contraction, and H=C=M families; and (B) the C>H>M, all CS expansion (C>H≥M), C>H=M, and H=C=M families. Error bars represent the standard errors.
The Evolutionary Rates (Ka, Ks, and the Ka/Ks ratio) of the Human–Chimpanzee and Human–Macaque 1:1 Orthologous Gene Pairs in H=C=M=1, H=C=M>1, and Dup-H≠C≠M Families.
| Human vs. Chimpanzee (median value) | Human vs. Macaque (median value) | |||||
| Types of Gene Families | ||||||
| H=C=M=1 | 0.0030 | 0.0107 | 0.2727 | 0.0152 | 0.0658 | 0.2318 |
| H=C=M>1 | 0.0022 | 0.0109 | 0.2127 | 0.0115 | 0.0668 | 0.1728 |
| <10−11 | NS | <10−12 | <10−14 | NS | <10−15 | |
| Dup-H≠C≠M (H≠C≠M; H,C,M>1) | 0.0050 | 0.0121 | 0.4237 | 0.0267 | 0.0859 | 0.3350 |
| <10−15 | <0.01 | <10−15 | <10−15 | <10−15 | <10−15 | |
| <10−15 | <10−4 | <10−12 | <10−15 | <10−15 | <10−11 | |
NOTE.—NS, not significant.
P values are estimated by using the two-tailed Wilcoxon rank sum test.
FThe left panel compares (A) the proportions of essential genes, (C) the expression levels, (E) the expression breadth, and (G) the gene compactness (average intron/UTR length) between gene families with and without size conservation (“H=C=M” and “non-H=C=M,” respectively). The right panel (B, D, F, and H) compares the four same features in the same order between singleton gene families with size conservation (“H=C=M=1”) and multigene families with or without size conservation (“H=C=M>1” and “dup-H≠C≠M,” respectively). The P values were estimated by using the two-tailed Fisher’s exact test (A, B, E, and F), the two-tailed Wilcoxon rank sum test (C and D), and the two-tailed t-test (G and H). NS, not significant.