| Literature DB >> 19851465 |
Abstract
The notion that gene duplications generating new genes and functions is commonly accepted in evolutionary biology. However, this assumption is more speculative from theory rather than well proven in genome-wide studies. Here, we generated an atlas of the rate of copy number changes (CNCs) in all the gene families of ten animal genomes. We grouped the gene families with similar CNC dynamics into rate pattern groups (RPGs) and annotated their function using a novel bottom-up approach. By comparing CNC rate patterns, we showed that most of the species-specific CNC rates groups are formed by gene duplication rather than gene loss, and most of the changes in rates of CNCs may be the result of adaptive evolution. We also found that the functions of many RPGs match their biological significance well. Our work confirmed the role of gene duplication in generating novel phenotypes, and the results can serve as a guide for researchers to connect the phenotypic features to certain gene duplications.Entities:
Mesh:
Year: 2009 PMID: 19851465 PMCID: PMC2761543 DOI: 10.1371/journal.pone.0007342
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
General statistics of gene families in 10 animal species.
| Species | # of genes | # of gene families | # of genes per family |
| Human | 22680 | 12241 | 1.85 |
| Macaca | 21944 | 10395 | 2.11 |
| Mouse | 24118 | 11801 | 2.04 |
| Rat | 22993 | 10375 | 2.22 |
| Cow | 21755 | 10152 | 2.14 |
| Dog | 19305 | 9743 | 1.98 |
| Opossum | 19520 | 9267 | 2.11 |
| Chicken | 16736 | 9818 | 1.7 |
| Zebrafish | 21322 | 7766 | 2.75 |
| Fruitfly | 14039 | 9214 | 1.52 |
includes singleton gene families.
The first 21 major RPGs.
| Pattern ID | Pattern content | # of gene families |
|
|
| 4543 (19.16%) |
|
|
| 1984 (8.37%) |
|
|
| 1981 (8.35%) |
|
|
| 1642 (6.92%) |
|
|
| 869 (3.66%) |
|
|
| 800 (3.37%) |
|
|
| 798 (3.37%) |
|
|
| 768 (3.24%) |
|
|
| 559 (2.36%) |
|
|
| 474 (2.00%) |
|
|
| 385 (1.62%) |
|
|
| 385 (1.62%) |
|
|
| 377 (1.59%) |
|
|
| 374 (1.58%) |
|
|
| 333 (1.40%) |
|
|
| 262 (1.10%) |
|
|
| 216 (0.91%) |
|
|
| 192 (0.81%) |
|
|
| 182 (0.77%) |
|
|
| 164 (0.69%) |
|
|
| 124 (0.52%) |
| Total | 17412 (73.43%) |
Sorted by the number of genes in each pattern in descending order. When species are connected with “ = ”, they are sorted by alphabetic order.
shortened pattern IDs, which are indexed based on the number of gene families in each pattern, the smaller the ID number is the more genes the pattern has.
Species-specific conjugated RPG pairs.
| RPG pairs | ||||
| Expanded | Shrunk | |||
| Species | Index | # of gene families | Index | # of gene families |
| Fruitfly | P0 | 4543 (49.3%) | P6 | 798 (8.7%) |
| Zebrafish | P5 | 800 (10.3%) | P19 | 164 (2.1%) |
| Chicken | P2 | 1981 (20.2%) | P20 | 124 (1.3%) |
| Opossum | P11 | 385 (4.2%) | P34 | 37 (0.40%) |
| Cow | P4 | 869 (8.6%) | P25 | 59 (0.58%) |
| Dog | P17 | 192 (2.0%) | P114 | 11 (0.11%) |
| Rat | P9 | 474 (4.6%) | P59 | 24 (0.23%) |
| Mouse | P3 | 1642 (13.9%) | P637 | 2 (0.017%) |
| Macaca | P10 | 385 (3.7%) | P78 | 18 (0.17%) |
Within the parenthesis are the percentages of the numbers of gene families in RPGs to the total numbers of gene families in the corresponding species.
Figure 1Species tree with divergence times.
Figure 2Representative annotation matrices for biological process.
Grey cells in the matrix are not within the outlier of each category. The left part of the tree are generated from GO DAGs not based on the similarities of the rows of the matrix.
Figure 3Representative annotation matrices for molecular function.
Grey cells in the matrix are not within the outlier of each category. The left part of the tree are generated from GO DAGs not based on the similarities of the rows of the matrix.
Figure 4Representative annotation matrices for cellular component.
Grey cells in the matrix are not within the outlier of each category. The left part of the tree are generated from GO DAGs not based on the similarities of the rows of the matrix.
Top species-specific annotations (BP).
| Species | Pattern | Go id | Weight | GO Description |
| fly | P0 | GO:0050911 | 72.98 | detection of chemical stimulus involved in sensory perception of smell |
| GO:0035072 | 62.76 | ecdysone-mediated induction of salivary gland cell autophagic cell death | ||
| GO:0007174 | 43.29 | epidermal growth factor ligand processing | ||
| GO:0002077 | 37.10 | acrosome matrix dispersal | ||
| GO:0006510 | 37.01 | ATP-dependent proteolysis | ||
| GO:0034223 | 36.86 | regulation of spore wall chitin biosynthetic process | ||
| GO:0007218 | 35.42 | neuropeptide signaling pathway | ||
| GO:0032199 | 33.93 | transcription during RNA-mediated transposition | ||
| GO:0021973 | 33.78 | corticospinal neuron axon decussation | ||
| GO:0033355 | 29.38 | ascorbate glutathione cycle | ||
| GO:0034234 | 27.21 | regulation of spore wall chitin catabolic process | ||
| GO:0002028 | 26.17 | regulation of sodium ion transport | ||
| P6 | GO:0032199 | 72.94 | transcription during RNA-mediated transposition | |
| GO:0016578 | 46.15 | histone deubiquitination | ||
| GO:0007220 | 37.15 | Notch receptor processing | ||
| GO:0045812 | 29.75 | negative regulation of Wnt receptor signaling pathway, calcium modulating pathway | ||
| GO:0016525 | 29.24 | negative regulation of angiogenesis | ||
| GO:0006510 | 29.14 | ATP-dependent proteolysis | ||
| GO:0001759 | 28.01 | induction of an organ | ||
| GO:0010165 | 26.78 | response to X-ray | ||
| GO:0048286 | 26.41 | alveolus development | ||
| GO:0007069 | 25.13 | negative regulation of transcription from RNA polymerase I promoter, mitotic | ||
| GO:0007071 | 25.11 | negative regulation of transcription from RNA polymerase III promoter, mitotic | ||
| GO:0006888 | 24.59 | ER to Golgi vesicle-mediated transport | ||
| GO:0031564 | 23.78 | transcription antitermination | ||
| GO:0007329 | 23.30 | positive regulation of transcription from RNA polymerase II promoter by pheromones | ||
| GO:0045766 | 23.24 | positive regulation of angiogenesis | ||
| GO:0034205 | 23.03 | beta-amyloid formation | ||
| zebrafish | P5 | GO:0032199 | 29.31 | transcription during RNA-mediated transposition |
| GO:0050911 | 21.01 | detection of chemical stimulus involved in sensory perception of smell | ||
| GO:0006419 | 19.36 | alanyl-tRNA aminoacylation | ||
| GO:0051895 | 17.82 | negative regulation of focal adhesion formation | ||
| GO:0031564 | 16.63 | transcription antitermination | ||
| GO:0046856 | 16.26 | phosphoinositide dephosphorylation | ||
| GO:0035072 | 16.12 | ecdysone-mediated induction of salivary gland cell autophagic cell death | ||
| GO:0001947 | 13.53 | heart looping | ||
| GO:0051898 | 12.87 | negative regulation of protein kinase B signaling cascade | ||
| P19 | GO:0016578 | 23.05 | histone deubiquitination | |
| GO:0016598 | 16.25 | protein arginylation | ||
| GO:0006423 | 12.39 | cysteinyl-tRNA aminoacylation | ||
| chicken | P2 | GO:0042450 | 19.30 | arginine biosynthetic process via ornithine |
| GO:0045768 | 12.30 | positive regulation of anti-apoptosis | ||
| P20 | GO:0006434 | 19.97 | seryl-tRNA aminoacylation | |
| GO:0000716 | 16.31 | transcription-coupled nucleotide-excision repair, DNA damage recognition | ||
| GO:0007042 | 12.63 | lysosomal lumen acidification | ||
| GO:0032199 | 10.57 | transcription during RNA-mediated transposition | ||
| opossum | P11 | GO:0007604 | 9.04 | phototransduction, UV |
| GO:0043576 | 9.02 | regulation of respiratory gaseous exchange | ||
| dog | P17 | GO:0006269 | 10.14 | DNA replication, synthesis of RNA primer |
| cow | P4 | GO:0006888 | 13.07 | ER to Golgi vesicle-mediated transport |
| GO:0021777 | 11.18 | BMP signaling pathway involved in spinal cord association neuron specification | ||
| mouse | P3 | GO:0034080 | 6.60 | DNA replication-independent nucleosome assembly at centromere |
| GO:0006335 | 6.51 | DNA replication-dependent nucleosome assembly | ||
| rat | P9 | GO:0032199 | 11.40 | transcription during RNA-mediated transposition |
| macaca | P10 | GO:0018279 | 13.18 | protein amino acid N-linked glycosylation via asparagine |
| human | P1 | GO:0000710 | 12.19 | meiotic mismatch repair |
| GO:0007218 | 9.05 | neuropeptide signaling pathway | ||
| GO:0043089 | 8.07 | positive regulation of Cdc42 GTPase activity |
Top group-specific annotations (BP).
| Groups | Pattern | Go id | Weight | GO Description |
| fly and zebrafish | P15 | GO:0032199 | 52.20 | transcription during RNA-mediated transposition |
| GO:0001569 | 39.08 | patterning of blood vessels | ||
| GO:0007329 | 33.92 | positive regulation of transcription from RNA polymerase II promoter by pheromones | ||
| GO:0001755 | 32.47 | neural crest cell migration | ||
| GO:0007071 | 29.86 | negative regulation of transcription from RNA polymerase III promoter, mitotic | ||
| GO:0045494 | 29.42 | photoreceptor cell maintenance | ||
| GO:0046024 | 28.74 | positive regulation of transcription from RNA polymerase III promoter, mitotic | ||
| GO:0007069 | 27.89 | negative regulation of transcription from RNA polymerase I promoter, mitotic | ||
| GO:0046020 | 27.83 | negative regulation of transcription from RNA polymerase II promoter by pheromones | ||
| GO:0046018 | 26.76 | positive regulation of transcription from RNA polymerase I promoter, mitotic | ||
| GO:0030321 | 26.30 | transepithelial chloride transport | ||
| GO:0030502 | 26.04 | negative regulation of bone mineralization | ||
| GO:0001560 | 25.31 | regulation of cell growth by extracellular stimulus | ||
| GO:0030514 | 23.58 | negative regulation of BMP signaling pathway | ||
| GO:0060196 | 23.33 | positive regulation of antisense RNA transcription | ||
| cold | P8 | GO:0007004 | 37.74 | telomere maintenance via telomerase |
| blooded | GO:0031295 | 28.86 | T cell costimulation | |
| GO:0032199 | 28.40 | transcription during RNA-mediated transposition | ||
| GO:0045842 | 27.47 | positive regulation of mitotic metaphase/anaphase transition | ||
| GO:0007257 | 21.80 | activation of JNK activity | ||
| GO:0007037 | 18.16 | vacuolar phosphate transport | ||
| GO:0050860 | 17.26 | negative regulation of T cell receptor signaling pathway | ||
| GO:0045086 | 17.07 | positive regulation of interleukin-2 biosynthetic process | ||
| GO:0002071 | 16.47 | glandular epithelial cell maturation | ||
| GO:0050434 | 15.24 | positive regulation of viral transcription | ||
| non-mammals | P12 | GO:0006120 | 34.89 | mitochondrial electron transport, NADH to ubiquinone |
|
| GO:0032199 | 22.44 | transcription during RNA-mediated transposition | |
| GO:0016578 | 19.51 | histone deubiquitination | ||
| GO:0031658 | 13.21 | G1/S-specific negative regulation of cyclin-dependent protein kinase activity | ||
| GO:0031661 | 13.20 | G2/M-specific negative regulation of cyclin-dependent protein kinase activity | ||
| GO:0030317 | 13.03 | sperm motility | ||
| GO:0042776 | 11.86 | mitochondrial ATP synthesis coupled proton transport | ||
| GO:0042777 | 11.86 | plasma membrane ATP synthesis coupled proton transport | ||
| non-placental | P16 | GO:0030890 | 22.66 | positive regulation of B cell proliferation |
|
| GO:0042104 | 18.04 | positive regulation of activated T cell proliferation | |
| GO:0048304 | 17.04 | positive regulation of isotype switching to IgG isotypes | ||
| GO:0042523 | 13.13 | positive regulation of tyrosine phosphorylation of Stat5 protein | ||
| murine | P14 | GO:0006465 | 12.00 | signal peptide processing |
Top annotations of the conserved RPG.
| Groups | Pattern | Go id | Weight | GO Description |
| BP | P7 | GO:0016578 | 64.15 | histone deubiquitination |
| GO:0032199 | 47.44 | transcription during RNA-mediated transposition | ||
| GO:0006387 | 38.81 | snRNA capping | ||
| GO:0016245 | 37.80 | hyperphosphorylation of RNA polymerase II | ||
| GO:0007368 | 34.31 | determination of left/right symmetry | ||
| GO:0006888 | 30.94 | ER to Golgi vesicle-mediated transport | ||
| GO:0032858 | 29.67 | activation of Rab GTPase | ||
| GO:0000059 | 29.31 | protein import into nucleus, docking | ||
| GO:0030503 | 29.07 | regulation of cell redox homeostasis | ||
| GO:0006388 | 28.31 | tRNA splicing | ||
| GO:0006422 | 28.11 | aspartyl-tRNA aminoacylation | ||
| GO:0006303 | 25.91 | double-strand break repair via nonhomologous end joining | ||
| GO:0007095 | 25.68 | mitotic cell cycle G2/M transition DNA damage checkpoint | ||
| MF | P7 | GO:0005524 | 419.15 | ATP binding |
| GO:0008270 | 385.80 | zinc ion binding | ||
| GO:0005525 | 105.26 | GTP binding | ||
| GO:0005509 | 102.04 | calcium ion binding | ||
| GO:0000287 | 88.52 | magnesium ion binding | ||
| GO:0003735 | 86.97 | structural constituent of ribosome | ||
| GO:0033392 | 84.09 | actin homodimerization activity | ||
| GO:0003705 | 78.67 | RNA polymerase II transcription factor activity, enhancer binding | ||
| GO:0003743 | 76.45 | translation initiation factor activity | ||
| GO:0050372 | 55.98 | ubiquitin-calmodulin ligase activity | ||
| GO:0003924 | 55.34 | GTPase activity | ||
| GO:0032791 | 51.04 | lead ion binding | ||
| GO:0046982 | 49.70 | protein heterodimerization activity | ||
| GO:0046911 | 48.52 | metal chelating activity | ||
| GO:0003689 | 47.82 | DNA clamp loader activity | ||
| GO:0015616 | 47.82 | DNA translocase activity |
Figure 5An example sub-graph of GO DAGs.