| Literature DB >> 16466575 |
Dirk Steinke1, Walter Salzburger, Ingo Braasch, Axel Meyer.
Abstract
BACKGROUND: Gene and genome duplication events increase the amount of genetic material that might then contribute to an increase in the genomic and phenotypic complexity of organisms during evolution. Thus, it has been argued that there is a relationship between gene copy number and morphological complexity and/or species diversity. This hypothesis implies that duplicated genes have subdivided or evolved novel functions compared to their pre-duplication proto-orthologs. Such a functional divergence might be caused by an increase in evolutionary rates in one ortholog, by changes in expression, regulatory evolution, insertion of repetitive elements, or due to positive Darwinian selection in one copy. We studied a set of 2466 genes that were present in Danio rerio, Takifugu rubripes, Tetraodon nigroviridis and Oryzias latipes to test (i) for forces of positive Darwinian selection; (ii) how frequently duplicated genes are retained, and (iii) whether novel gene functions might have evolved.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16466575 PMCID: PMC1413527 DOI: 10.1186/1471-2164-7-20
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Flowchart of the analysis routine used in this study.
Figure 2Example of a ternary representation of distances of fish species to human orthologs (Danio rerio, Oryzias latipes, Tetraodon nigroviridis). The red circle represents Lysyl-oxidase-like-1 (LOXL-1), a gene with significantly lower distance from Tetraodon nigroviridis to the human ortholog than to other fish species. The blue circle represents the ankyrin repeat domain 10, a gene with significantly higher distance from Danio rerio to the human ortholog than to other fish species.
Abundance and proportion of protein genes of a given fish species with significantly lower or higher distance to the human ortholog than other species. The total number of comparisons was N = 2466. a denotes genes with significantly lower or higher distance for both pufferfish species
| Pufferfishesa | |||||||||
| lower | higher | lower | higher | lower | higher | lower | higher | lower | higher |
| 25 | 35 | 58 | 81 | 80 | 64 | 208 | 26 | 19 | 14 |
| ∑ = 60 | ∑ = 139 | ∑ = 144 | ∑ = 234 | ∑ = 33 | |||||
| 1.01% | 1.42% | 2.35% | 3.28% | 3.24% | 2.60% | 8.43% | 1.05% | 0.77% | 0.57% |
| ∑ = 2.43% | ∑ = 5.64% | ∑ = 5.84% | ∑ = 9.49% | ∑ = 1.34% | |||||
Figure 3Phylogeny of the studied fish species. The proportion of genes with significantly lower (red) or higher (blue) distance to the human ortholog than to other fish species, are mapped onto the phylogeny as proportional triangles. Numbers within the triangle represent the total abundance of those genes. The percentages represent the corresponding proportion of transcription factors.
Tetraodon nigroviridis protein genes under positive Darwinian selection with significantly lower (a) or higher (b) distance to the human ortholog than other fish species. Duplicated genes are given with rate differences between the two copies according to the relative rate test of Tajima (1993) at the 5% level with one asterisk and at the 1% level with two asterisks.
| lysyl oxidase-like 1 a | 0.247** | |
| nuclear LIM interactor-interacting factor 2 a | 0.120** | |
| chloride intracellular channel 5 a | 0.053* | |
| twisted gastrulation a | - | |
| TAFA2 a | 0.014 | |
| serine/threonine protein kinase 6; aurora-A; IPL1-related kinase a | 0.067** | |
| hypothetical protein DKFZp564D0478 a | 0.147* | |
| eukaryotic translation elongation factor 1 gamma a | - | |
| TTK protein kinase a | - | |
| zinc finger protein 207 a | - | |
| exosome component 7a | - | |
| hypothetical protein FLJ10996 a | - | |
| ubiquitin domain containing 1 a | 0.128 | |
| peroxisomal lon protease a | - | |
| hypothetical protein FLJ21156 a | - | |
| syntaxin binding protein 1 b | 0.350** | |
| short-chain dehydrogenase/reductase b | - | |
| exostosin 1 b | 0.074 |
Takifugu rubripes protein genes under positive Darwinian selection with significantly lower (a) or higher (b) distance to the human ortholog than other fish species. Duplicated genes are given with rate differences between the two copies according to the relative rate test of Tajima (1993) at the 5% level with one asterisk and at the 1% level with two asterisks.
| FRUP00000147506 | syntaxin 3A a | 0.178** |
| FRUP00000132184 | lysyl oxidase-like 1 a | 0.207** |
| FRUP00000139282 | chloride intracellular channel 5 a | 0.074 |
| FRUP00000156028 | twisted gastrulation a | - |
| FRUP00000136434 | mitochondrial elongation factor G1 a | 0.170* |
| FRUP00000139600 | TAFA2 a | - |
| FRUP00000129212 | serine/threonine protein kinase 6 a | - |
| FRUP00000153498 | exostosin 1 a | 0.086 |
| FRUP00000133663 | zinc finger protein 106 homolog b | 0.201** |
| FRUP00000162759 | hydroxysteroid (17-beta) dehydrogenase 4 a | - |
| FRUP00000133083 | dUTP pyrophosphatase a | - |
| FRUP00000133686 | excision repair cross-complementing 1 isoform 2 a | - |
| FRUP00000132409 | cytochrome c oxidase subunit Va a | - |
| FRUP00000129950 | glycine cleavage system protein H a | - |
| FRUP00000140923 | mitogen-activated protein kinase kinase kinase kinase 2 a | - |
| FRUP00000157532 | tryptophan rich basic protein a | - |
| FRUP00000133839 | stress-induced-phosphoprotein 1 a | - |
| FRUP00000136741 | dual specificity phosphatase 14 a | - |
| FRUP00000148346 | transitional epithelia response protein a | - |
| FRUP00000149078 | drebrin-like a | - |
| FRUP00000151439 | jumonji domain containing 2C a | 0.319** |
| FRUP00000134740 | tumor necrosis factor type 1 receptor associated protein a | 0.099** |
| FRUP00000133087 | leprecan-like 1 a | 0.317** |
| FRUP00000144902 | calcium binding protein 5 a | 0.012 |
| FRUP00000162608 | neurogenic differentiation 4 a | - |
| FRUP00000130779 | FLJ21963 protein a | - |
| FRUP00000143840 | thioredoxin domain containing a | - |
| FRUP00000154905 | terminal deoxynucleotidyltransferase interacting factor 1 a | - |
| FRUP00000128255 | SMILE protein a | - |
Danio rerio genes under positive Darwinian selection with significantly lower (a) or higher (b) distance to the human ortholog than other fish species. Duplicated genes are given with rate differences between the two copies according to the relative rate test of Tajima (1993) at the 5% level with one asterisk and at the 1% level with two asterisks. Known gene symbols according to ZFIN are also provided.
| ENSDARP00000003385 | ankyrin repeat domain 28 a | 0.285** | ankrd |
| ENSDARP00000024204 | vesicle-associated membrane protein A isoform 2 a | - | vapa |
| ENSDARP00000006783 | cofilin 2 a | 0.078* | cfl2 |
| ENSDARP00000026380 | KIAA0073 protein a | - | - |
| ENSDARP00000039499 | haloacid dehalogenase-like hydrolase domain containing 2 a | - | - |
| ENSDARP00000005886 | arsenate resistance protein ARS2 isoform b a | - | ars2 |
| ENSDARP00000049098 | thyroid hormone receptor interactor 13 a | - | trip11 |
| ENSDARP00000010419 | DnaJ subfamily A member 2 a | 0.135 | dnaja2 |
| ENSDARP00000048880 | gamma-glutamyl Carboxylase a | - | ggcx |
| ENSDARP00000025803 | SEC14 (S. cerevisiae)-like 1 a | 0.114 | sec14l1 |
| ENSDARP00000001039 | hypothetical protein MGC10882 a | - | - |
| ENSDARP00000038616 | glutaryl-Coenzyme A dehydrogenase isoform a precursor a | 0.083 | gcdh |
| ENSDARP00000024124 | activin A type II receptor precursor a | 0.057** | acvr2a |
| ENSDARP00000025487 | DKFZP434B168 a | - | dkfzp434b168 |
| ENSDARP00000024082 | 5-aminoimidazole-4-carboxamide ribonucleotide formyltransferase/IMP a | - | atic |
| ENSDARP00000038219 | roundabout 1 isoform a a | 0.276* | robo1 |
| ENSDARP00000008117 | HIRA interacting protein 5 a | - | - |
| ENSDARP00000026984 | beta catenin-like 1 a | - | - |
| ENSDARP00000046992 | carbohydrate (chondroitin) synthase 1 a | - | chys1 |
| ENSDARP00000016016 | phosphatidylserine decarboxylase a | - | - |
| ENSDARP00000023471 | ADP-ribosylation factor-like 6 interacting protein a | - | arl6ip |
| ENSDARP00000002978 | aspartate aminotransferase 1 a | 0.015** | - |
| ENSDARP00000016111 | ADP-ribosylation factor 4-like a | 0.032** | arl4 |
| ENSDARP00000018086 | androgen-induced 1 a | - | aig1 |
| ENSDARP00000015026 | signal recognition particle 72 kDa a | - | wu:fi03d11 |
| ENSDARP00000038278 | cytidine deaminase a | - | aicda |
| ENSDARP00000006977 | ankyrin repeat domain 10 b | 0.211 | ankrd10 |
| ENSDARP00000009789 | chromobox homolog 3 HP1 gamma a | 0.009** | cbx |
| ENSDARP00000016540 | RAB25 a | 0.170* | rab |
Oryzias latipes genes under positive Darwinian selection with significantly lower distance to the human ortholog than other fish species. Duplicated genes are given with rate differences between the two copies according to the relative rate test of Tajima (1993) at the 5% level with one asterisk and at the 1% level with two asterisks.
| DNA topoisomerase I | - | |
| CLIP-associating protein 1 | - | |
| Ras-related associated with diabetes | - | |
| atrophin-1 interacting protein 1 | - | |
| TAF6-like RNA polymerase II | - | |
| protein inhibitor of activated STAT X isoform beta | - | |
| metastasis suppressor 1 | - | |
| Utrophin | - | |
| SNF2 histone linker PHD RING helicase | - | |
| AMPK-related protein kinase 5 | - | |
| hippocampus abundant transcript 1 | 0.080* | |
| ephrin receptor EphA2 | 0.149* | |
| mitogen-activated protein kinase kinase kinase kinase 3 | - | |
| NICE-4 protein | - | |
| tsJ homolog 1 isoform a | - | |
| transcription factor T | - | |
| G protein-coupled receptor 155 | - | |
| transducin-like enhancer protein 4 | - | |
| protein kinase C and casein kinase substrate in neurons 2 | - | |
| nucleoporin 62 kDa | - | |
| fibulin 6 | - | |
| TRAF2 and NCK interacting kinase | - | |
| PDZ domain containing ring finger 1 | - | |
| endothelin converting enzyme 1 | 0.084 | |
| DIP2-like protein isoform a | - | |
| microfibrillar-associated protein 1 | - | |
| BCL2/adenovirus E1B 19kD interacting protein 2 | - | |
| component of oligomeric golgi complex 4 | - | |
| UDP-glucose ceramide glucosyltransferase-like 1 | - | |
| nuclear phosphoprotein PWP1 | - | |
| ribosomal protein L27a | - | |
| pre-B-cell colony enhancing factor 1 isoform a | - | |
| TCDD-inducible poly(ADP-ribose) polymerase | - | |
| ubiquitin specific protease 34 | - | |
| neuro-oncological ventral antigen 1 isoform 2 | - | |
| thimet oligopeptidase 1 | - | |
| dihydrolipoamide S-succinyltransferase | - | |
| myeloid/lymphoid or mixed-lineage leukemia 3 | - | |
| chromodomain helicase DNA binding protein 4 | - | |
| engulfment and cell motility 1 isoform 1 | - | |
| transcription factor AP-2 alpha | - | |
| spermine synthase | - | |
| Rb1-inducible coiled coil protein 1 | - | |
| intersectin 1 isoform ITSN-l | - | |
| B-cell lymphoma 6 protein; | - | |
| RE1-silencing transcription factor | - | |
| MLL septin-like fusion | - | |
| T-cell lymphoma invasion and metastasis 1 | - | |
| RAD21 homolog | - | |
| SH3-domain binding protein 4 | - | |
| serine/threonine kinase 2 | - | |
| methylene tetrahydrofolate dehydrogenase 2 precursor | - | |
| Huntingtin | - | |
| bromodomain containing protein 2 | - | |
| fibrinogen C domain containing 1 | 0.061 | |
| JM1 protein | - | |
| Moesin | - | |
| microtubule associated serine/threonine kinase-like | - |
All fish specific paralogous pairs of proteins discovered in this study with significantly lower or higher distance to the human ortholog than other fish species. "dR/dC>1" represents the number of pairs where at least one paralog is under positive Darwinian selection according to the method of Hughes et al. (1990). "rel. rate sign." represents the number of pairs where at least one paralog shows a significantly higher rate of evolution according to the relative rate test of Tajima (1993).
| lower distance | 13 | 44.83% | 16 | 55.17% | 29 |
| higher distance | 1 | 5.00% | 8 | 40.00% | 20 |
| ∑ | 14 | 28.57% | 24 | 48.98% | 49 |
Figure 4Abundances of gene functions (according to GO) of all fish protein genes with significantly lower or higher distance to the human ortholog than to other fish species (blue) compared to total abundances of gene functions (violet). The percentages represent the rates of the chosen genes compared to the total number of proteins of a given gene function. Significant differences between total and divergent gene abundances according to a χ2-Test are given at the 5% level with one asterisk and at the 1% level with two asterisks.
Figure 5Abundances of gene functions (according to GO) of all fish protein genes under positive Darwinian selection with significantly lower or higher distance to the human ortholog than to other fish species.
Figure 6Comparison of rates of gene functions (according to GO) of all fish protein genes with significantly lower distance to the human ortholog than other fish species.