| Literature DB >> 26417197 |
Dahai Gao1, Dennis C Ko2, Xinmin Tian3, Guang Yang4, Liuyang Wang2.
Abstract
Gene duplication has been proposed to serve as the engine of evolutionary innovation. It is well recognized that eukaryotic genomes contain a large number of duplicated genes that evolve new functions or expression patterns. However, in mollusks, the evolutionary mechanisms underlying the divergence and the functional maintenance of duplicate genes remain little understood. In the present study, we performed a comprehensive analysis of duplicate genes in the protein kinase superfamily using whole genome and transcriptome data for the Pacific oyster. A total of 64 duplicated gene pairs were identified based on a phylogenetic approach and the reciprocal best BLAST method. By analyzing gene expression from RNA-seq data from 69 different developmental and stimuli-induced conditions (nine tissues, 38 developmental stages, eight dry treatments, seven heat treatments, and seven salty treatments), we found that expression patterns were significantly correlated for a number of duplicate gene pairs, suggesting the conservation of regulatory mechanisms following divergence. Our analysis also identified a subset of duplicate gene pairs with very high expression divergence, indicating that these gene pairs may have been subjected to transcriptional subfunctionalization or neofunctionalization after the initial duplication events. Further analysis revealed a significant correlation between expression and sequence divergence (as revealed by synonymous or nonsynonymous substitution rates) under certain conditions. Taken together, these results provide evidence for duplicate gene sequence and expression divergence in the Pacific oyster, accompanying its adaptation to harsh environments. Our results provide new insights into the evolution of duplicate genes and their expression levels in the Pacific oyster.Entities:
Keywords: Pacific oyster; RNA-seq; duplicate genes; protein kinase superfamily
Year: 2015 PMID: 26417197 PMCID: PMC4573066 DOI: 10.4137/EBO.S30230
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
Figure 1Phylogenetic relationship of protein kinases from Pacific oyster. NJ topology was represented and bootstrap values were shown for the clades with more than 50% support. The scale bar indicates the number of amino acid substitutions per site. The genes with red circles represent the identified duplicate paralogs.
Identified duplicate protein kinase gene pairs and related information.
| PAIR NAME | GENE NAME | AMINO ACID LENGTH (aa) | SCAFFOLD | STRAND | EXON NUMBER |
|---|---|---|---|---|---|
| CGI_10022762 | 291 | scaffold443 | − | 8 | |
| CGI_10009747 | 319 | scaffold322 | − | 7 | |
| CGI_10005548 | 317 | scaffold268 | − | 6 | |
| CGI_10010191 | 1058 | scaffold930 | − | 13 | |
| CGI_10028660 | 360 | scaffold150 | + | 1 | |
| CGI_10028659 | 325 | scaffold150 | − | 4 | |
| CGI_10006983 | 563 | scaffold401 | + | 10 | |
| CGI_10016867 | 989 | scaffold1579 | + | 8 | |
| CGI_10009421 | 672 | scaffold116 | − | 25 | |
| CGI_10009798 | 439 | scaffold1560 | − | 13 | |
| CGI_10011917 | 1281 | scaffold1874 | − | 39 | |
| CGI_10014644 | 1414 | scaffold43964 | − | 21 | |
| CGI_10012098 | 327 | scaffold1195 | + | 10 | |
| CGI_10004264 | 409 | scaffold40612 | + | 2 | |
| CGI_10009696 | 701 | scaffold372 | − | 16 | |
| CGI_10028458 | 792 | scaffold102 | − | 7 | |
| CGI_10021945 | 442 | scaffold1086 | − | 11 | |
| CGI_10026068 | 488 | scaffold1174 | + | 18 | |
| CGI_10026336 | 486 | scaffold678 | + | 12 | |
| CGI_10003308 | 360 | scaffold39368 | + | 10 | |
| CGI_10016803 | 1961 | scaffold556 | − | 25 | |
| CGI_10010738 | 350 | scaffold954 | + | 9 | |
| CGI_10015287 | 936 | scaffold44008 | − | 26 | |
| CGI_10009716 | 504 | scaffold1028 | + | 13 | |
| CGI_10022933 | 545 | scaffold950 | − | 11 | |
| CGI_10013006 | 595 | scaffold1164 | − | 13 | |
| CGI_10004418 | 1209 | scaffold201 | + | 32 | |
| CGI_10021568 | 1755 | scaffold237 | − | 41 | |
| CGI_10007711 | 482 | scaffold42776 | + | 14 | |
| CGI_10017521 | 1106 | scaffold120 | + | 6 | |
| CGI_10010914 | 794 | scaffold1288 | + | 21 | |
| CGI_10001604 | 528 | C35776 | − | 14 | |
| CGI_10009216 | 338 | scaffold1688 | − | 9 | |
| CGI_10003535 | 354 | scaffold39740 | + | 10 | |
| CGI_10018112 | 760 | scaffold396 | + | 16 | |
| CGI_10021856 | 689 | scaffold164 | + | 16 | |
| CGI_10022111 | 491 | scaffold109 | − | 16 | |
| CGI_10024838 | 468 | scaffold492 | − | 13 | |
| CGI_10016201 | 392 | scaffold324 | + | 9 | |
| CGI_10001632 | 444 | scaffold277 | + | 9 | |
| CGI_10028745 | 368 | scaffold1009 | + | 8 | |
| CGI_10026280 | 361 | scaffold1836 | − | 11 | |
| CGI_10014307 | 499 | scaffold737 | + | 2 | |
| CGI_10014308 | 784 | scaffold737 | + | 2 | |
| CGI_10011211 | 324 | scaffold1157 | + | 1 | |
| CGI_10002545 | 387 | scaffold1795 | − | 8 | |
| CGI_10004768 | 1373 | scaffold1107 | + | 22 | |
| CGI_10001297 | 894 | C34444 | − | 12 | |
| CGI_10001599 | 861 | scaffold1453 | + | 18 | |
| CGI_10008024 | 952 | scaffold1277 | + | 21 | |
| CGI_10009988 | 499 | scaffold43366 | − | 13 | |
| CGI_10013117 | 796 | scaffold1252 | + | 19 | |
| CGI_10024885 | 832 | scaffold146 | + | 14 | |
| CGI_10019262 | 615 | scaffold506 | + | 10 | |
| CGI_10019292 | 484 | scaffold363 | + | 17 | |
| CGI_10003652 | 993 | scaffold1088 | − | 8 | |
| CGI_10012076 | 1087 | scaffold1492 | − | 8 | |
| CGI_10012077 | 1082 | scaffold1492 | − | 8 | |
| CGI_10010404 | 1166 | scaffold43446 | − | 16 | |
| CGI_10000974 | 543 | scaffold1496 | − | 6 | |
| CGI_10010300 | 567 | scaffold43426 | − | 2 | |
| CGI_10010302 | 977 | scaffold43426 | − | 2 | |
| CGI_10001931 | 910 | scaffold36398 | + | 17 | |
| CGI_10018647 | 845 | scaffold509 | + | 18 | |
| CGI_10000466 | 269 | C28760 | + | 5 | |
| CGI_10028439 | 1336 | scaffold102 | − | 21 | |
| CGI_10014121 | 370 | scaffold43932 | − | 8 | |
| CGI_10014126 | 1593 | scaffold43932 | + | 19 | |
| CGI_10011806 | 539 | scaffold43696 | + | 11 | |
| CGI_10026689 | 677 | scaffold53 | − | 10 | |
| CGI_10020838 | 5054 | scaffold1244 | − | 72 | |
| CGI_10006699 | 1033 | scaffold42366 | + | 25 | |
| CGI_10018029 | 387 | scaffold12 | + | 8 | |
| CGI_10006070 | 862 | scaffold1840 | + | 17 | |
| CGI_10024845 | 327 | scaffold492 | + | 6 | |
| CGI_10008929 | 441 | scaffold635 | + | 9 | |
| CGI_10007062 | 831 | scaffold1758 | − | 12 | |
| CGI_10007061 | 994 | scaffold1758 | + | 19 | |
| CGI_10027170 | 770 | scaffold1599 | − | 15 | |
| CGI_10028178 | 1359 | scaffold86 | − | 25 | |
| CGI_10012429 | 2389 | scaffold498 | + | 19 | |
| CGI_10008499 | 325 | scaffold43036 | + | 6 | |
| CGI_10011096 | 516 | scaffold340 | − | 14 | |
| CGI_10026549 | 479 | scaffold145 | + | 6 | |
| CGI_10004703 | 474 | scaffold1231 | + | 8 | |
| CGI_10003879 | 403 | scaffold40120 | + | 8 | |
| CGI_10006185 | 585 | scaffold1526 | − | 16 | |
| CGI_10006186 | 730 | scaffold1526 | + | 19 | |
| CGI_10016954 | 667 | scaffold117 | + | 15 | |
| CGI_10016955 | 621 | scaffold117 | + | 15 | |
| CGI_10012313 | 720 | scaffold477 | − | 18 | |
| CGI_10012310 | 720 | scaffold477 | − | 18 | |
| CGI_10016396 | 252 | scaffold594 | + | 3 | |
| CGI_10016395 | 466 | scaffold594 | + | 3 | |
| CGI_10010613 | 594 | scaffold43500 | + | 10 | |
| CGI_10027407 | 432 | scaffold1179 | − | 12 | |
| CGI_10022789 | 283 | scaffold443 | + | 7 | |
| CGI_10019511 | 347 | scaffold376 | + | 7 | |
| CGI_10021030 | 1493 | scaffold672 | − | 36 | |
| CGI_10021977 | 774 | scaffold1108 | − | 19 | |
| CGI_10010190 | 562 | scaffold930 | − | 10 | |
| CGI_10005580 | 517 | scaffold1708 | + | 10 | |
| CGI_10007244 | 271 | scaffold493 | + | 9 | |
| CGI_10004779 | 599 | scaffold1067 | − | 15 | |
| CGI_10018169 | 273 | scaffold459 | + | 7 | |
| CGI_10002412 | 290 | scaffold857 | + | 6 | |
| CGI_10001400 | 485 | scaffold34994 | − | 11 | |
| CGI_10013050 | 331 | scaffold43836 | − | 7 | |
| CGI_10025852 | 784 | scaffold1583 | + | 16 | |
| CGI_10010926 | 384 | scaffold1288 | + | 14 | |
| CGI_10020312 | 344 | scaffold522 | + | 9 | |
| CGI_10023891 | 368 | scaffold48 | − | 11 | |
| CGI_10019854 | 401 | scaffold1512 | + | 4 | |
| CGI_10009355 | 1247 | scaffold43208 | + | 10 | |
| CGI_10012125 | 434 | scaffold1890 | + | 13 | |
| CGI_10027818 | 482 | scaffold198 | + | 13 | |
| CGI_10017610 | 475 | scaffold1670 | + | 13 | |
| CGI_10021263 | 584 | scaffold157 | + | 13 | |
| CGI_10025910 | 645 | scaffold334 | − | 11 | |
| CGI_10019663 | 357 | scaffold1715 | + | 9 | |
| CGI_10020062 | 555 | scaffold258 | + | 1 | |
| CGI_10027350 | 709 | scaffold1179 | − | 14 | |
| CGI_10013344 | 530 | scaffold1894 | − | 11 | |
| CGI_10023484 | 516 | scaffold1258 | − | 13 | |
| CGI_10007923 | 1373 | scaffold42850 | + | 17 | |
| CGI_10028689 | 1383 | scaffold150 | + | 24 | |
| CGI_10015137 | 208 | scaffold1671 | + | 1 | |
| CGI_10025899 | 600 | scaffold733 | + | 2 |
Figure 2The sequence divergence between duplicate pairs. (A) The density distribution of synonymous rate (Ks) for all duplicate gene pairs. (B) The comparisons of Ka/Ks and Ks values, where Ks is a proxy of divergence time between duplicated genes.
Figure 3Expression patterns of duplicate gene pairs. (A) The heatmap was performed using Pearson’s correlation coefficient of gene expression under five conditions, and red to blue blocks indicate high-to-low correlation levels. (B) The boxplot represents the distribution of correlation coefficient values for each expression condition.
Figure 4The relationship between the correlation coefficient (R) of gene expression and Ka (or Ks) in duplicate genes. (A) No correlation between and Ka (or Ks) for tissue expression transcriptomes. (B–E) Negative correlations between and Ka (or Ks) under developmental stages, dry treatments, salt treatments, and heat treatments, respectively. These imply positive correlation between sequence divergence and expression divergence because 1−r can be regarded as expression divergence. Each point represents one gene pair.