| Literature DB >> 31604932 |
Peng Xu1,2,3,4, Jian Xu5, Guangjian Liu6, Lin Chen7, Zhixiong Zhou7, Wenzhu Peng7, Yanliang Jiang5, Zixia Zhao5, Zhiying Jia8, Yonghua Sun9, Yidi Wu7, Baohua Chen7, Fei Pu7, Jianxin Feng10, Jing Luo11, Jing Chai11, Hanyuan Zhang5, Hui Wang7,12, Chuanju Dong12, Wenkai Jiang6, Xiaowen Sun8.
Abstract
Common carp (Cyprinus carpio) is an allotetraploid species derived from recent whole genome duplication and provides a model to study polyploid genome evolution in vertebrates. Here, we generate three chromosome-level reference genomes of C. carpio and compare to related diploid Cyprinid genomes. We identify a Barbinae lineage as potential diploid progenitor of C. carpio and then divide the allotetraploid genome into two subgenomes marked by a distinct genome similarity to the diploid progenitor. We estimate that the two diploid progenitors diverged around 23 Mya and merged around 12.4 Mya based on the divergence rates of homoeologous genes and transposable elements in two subgenomes. No extensive gene losses are observed in either subgenome. Instead, we find gene expression bias across surveyed tissues such that subgenome B is more dominant in homoeologous expression. CG methylation in promoter regions may play an important role in altering gene expression in allotetraploid C. carpio.Entities:
Mesh:
Year: 2019 PMID: 31604932 PMCID: PMC6789147 DOI: 10.1038/s41467-019-12644-1
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Phylogenetic relationship of tetraploid Cypinidae and relevant teleost lineages. The phylogenetic topologies and divergence ages are taken from the TimeTree database (ref. [11]) and ref. [12]. Green, yellow and red circles represent the teleost-specific whole genome duplication (Ts3R), salmonid-specific whole genome duplication (Ss4R) and carp-specific whole genome duplication (Cs4R), respectively. The fish images are either collected and created by us (including Cyprinus carpio, Ctenopharyngodon idella, Oryzias latipes, Danio rerio, Cirrhinus, Garra, Sinilabeo, Onychostoma, Acrossocheilus, Poropuntius, Puntius, Hampala and Sinocyclocheilus) or obtained from public domains (including Lepisosteus oculatus, Gasterosteus aculeatus, Salmo salar, Oncorhynchus mykiss and Carassius auratus)
Fig. 2Allotetraploid origin and evolution history of C. carpio. a Phylogenetic relationship of rag2 orthologues of C. carpio and its tetraploid and diploid close relatives in the subfamily Cyprininae. The pentagrams indicate three selected diploid species (Poropuntius huangchuchieni, Hampala macrolepidota, and Onychostoma barbatulum) as progenitor-like diploid candidates, and one species from a relatively distant lineage (Cirrhinus molitorella) for genome sequencing to represent the closely related diploid lineages from Cyprininae. b A histogram shows the coverage of P. huangchuchieni genome sequence mapping to 50 chromosomes of C. carpio. Chromosome IDs have been re-assigned to represent two sets of homoeologous chromosomes. c Boxplots show the genome coverage and similarity comparisons of diploid relatives to the tetraploid genome of C. carpio. d The distribution of the synonymous substitution rates (Ks) of homologous genes between D. rerio and C. carpio, C. carpio and Sinocyclocheilus, and homoeologous genes between two subgenomes of C. carpio. Three peaks (Ks = 0.42, 0.16 and 0.068) of Ks distribution indicate the divergences of D. rerio and C. carpio, C. carpio and Sinocyclocheilus, and two progenitors of C. carpio. e The distribution of sequence divergence rates of transposable elements (TEs) as percentages of subgenome sizes of C. carpio. The TE content segregation between subgenomes A and B indicates the events of diploid progenitor divergence and subgenome merger
Fig. 3Asymmetrical homoeologous gene expression in C. carpio. a Circos plot distribution of homoeologous gene pairs in 25 chromosome pairs across subgenomes A and B. b Boxplot of the Ka/Ks ratio distribution of protein-coding genes in 50 chromosomes of C. carpio. Orange and blue boxplots indicate that the chromosomes belong to subgenomes A and B, respectively. c Histograms of the genome-wide expression of homoeologous genes among the indicated tissues of C. carpio. Log2(FPKM_A/FPKM_B) indicates the degree of expression difference of homoeologous gene pairs. N values indicate the number of dominant genes in subgenomes A and B. d Heatmaps of three divergently expressed triplet clusters (each triplet includes two homoeologous genes of C. carpio and their orthologue of C. idella), indicate potential subfunctionalization and neofunctionalization in the allotetraploid genome of C. carpio. The orders of 12 tissues in both C. idella and C. carpio are the same as 3c. e Histograms of the genome-wide expression divergence of homoeologous genes of C. carpio in stress treatments and controls. N values indicate the number of dominant genes in subgenomes A and B, respectively. f Histogram of the ratio of homoeologous expression divergence in the stress treatment and controls [(A/B)treatment/(A/B)control], which indicates accelerated homoeologous expression divergence in stress treatments compared to controls
Fig. 4DNA methylation in homoeologous expression bias in C. carpio. a Distribution of synonymous substitution values (Ks) (blue) and gene-body DmCG percentages (red) of 2393 methylated homoeologous genes. Peak values are indicated by arrows. b CG methylation levels of all annotated genes and 8291 homoeologous gene pairs in two subgenomes. c Heatmaps of two extremely divergent co-expression clusters, of which one of two homoeologous genes in subgenomes A or B was extensively transcribed while the other copies suppressed in 12 tissues. d CG methylation levels of divergent expressed homoeologous genes corresponding to clusters I and II in c. e Boxplot of the Ka/Ks ratio distribution of homoeologous genes of two extremely divergent expression clusters in two subgenomes as shown in c. Yellow and blue boxplots indicate the genes in subgenomes A or B, respectively. f CG methylation level and expression level of homoeologous genes eif3I of C. carpio, which demonstrated that reduced CG methylation levels in the promotor region correlated with increased expression levels of eif3Ia, while eif3Ib was heavily methylated and silenced. FPKM reads per kilobases per million. Data represent mean ± sem for n = 9 expressed tissues