| Literature DB >> 30760546 |
Marta Farré1, Jaebum Kim2, Anastasia A Proskuryakova3,4, Yang Zhang5, Anastasia I Kulemzina3, Qiye Li6, Yang Zhou6, Yingqi Xiong6, Jennifer L Johnson7, Polina L Perelman3,4, Warren E Johnson8,9, Wesley C Warren10, Anna V Kukekova7, Guojie Zhang6,11,12, Stephen J O'Brien13,14, Oliver A Ryder15, Alexander S Graphodatsky3,4, Jian Ma5, Harris A Lewin16, Denis M Larkin1,17.
Abstract
The role of chromosome rearrangements in driving evolution has been a long-standing question of evolutionary biology. Here we focused on ruminants as a model to assess how rearrangements may have contributed to the evolution of gene regulation. Using reconstructed ancestral karyotypes of Cetartiodactyls, Ruminants, Pecorans, and Bovids, we traced patterns of gross chromosome changes. We found that the lineage leading to the ruminant ancestor after the split from other cetartiodactyls was characterized by mostly intrachromosomal changes, whereas the lineage leading to the pecoran ancestor (including all livestock ruminants) included multiple interchromosomal changes. We observed that the liver cell putative enhancers in the ruminant evolutionary breakpoint regions are highly enriched for DNA sequences under selective constraint acting on lineage-specific transposable elements (TEs) and a set of 25 specific transcription factor (TF) binding motifs associated with recently active TEs. Coupled with gene expression data, we found that genes near ruminant breakpoint regions exhibit more divergent expression profiles among species, particularly in cattle, which is consistent with the phylogenetic origin of these breakpoint regions. This divergence was significantly greater in genes with enhancers that contain at least one of the 25 specific TF binding motifs and located near bovidae-to-cattle lineage breakpoint regions. Taken together, by combining ancestral karyotype reconstructions with analysis of cis regulatory element and gene expression evolution, our work demonstrated that lineage-specific regulatory elements colocalized with gross chromosome rearrangements may have provided valuable functional modifications that helped to shape ruminant evolution.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30760546 PMCID: PMC6442394 DOI: 10.1101/gr.239863.118
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Phylogenetic tree of the species and reconstructed ancestors. Numbers on branches from the cetartiodactyl ancestor (CET) to the cattle lineage are the evolutionary breakpoint rates (the number of evolutionary breakpoints per 1 My), and labels in italics are significantly different from the mean rearrangement rate across all branches. The dotted line leading to chevrotain represents the split of Pecora from the other ruminants, whereas the number crossing the line is the combined ruminant/pecoran evolutionary breakpoint rate. Arrowheads indicate gene family expansions (purple) and contractions (blue) in each branch. Details of the new genomes used can be found in Supplemental Table S1. Additional details of the reconstructed phylogenetic trees, rearrangements rates, and gene family expansions and contractions are shown in Supplemental Figure S1 and Supplemental Tables S5 and S6. (CET) Cetartiodactyl ancestral node; (RUM) ruminant ancestral node; (PEC) pecoran ancestral node; (BOV) bovid ancestral node.
Figure 2.Ideograms of the reconstructed ancestors relative to cattle chromosomes: (A) cetartiodactyl, (B) ruminant, (C) pecoran, (D) bovid. The vertical lines inside each chromosome reconstruction demarcate individual HSBs, whereas the diagonal lines indicate their orientation compared to the cattle genome. Arrowheads indicate evolutionary breakpoint regions (EBRs) associated with ruminant or cetartiodactyl enhancers (black), including the 25 TF motif enhancers (orange) and those not associated to any enhancers (white). Comparison of EBR positions with positions of enhancers is described in the “Functional constraint of enhancers” section.
Statistics of the reconstructed ancestral karyotypes
Chromosome rearrangements in cetartiodactyl, ruminant, pecoran, and bovidae ancestral karyotypes in the lineage leading to cattle inferred from combined DESCHRAMBLER and FISH data
Figure 3.Association of different types of EBRs with conserved noncoding elements (CNEs) and functional enhancers. (A) Fold enrichment of the CNEs inside EBRs and within 50 and 100 kbp of the different types of EBRs. (B) Fold enrichment of the functional enhancers. Asterisks mark statistically significant enrichments (FDR < 0.05). Dotted lines demarcate a fold enrichment of one. Additional data can be found in Supplemental Figure S3.
Figure 4.Association of transcription factor binding sites (TFBSs) with the different types of EBRs and their branches of origin. (A) Frequency of motifs in enhancers near each type of EBR according to their branch of origin. The frequency has been normalized by branch length of each classification. The dotted line corresponds to the total frequency of each branch of origin. (B) TFs with a different frequency of motifs in each lineage-specific EBR type. It shows the frequency of each motif in enhancers found in or ±50 kbp of EBRs. The TF motifs are colored according to their TF family: Blue TFs are part of the More than 3 adjacent zinc fingers; green TFs belong to the Three-zinc finger Krüppel-related factors; mauve TFs are in the AP-2 family; whereas gray are part of other TF families. The pink, purple, and orange lines in both A and B correspond to bovid-to-cattle lineage, ruminant- and cetartiodactyl-specific EBRs, respectively. Color-coded asterisks, according to the type of EBR, show significantly different frequencies (goodness-of-fit P-value <0.05). Additional data can be found in Supplemental Tables S13 and S14.
Figure 5.Gene expression correlation comparisons of genes in EBRs and msHSBs. (A) Pairwise correlation coefficients plotted against evolutionary distance for pairs of species with genes ±50 kbp of EBRs (olive green) and genes in msHSBs with the same distribution of mean expression levels across species (red), showing that genes in/near EBRs have more evolutionarily diverged expression patterns than genes in msHSBs. (B) Correlation coefficients of genes near EBRs with 25 TF motif enhancers (orange) compared to genes near EBRs without 25 TF motif enhancers (gray), suggesting that the 25 TF motif enhancers might contribute to the differences. (C) Correlation coefficients of genes in msHSBs with 25 TF motif enhancers (orange) compared to genes in msHSBs without 25 TF motif enhancers (gray). Genes near EBRs with 25 TF motif enhancers (D) or without 25 TF motif enhancers (E) in their regulatory regions were compared to matching genes in msHSBs, showing that the 25 TF motif enhancers in EBRs have a stronger effect on gene expression than the same type of enhancers in msHSBs. This effect was not observed for other types of enhancers. (F) Comparison between ruminant- (purple) and bovid-to-cattle lineage (pink) EBRs for expression of genes with 25 TF motif enhancers, suggesting that a higher number of motifs for the 25 TFs correlated with a more diverged gene expression. Lines correspond to linear regression trends with 95% confidence intervals in gray shading. P-values were obtained using Wilcoxon rank-sum test. Shading of the 25 TF motif enhancers represents the mean number of TF motifs in enhancers in each genomic region, ranging from a mean of 14.25 (pale orange) to 24.91 (dark orange) motifs in 25 TF motif enhancers. Additional data can be found in Supplemental Figure S6.
Figure 6.A model for the evolution of chromosome rearrangements with gene expression divergence by means of lineage-specific transposable elements (TEs). Chromosome rearrangement boundaries (EBRs) are enriched for lineage-specific TEs. These TEs harbor a higher number of TFBSs than ancestral TEs; therefore, they have a higher affinity for TFs and a stronger influence in gene expression and regulation than those found elsewhere in the genome. This leads to a higher differential expression for orthologous genes between species with and without the gross genomic rearrangement. Brown and green boxes represent ancestral or lineage-specific TEs, respectively. Purple bars represent TFBSs, and black boxes represent genes. Orange bell-shaped curves represent peaks of H3K27ac as functional enhancers, with the height of the bell proportional to the strength of the enhancer.