| Literature DB >> 29563166 |
David Thybert1,2, Maša Roller1, Fábio C P Navarro3, Ian Fiddes4, Ian Streeter1, Christine Feig5, David Martin-Galvez1, Mikhail Kolmogorov6, Václav Janoušek7, Wasiu Akanni1, Bronwen Aken1, Sarah Aldridge5,8, Varshith Chakrapani1, William Chow8, Laura Clarke1, Carla Cummins1, Anthony Doran8, Matthew Dunn8, Leo Goodstadt9, Kerstin Howe3, Matthew Howell1, Ambre-Aurore Josselin1, Robert C Karn10, Christina M Laukaitis10, Lilue Jingtao8, Fergal Martin1, Matthieu Muffato1, Stefanie Nachtweide11, Michael A Quail8, Cristina Sisu3, Mario Stanke11, Klara Stefflova5, Cock Van Oosterhout12, Frederic Veyrunes13, Ben Ward2, Fengtang Yang8, Golbahar Yazdanifar10, Amonida Zadissa1, David J Adams8, Alvis Brazma1, Mark Gerstein3, Benedict Paten4, Son Pham14, Thomas M Keane1,8, Duncan T Odom5,8, Paul Flicek1,8.
Abstract
Understanding the mechanisms driving lineage-specific evolution in both primates and rodents has been hindered by the lack of sister clades with a similar phylogenetic structure having high-quality genome assemblies. Here, we have created chromosome-level assemblies of the Mus caroli and Mus pahari genomes. Together with the Mus musculus and Rattus norvegicus genomes, this set of rodent genomes is similar in divergence times to the Hominidae (human-chimpanzee-gorilla-orangutan). By comparing the evolutionary dynamics between the Muridae and Hominidae, we identified punctate events of chromosome reshuffling that shaped the ancestral karyotype of Mus musculus and Mus caroli between 3 and 6 million yr ago, but that are absent in the Hominidae. Hominidae show between four- and sevenfold lower rates of nucleotide change and feature turnover in both neutral and functional sequences, suggesting an underlying coherence to the Muridae acceleration. Our system of matched, high-quality genome assemblies revealed how specific classes of repeats can play lineage-specific roles in related species. Recent LINE activity has remodeled protein-coding loci to a greater extent across the Muridae than the Hominidae, with functional consequences at the species level such as reproductive isolation. Furthermore, we charted a Muridae-specific retrotransposon expansion at unprecedented resolution, revealing how a single nucleotide mutation transformed a specific SINE element into an active CTCF binding site carrier specifically in Mus caroli, which resulted in thousands of novel, species-specific CTCF binding sites. Our results show that the comparison of matched phylogenetic sets of genomes will be an increasingly powerful strategy for understanding mammalian biology.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29563166 PMCID: PMC5880236 DOI: 10.1101/gr.234096.117
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Muridae genomes undergo large chromosomal rearrangements in punctuate bursts, resulting in greater structural diversity than primates. (A) Phylogenetic tree showing that the divergence time of the four Muridae species mirrors that of the four Hominidae species. The Mus species in blue were sequenced and assembled for this study. The 95% confidence interval of the divergence time estimation is shown by the shaded boxes (Supplemental Methods SM1.16). (B) Dot plots of whole-genome pairwise comparison between Mus musculus and the three other Muridae (top), and between human and the three other Hominidae (bottom). The chromosomes of Mus musculus and human were ordered by chromosome number. The chromosomes of the other species were ordered to optimize the contiguity across the diagonal. Red dots represent large (>3 Mb) inter-chromosomal rearrangements (fusion/fission and translocation). (C) Matrix of neighbor-joining tree of synteny breaks involving inter-chromosomal rearrangement for Muridae and Hominidae: (MMU) Mus musculus; (CAR) Mus caroli; (PAH) Mus pahari; (RAT) rat; (HUM) Human; (CHI) chimpanzee; (GOR) gorilla; (ORA) orangutan. (D) The rate of synteny breaks between sequential internal branch points of the Muridae and Hominidae clades. Muridae have undergone a punctuate increase in the rate of syntenic breaks between 3 and 6 MYA.
Figure 2.Acceleration of mutational rates in the Muridae lineage. (A) The evolutionary rate of nucleotide variation calculated for specific genomic regions. The error bar represents the standard error within the 95% confidence interval. (B) The rate of segmental turnover calculated for specific genomic regions. The error bar represents the standard error within the 95% confidence interval (Supplemental Methods SM3.2). (C) The bar chart shows the ratios of evolutionary rates between Muridae and Hominidae. Mouse versus human ratios were calculated for rates of nucleotide divergence (black bars) and the turnover rates (gray bars) for specific genomic regions (Supplemental Methods SM3.2).
Figure 3.Recent LINE activity can remodel protein-coding gene loci. (A) Violin plots showing the distribution of repeat elements that have the indicated divergence from the ancestral element sequence: (blue) SINE; (purple) LINE; (orange) LTR; (green) DNA. The age of the transposable elements was estimated using the nucleotide divergence from ancestral SINE, LINE, LTR, and DNA elements (Supplemental Methods SM4.1). The dashed lines indicate the estimated peaks of the most recent expansions in Mus musculus and human. (B) Violin plots showing the distribution of retrocopies (red) that have the indicated divergence from their parental genes for each Muridae (left) and Hominidae (right) species. The age of the retrocopies was estimated by the nucleotide divergence from ancestral retrocopies and the corresponding parental genes (Supplemental Methods SM4.3). The dashed line indicates the peak of the most recent expansion in Mus musculus. (C) Representation of the density of LINE elements in the Abp gene cluster for Mus musculus, Mus caroli, Mus pahari, the rat, and the thirteen-lined ground squirrel. The blue and red triangles represent the Abp genes: (blue) Abpa (Scgb1b); (red) Abpbg (Scgb2b). The black triangles represent the closest flanking genes (upstream [Scn1b] and downstream [Gpi1]) shared by the four Muridae species and the squirrel.
Figure 4.A single nucleotide mutation in a Mus caroli–specific expanding SINE B2 element contributed to the creation of thousands of novel CTCF binding events. (A) CTCF occupancy in the genome is shown by green tracks. The black squares show the location of SINE B2 retrotransposons. The yellow boxes represent two examples of a SINE B2 occupied by CTCF. (B) Fraction of transposable elements with CTCF binding in both Muridae (left) and Hominidae (right): (M) Mus musculus; (C) Mus caroli; (P) Mus pahari; (R) rat; (H) human; (Ch) chimpanzee; (G) gorilla; (O) orangutan. (C) Identity plots of SINE B2 with their consensus sequence, either occupied by CTCF (red) or not (brown) (Supplemental Methods SM4.1). The black arrow indicates a recent wave of SINE B2 expansion carrying CTCF binding sites in Mus caroli. (D) Neighbor-joining tree of SINE B2_Mm1 sequences from the three Mus species. The blue branches represent sequences from Mus caroli. The green branches represent sequences from Mus musculus or Mus pahari. The black lines in the outside tracks indicates the presence of a CTCF binding event. (E) A single nucleotide variation exists between the ancestral CTCF binding motif carried by the SINE B2_Mm1 element (middle) and a CTCF binding motif (top) carried by the elements recently expanded in Mus caroli. This branch-specific motif is enriched in CTCF occupancy.