| Literature DB >> 32421818 |
María Touceda-Suárez1, Elizabeth M Kita1, Rafael D Acemel2, Panos N Firbas2, Marta S Magri2, Silvia Naranjo2, Juan J Tena2, Jose Luis Gómez-Skarmeta2, Ignacio Maeso2, Manuel Irimia1,3,4.
Abstract
We investigated how the two rounds of whole-genome duplication that occurred at the base of the vertebrate lineage have impacted ancient microsyntenic associations involving developmental regulators (known as genomic regulatory blocks, GRBs). We showed that the majority of GRBs identified in the last common ancestor of chordates have been maintained as a single copy in humans. We found evidence that dismantling of the duplicated GRB copies occurred early in vertebrate evolution often through the differential retention of the regulatory gene but loss of the bystander gene's exonic sequences. Despite the large evolutionary scale, the presence of duplicated highly conserved noncoding regions provided unambiguous proof for this scenario for multiple ancient GRBs. Remarkably, the dismantling of ancient GRB duplicates has contributed to the creation of large gene deserts associated with regulatory genes in vertebrates, providing a potentially widespread mechanism for the origin of these enigmatic genomic traits.Entities:
Keywords: gene deserts; genomic regulatory block; origin of vertebrates; whole-genome duplications
Mesh:
Year: 2020 PMID: 32421818 PMCID: PMC7530604 DOI: 10.1093/molbev/msaa123
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Fig. 1Fates of ancient GRBs after the two WGDs in vertebrates. (A) Genetic redundancy allows the dismantling of GRBs after WGD. Two scenarios are depicted: (i) The GRB microsyntenic association is disrupted by a break point, in which both the trans-dev and the bystander genes may be maintained, but in different genomic locations; This scenario would impact the regulation of the trans-dev gene. (ii) The GRB microsyntenic association is dismantled by the differential loss of either the trans-dev or the bystander gene. The loss can occur by a large deletion of the genomic locus or by pseudogenization and “exon erosion.” Although the former would impact the regulation of the trans-dev when the bystander is lost, the latter would not. (B) Summary of the fates of the 156 studied ancient GRB pairs present by the last common ancestor of chordates. Human conservation, whether the human genome has conserved at least one linked copy or not of the ancient GRB associations; #GRB pair copies, for those conserved, the number of copies of GRB pairs maintained in human (1–4); Multiple gene copies, for those GRB pairs in single copy, in how many cases, there are multiple ohnologs for both the trans-dev and bystander genes (i.e., not linked), only for the trans-dev or bystander gene, or for none. (C) Percent of trans-dev (T) genes of different types that have at least one intergenic region (N1-T and/or T-B [for ancient GRB pairs] or T-N2 [for other trans-dev genes]) within the first, second, third, or another decile of intergenic region lengths genome-wide (i.e., trans-dev genes in decile 1 have at least one intergenic region whose length is among the top 10% of all intergenic regions). All td, all trans-dev genes linked to at least one non-trans-dev gene (n = 745); GRB td, trans-dev genes that are part of conserved ancient GRB pairs (n = 103); Unlinked td, trans-dev ohnologs from ancient GRBs that are not linked to the ancient bystander gene (n = 171); Unlinked td (max/min), the unlinked trans-dev ohnolog with the largest/smallest intergenic region (n = 107); GRB td (T-N2), for these cases, the distance from the trans-dev gene in a conserved ancient GRB pair to the gene after the bystander (gene N2) is considered as the only intergenic distance (n = 103). Genes are only counted once in each category and cases for which the two downstream neighbors (N1 and N2) are not present are not considered. All lengths for each category are provided in supplementary table S3, Supplementary Material online. The use of N1 and N2 to label gene neighbors does not imply that these genes are the ohnologs of the neighboring genes flanking a trans-dev gene before WGDs. (D) For each type of trans-dev gene, number of ATAC-seq peaks found in the intergenic region with the highest number of peaks, except for “GRB td (T-N2),” where only the number of peaks between the trans-dev and the gene N2 is considered (as in C; n = 655, 82, 131, 78, 78, and 82, respectively).
Fig. 2The evolution of the Islet-Scaper GRB pair exemplifies the contribution of bystander erosion to the origin of gene deserts. (A) Schematic representation of Isl1 and Isl2 gene regulatory landscapes in the mouse genome (regions contained between the TAD borders identified by Dixon et al. [2012]). Scaper, the bystander associated with Isl2, is depicted, as well as the pair of ohnologous HCNRs identified by McEwen et al. (2006). (B) 4C-seq signal using Isl promoters as viewpoints (orange) and H3K4me3 and ATAC-seq signal from 24 hpf zebrafish or 15 hpf amphioxus embryos, and conservation tracks from the UCSC Genome Browser (PhastCons and PhyloP, for zebrafish only); ohnologous HCNRs are highlighted in green. Red arrowheads indicate significant interactions with either Islet promoter. Right: In situ hybridization of isl1 and isl2a from Thisse et al. (2004) and GFP expression driven by the HCNRs associated with isl1 and isl2a at different timepoints using the ZED vector. During the first few days of development, the isl1 enhancer showed dynamic expression. At 24 hpf, the reporter shows expression in the anterior telencephalon, and a subset of retina and ventral hindbrain cells. At 48 and 72 h, it maintains expression in a sparse subset of neural cells that may overlap with small, localized regions of isl1 expression. In the case of isl2a, the enhancer found within scaper drove consistent expression in a diffuse anterior domain at 24 hpf, as well as in the pineal gland and ventral hindbrain. The expression becomes more restricted at 48 and 72 hpf; to the pineal gland, subsets of the retinal and otic cells, and faintly in the diencephalon. Scale bar: 100 μm. (C) Model of presumptive gene desert formation by GRB dismantling through bystander exon erosion.