Literature DB >> 25848752

Genetic conflict reflected in tissue-specific maps of genomic imprinting in human and mouse.

Tomas Babak1, Brian DeVeale2, Emily K Tsang3, Yiqi Zhou1, Xin Li3, Kevin S Smith3, Kim R Kukurba4, Rui Zhang5, Jin Billy Li5, Derek van der Kooy6, Stephen B Montgomery4, Hunter B Fraser1.   

Abstract

Genomic imprinting is an epigenetic process that restricts gene expression to either the maternally or paternally inherited allele. Many theories have been proposed to explain its evolutionary origin, but understanding has been limited by a paucity of data mapping the breadth and dynamics of imprinting within any organism. We generated an atlas of imprinting spanning 33 mouse and 45 human developmental stages and tissues. Nearly all imprinted genes were imprinted in early development and either retained their parent-of-origin expression in adults or lost it completely. Consistent with an evolutionary signature of parental conflict, imprinted genes were enriched for coexpressed pairs of maternally and paternally expressed genes, showed accelerated expression divergence between human and mouse, and were more highly expressed than their non-imprinted orthologs in other species. Our approach demonstrates a general framework for the discovery of imprinting in any species and sheds light on the causes and consequences of genomic imprinting in mammals.

Entities:  

Mesh:

Year:  2015        PMID: 25848752      PMCID: PMC4414907          DOI: 10.1038/ng.3274

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


Despite over 20 years of study[2-5], evolutionary explanations for genomic imprinting remain controversial. The conflict/kinship theory posits that imprinting evolved due to different selection pressures on maternally and paternally-derived alleles[3,5,6]. For example, in species where litters of multiple paternities are common, increased expression of genes that promote fetal growth at the expense of the mother and littermates can be advantageous for paternally inherited alleles. In contrast, the inclusive fitness of maternally inherited alleles is maximized by more controlled nutrient exchange to enable equal allocation to all littermates. Other prominent theories include co-adaptation of mutually favorable traits in parent and offspring[6,7] and others[4], and it is not clear whether imprinting can be entirely explained by one model. In order to systematically identify imprinted genes and measure the breadth of tissues and developmental stages in which they are imprinted, we constructed an atlas of genomic imprinting in mouse (Fig. 1; Supplementary File 1; Supplementary Fig. 1, Supplementary Fig. 2, Supplementary Fig. 3). We detected imprinting as allele-specific expression (ASE) consistently biased towards either the maternal or paternal allele in reciprocally crossed F1 hybrids of diverged inbred mouse strains[8-10] (C57Bl6/J and CAST/EiJ), using methods that reliably discriminate imprinting from technical and biological variation[11] (see Methods). We sequenced mRNA from 26 unique tissues/developmental stages (61 biological samples), and combined our data with seven additional published tissues[9,12-15] (). Among 207 imprinting measurements that have been previously reported[16] in the gene-tissue pairs assayed here, 95.6% agree (); five of the nine cases that disagreed were in tissues sampled at different developmental time-points, and the remaining four include some equivocal evidence (e.g. claims of imprinting without confirmation from a reciprocal cross[17]). We confirmed the reported non-canonical maternal expression of Igf2 and paternal expression of Grb10 in adult brain[12], and we found this reciprocal pattern in all central nervous system (CNS) tissues (). We observed both paternal and maternal expression for Copg2 and Rtl1 as well (). Overall, our atlas increased the number of reported gene-tissue imprinting measurements[16,18] by nearly an order of magnitude ().
Figure 1

Atlas of genomic imprinting in mouse. All known and novel validated imprinted genes with at least one expressed SNP in both crosses are shown. (a), Tissue type. (b), Proportion of genes imprinted, detected using the same number of allele-specific sequencing reads in all samples. (c), Atlas generated using all sequencing reads. Genes are colored by their Imprinting Score (blue/pink) when allelic counts supported a parent-of-origin bias, and on their levels of gene expression in yellow (asinh(FPKM)) when parent-of-origin bias was absent. The y-axis was clustered treating parent-of-origin expression (blue/pink) equivalently by setting imprinting scores to positive values and non-imprinted expression (yellow) to negative values, thereby grouping similarly expressed and imprinted genes together. The x-axis was sorted on the proportion of genes imprinted. Maternal expression in e9.5 placenta is not shown since we could not reliably exclude signal stemming from contaminating maternal tissue (). Genomic clusters of at least two genes (within 1 MB) were each assigned a unique color, shown on the right when these genes also cluster by imprinting pattern. ASE data used to generate the plot are available in . Previously published samples: Pre-optic area[12], e15 Brain[12], Prefrontal cortex[12], e9.5 Embryo[9], Trophoblast stem cells[14], e17.5 Placenta[15], Embryonic fibroblasts[13], differentially methylated regions (DMRs)[33], uniparental disomy phenotypes (Mousebook/Harwell Phenotype Maps) (e.g. Mat-UPD indicates that gene is within a region that affects a phenotype when both copies of the region are maternal). *Promising novel imprinted gene candidates only imprinted in one tissue ().

We identified 74 candidates for novel imprinted genes that we tested by pyrosequencing, finding evidence of imprinted expression for 12 (Supplementary Table 2; Supplementary Table 3; Supplementary Fig. 5). We designated 7/12 as “high confidence”, on account of evidence from imprinting in multiple tissues and/or biological replicates, high allelic bias, and concordance at multiple SNPs. 7/12 were also located in regions previously shown to lead to parent-of-origin phenotypes (2.34 expected by chance; p=0.006; ). We also found biallelic expression of eight genes previously reported as imprinted (Htr2a, Pde4d, Tbc1d12, Gatm, Dlx5, Gabrb3, Nap1l4, and Pon2), which together with other conflicting evidence[19] indicates that these are likely not imprinted (). lists all high-confidence imprinted genes. One of the most striking features of the mouse atlas was the robust conservation of imprinting across tissues; the majority of imprinted genes were imprinted in nearly all tissues where they were expressed (Fig. 1, Supplementary Fig. 2, Supplementary Fig. 3; Supplementary File 1). Embryonic, extra-embryonic, and CNS tissues had the highest proportion of genes imprinted (), consistent with the known role of imprinting in development and social cognition[2]. Among genes that are imprinted in some tissues and biallelic in others, 52/55 were imprinted in embryos, but not in adults (the remaining three genes [Phf17, Gab1, Slc22a3] are imprinted in placenta and yolk-sac). These observations support a model where imprinted expression manifests during embryogenesis and then either persists through adulthood, or is lost during development. Genes with the most similar imprinting patterns (“co-imprinting”) were often clustered in the genome (), as expected due to shared cis-regulatory elements[2]. We found a number of significant functional enrichments within clusters of co-imprinted genes, including growth (e.g. decreased fetal weight), nutrient processing (e.g. glucose transport and uptake), and CNS development and signaling (e.g. nerve growth factor signaling) (see Methods and ). We also found a strong enrichment for neuropeptide hormone activity mediated by oxytocin/vasopressin signaling (), consistent with a recent link to regulation of feeding behavior[20] and widespread imprinting in the hypothalamus (). To enable comparisons of imprinting patterns between species we also generated an atlas of human imprinting. The lack of engineered crosses in human necessitates a more complex approach to identify parental-specific expression. Two major causes of autosomal ASE are imprinting and genetic variants affecting expression through cis-regulation. ASE caused by genetic variants typically leads to a consistent expression bias from the same allele in heterozygous individuals (at most ~50% of individuals). In contrast, imprinted genes have ASE in all individuals, but without bias toward any particular allele (). With ASE data from many individuals, these differences could potentially allow identification of imprinted genes. We measured ASE in 1,687 RNA-Seq samples from 45 tissues in 178 individuals (GTEx.v3[21]; ). We previously showed that concordance in ASE between two samples measured at the same SNP underestimates error due to systematic biases[11]. Therefore, we calibrated our parameters on concordance of ASE between different SNPs within the same gene. We found excellent agreement between genotyped and imputed SNPs (Pearson r=0.94; ), demonstrating high accuracy of genotype imputation, phasing, and quantification of ASE. We found that, similar to mouse, imprinted genes are highly over-represented among monoallelically expressed genes (; 7/76 known imprinted genes among the top 12). Detection of imprinted genes was further improved by eliminating genes with consistent ASE directionality across individuals (likely due to cis-regulatory variants) () and incorporating allele-specific methylation[22] (ASM) data (). To assess the accuracy of our predictions, we analyzed RNA-Seq data from lymphocytes derived from 17 members of a three-generation family[23]. This pedigree allowed us to identify imprinted genes, since their direction of ASE depends on each allele's parent-of-origin (; see Methods), and to estimate a false discovery rate (FDR) for our GTEx scoring scheme (). This represents a general approach that could be applied to discover imprinted genes whenever multi-generation expression and genotype data are available. As in mouse, we identified most genes known to be imprinted in human (63/76; ; ). The majority of the genes that we missed did not meet our stringent criteria (e.g. four expressed heterozygous SNPs). Only a few novel human imprinted genes reached a level of significance comparable to well-established imprinted genes, supporting the expectation that the majority of genes imprinted in adult tissues have already been discovered. We identified 17 strong candidates () at a level corresponding to a 1% FDR, and 100% ASE validation by microfluidics-based multiplex PCR (Supplementary Fig. 10, Supplementary Fig. 11, Supplementary Fig. 12, Supplementary Table 7, Supplementary Note). NHP2L1, PMF1, PPIEL, and ZNF595 were previously predicted on the basis of strong ASM[22,24], but it was their patterns of ASE (i.e. RNA score) that pushed them to significance in our study, and distinguished them from many other genes with ASM but no evidence of ASE. Similar to mouse, when a gene was imprinted in adult tissues, it tended to have a consistent strong allelic bias in all tissues sampled. Nonetheless, distinct patterns of allelic expression bias were enriched for functions including development via hedgehog signaling, kidney development, skeletal system development, regulation of growth, and synaptosome localization (). Our novel imprinted genes regulate glucose import in response to insulin (PID1), glucagon signaling and feeding behavior (GNG7), growth (PMF1), and are associated with birth weight (DHFR) and type II diabetes (MYO1D) (). As in mouse, ubiquitously monoallelic genes (the first 19 genes in ) are highly enriched for oxytocin/vasopressin neuropeptide activity and genes governing eating behavior (). We identified several conserved properties of genomic imprinting between human and mouse. The dichotomy of imprinting between neural and non-neural adult tissues is shared (), suggesting a conserved role for imprinting in neural function. We also found that genes that are imprinted in both species have stronger allelic bias but similar imprinting breadth compared to species-specific imprints (). Interestingly, among the most strongly conserved imprinted genes, “response to growth factor stimulus” was the most highly enriched function (), consistent with the theory that imprinting evolved due to genetic conflict over nutrient allocation[3]. Among 41 genes imprinted in mouse but not human, we observed an excess of maternally expressed genes (61%). This is consistent with theoretical predictions that silencing of paternally derived alleles should be less evolutionarily stable, due to maternal alleles having greater control over the in utero environment[25]. If indeed paternal silencing is less stable, it should be less common overall—despite being enriched in species-specific imprints—which is indeed the case (average 35% maternal expression/paternal silencing across all mouse tissues). If imprinted genes are indeed often involved in genetic conflict, pairs of maternally and paternally expressed genes with opposing roles may co-evolve in evolutionary “arms races”, possibly leading to co-expression of antagonistic gene pairs[26]. Consistent with this hypothesis, we observed an excess of maternal/paternal pairs among the most strongly co-expressed imprinted genes in mouse (excluding genes in close genomic proximity; see Methods) (). Many of the strongest maternal/paternal co-expressed pairs also have reciprocal imprinting patterns () and opposing functions. For instance, Magel2 and Calcr are a co-expressed pair involved in neuropeptide hormonal signaling. Magel2 loss of function results in poor suckling and neonatal growth retardation[27] while Calcr affects appetite suppression through amylin regulation[28]. Igf2r and Phf17 have also been linked to regulation of growth: Igf2r suppresses growth[29], while Phf17 promotes vasculogenesis of the placenta[30], where it is preferentially imprinted (). The human orthologs of the maternal/paternal pairs were co-expressed as well (), suggesting conservation of these antagonistic interactions. An additional prediction of the conflict/arms race model is that the expression levels of imprinted genes may increase due to positive selection, in response to increases in their antagonistic counterparts. To investigate this, we computed the expression divergence (Euclidean distance[31]) between all mouse-human orthologs in 14 tissues profiled in both species. We found a higher rate of divergence among imprinted genes with the strongest allelic bias (Fig. 3d; Supplementary Fig. 18, Supplementary Fig. 19; see Methods). To test the possibility that strong imprinting itself causes variable expression, we searched for a similar pattern among human individuals, but actually found less expression variation among imprinted genes (). This pattern of high interspecies divergence, coupled with low intraspecies variation, is consistent with the idea that positive selection contributed to their divergence.
Figure 3

Species comparisons of imprinting. (a), Conservation of allelic bias between human and mouse. Average allelic bias is generally conserved between the two species. Tissue-specific imprinting was strongest between CNS and non-CNS samples and is conserved between species. (b), Genes that are imprinted in both species (B; n=47, n=54) have stronger allelic bias relative to species-specific (SS; n=47, n=71) imprinted genes (median in red, box delineates 25th/75th quartiles, whiskers show range of non-outliers) (c), Examples of top maternal/paternal pairings (also see ). (d), Strongly imprinted genes (high allelic bias in many tissues) have more divergent gene expression between human and mouse relative to all genes (Wilcoxon p=0.012; ; box plot metrics same as b). z-scores were computed against gene expression divergence of randomly selected genes matched for breadth of expression (see Methods). Expr = asinh(FPKM), median subtracted for each gene within species. (e), Significance of within-human expression variation comparisons for top 15, 20, and 25 most strongly imprinted genes (1,000 permutations); colors=strongest imprinting bins from d, grayscale=genes randomized to demonstrate null. (f) Example of a gene with elevated expression in species and tissues where it is imprinted. Ctx=cortex, Cer=cerebellum. (g) Sorted log-ratios of mean mouse/human (imprinted) expression vs. mean platypus/chicken (not imprinted) orthologs in gene-tissue combinations where imprinted in both human and mouse. Expression was elevated in mouse/human (p=0.0033; c.f. randomly selected non-imprinted genes in the same tissues).

To test whether this rapid expression divergence reflects up-regulation, as predicted by the conflict/arms race model, we compared imprinted gene expression levels in human/mouse with the levels of their orthologs in platypus and chicken, where they are not likely to be imprinted[32]. Of ten imprinted genes with 10-way 1:1 orthologs across amniotes, nine had higher expression in human/mouse vs. platypus/chicken ( GRB10 shown as an example; all data; p=0.0033, see Methods). This difference is in spite of only being expressed from one allele in human/mouse (and thus having 50% lower expected expression, all else being equal), consistent with up-regulation due to antagonistic co-evolution. In conclusion, our human and mouse imprinting atlases have revealed the patterns of imprinting – across development, tissues, and species – in unprecedented detail. Tissue-specific imprinting is surprisingly rare, with most genes either imprinted in all adult tissues where they are expressed, or in none. In addition, genetic conflict between imprinted loci can explain several key observations: co-expression of maternally/paternally expressed genes, rapid expression divergence, and an overall pattern of up-regulation associated with imprinting. We expect that these resources will be instrumental in refining our understanding of imprinting mechanisms at individual loci and that similar atlases in other species will improve our understanding of the origins of imprinting.

METHODS

Identifying and quantifying imprinting in mouse

Tissues were dissected from 16 reciprocally crossed male C57Bl/6J × CAST/EiJ F1 mice (8 mice in each direction of cross) aged 35-45d and two embryonic stages (). Animals were housed and euthanized in accordance with the current Animal Use Protocol approved by the Faculty of Medicine and Pharmacy Animal Care Committee at University of Toronto. Tissues were rinsed in PBS and snap-chilled in liquid nitrogen within 10 minutes of dissection. RNA was extracted with Trizol (Invitrogen) according to the manufacturer's recommendations and integrity confirmed by Bionalyzer (RIN>6). Sequencing libraries were prepared using TruSeq v2 RNA-Seq kits (RS-122-2001; Illumina) according to the manufacturer's recommendations. 14 libraries (7 tissues; see ) were treated with UNG nuclease to retain strand specificity in the sequenced libraries[34]. Libraries were indexed and sequenced on a HiSeq 2000 (PE90) to an average 7.2 Gb/sample at BGI (Hong Kong, China). Genomic imprinting was quantified using previously established criteria[11]. In brief, reads were aligned with Novoalign (Novocraft) against gene models (), all possible splice junctions representing up to two exon skipping events, and the mouse genome (mm9). Reads mapping to opposite strands were analyzed independently for libraries with strand-specificity. 19.6 million C57Bl/CAST SNPs, representing the union of two collections[35-36], were masked prior to alignment in order to minimize alignment biases (). Allele-specific expression was quantified for each gene by counting uniquely mapped read-pairs within the gene boundaries (incl. introns) that overlapped at least one SNP, such that allelic origin could be discerned. Probability of ASE was estimated from the cumulative binomial distribution with random expectation set to 50% (i.e. 50% expression from both alleles), consistent with observed global ASE distributions (). An Imprinting Score (IS), which was previously shown to reliably distinguish genuine imprinting events[11], was computed as the log10 of the less significant binomial p-value of the two reciprocally crossed tissues and paternal bias was arbitrarily set to be negative. Significance was established by comparing ISs to background ISs computed from biological replicates with the same parental background[11] (i.e. not from a reciprocal cross). We note that incomplete imprinting may be due to mixtures of imprinted and non-imprinted cell types within individual tissue samples. Known mouse imprinted genes were compiled from literature[11,18] and are available as a BED track along with novel genes discovered in this study and genes with conflicting evidence from this study and literature removed (). The seven samples sequenced in previous studies () were analyzed in parallel from the fastq files. Putative novel imprinted genes (p<1e-2, >50% ASE in both crosses) were validated by pyrosequencing as previously described[11]. In brief, imprinting was considered validated if the difference in allelic bias between the reciprocal samples was >5% (2 standard deviations of 182 biological replicate DNA measurements) and each ratio was reciprocally biased (i.e. in opposite directions) relative to the ratio obtained with DNA. Primer sequences and assay details are available in . Raw data are available in the Sequence Read Archive under accession SRP020526. Functional enrichments were identified using hypergeometric tests against the JAX Phenome database[37], mouse Gene Ontology[38], human Reactome[39], GeneGO[40], and Biobase[41] pathways mapped onto mouse genes via ENSEMBL orthologs[42]. All databases were downloaded between May 10 and Jun 20, 2013. FDR was established as the proportion of randomly sampled background sets (selected from genes powered to detect ASE) of the same size that exceeded significance of the gene set of interest.

Identifying and quantifying imprinting in human tissues

GTEx.v3[21] imputed genotype and aligned RNA-Seq (BAM) data were downloaded from dbGaP (phs000424.GTEx.v3.p1.c1) on August 27, 2013. 1,687 samples had both RNA-Seq and genotypes available, comprising 45 unique tissues/organs from 178 individuals (). No minimum number of individuals per tissue was required. Genotypes were phased using SHAPEIT.v2[43] with genomic maps (human b37) from the author's website downloaded on August 27, 2013 and all options set to default. Phased ASE read counts were compiled at all heterozygous sites using sra-pileup (v2.3.2 from SRA Toolkit) and custom scripts. Gene models were constructed from RefSeq transcripts, ENSEMBL transcripts, UCSC gene models, GenBank mRNAs, and computational gene predictions by collapsing overlapping transcripts on the same strand and iteratively adding new models; starting with collapsed RefSeq transcripts, then collapsed ENSEMBL transcripts completely outside RefSeq annotated boundaries, and similarly UCSC genes, mRNAs, NSCAN and GENSCAN gene predictions, respectively, in a total of six iterations. Known imprinted genes in human were compiled from literature[16,18], and are available in BED format (). Gene-level ASE was quantified by aggregating counts across all phased heterozygous sites within gene boundaries such that each gene is assigned two integers for each sample: a and b, representing expression from allele a and allele b, in sample i and gene g. Although assignment of a and b is arbitrary, phasing ensures that, when possible, assignment of alleles is consistent across individuals, thus enabling assessment of conserved allelic expression bias. Allelic Bias (AB) was quantified for each gene in each tissue as a/(a+b) and the RNA-Score (RS) computed as Ʃ|AB|-|ƩAB|, for i=1..n (n=number of subjects where that tissue was sequenced). More complicated combinations (e.g. adding regression-derived weights to the two main terms) did not improve performance as gauged by ROC/AUC analysis of known imprinted genes and assuming all negatives are true-negatives (e.g. ). An allele-specific methylation score (ASMS) was assigned to each gene by overlapping allele-specific methylation coordinates from 22 samples[22] with gene models above and tallying the number of samples supporting ASM within the gene boundaries (incl. UTRs and introns). Cultured and uncultured differentiated cells did not have different distributions of ASM among known imprinted genes () and extending gene models to capture potentially intergenic regulation did not improve performance (). Multiple linear regression was used to compute a combined score (CS) by determining ASMS and RS weights (mean ASMS weight=0.29, mean RS weight=0.11) that best distinguish known imprinted genes from all other genes within each tissue. Negative CS values were regarded as a lack of evidence for imprinting. Precision, typically defined as the proportion of all positive calls at some score threshold that are true, was more broadly defined since we observed that it was impacted by both the score and the number of tissues exceeding that score (i.e. an imprint was more likely to be real if supported by multiple tissues). Precision was estimated for each gene by choosing a CS threshold that optimized precision, given the number of tissues that exceed that threshold ().

Pedigree imprinting analysis

Genotype data (generated from whole-genome sequencing by Complete Genomics) and BAM files of mapped RNA-Seq data[23] (generated from lymphocytes) were downloaded for 17 individuals from a CEPH/Utah three-generation pedigree (NA1463; GSE56961). The two grandparent/parent trios and one parent/children pedigree were phased separately with SHAPEIT.v2[43] and heterozygous SNPs within gene boundaries were used to track inheritance of child alleles back to the grandparents [when possible; 67.9% and 68.5% of expressed genes in lymphocytes with at least one expressed heterozygote (i.e. ASE-powered; n=17,855) were traceable from each child back to the two sets of grandparents, and 94.1% of ASE-powered genes were traceable in at least one of 11 children for at least one allele]. ASE was quantified as described above under “Identifying and quantifying imprinting in human tissues”. Imprinting was considered validated when all of the following criteria were met: 1) at least one allele switched between preferential expression to preferential silencing consistent with its parent-of-origin (e.g. expressed in the father when inherited from his mother, but then silent in father's children), 2) parent-of-origin bias was consistent in all children and parents, 3) at least three children were powered for detection of ASE. A minimum of ten ASE-informative reads were required and bias was considered present when a binomial test (cumulative distribution function, as described previously[11]) yielded a significance of p<0.01 comparing the ASE read counts of the two alleles.

ASE validation in human samples

ASE validation was performed on 9 genes in 9 subjects across 42 unique tissues (average 14.2 tissues/subject; 745 total measurements). Tissue samples were contributed by the GTEx consortium[21] (see Acknowledgements) and processed as described previously[44] with some modifications (). One heterozygous SNP was identified for each gene and subject; preference was given to heterozygous SNPs identified by exome-sequencing, genotyped variants when exome SNPs were not detected, and imputed SNPs if no other evidence was available. Allelic ratios were quantified as described previously[44] (see for design details).

Human-Mouse comparisons

Mean allelic bias was quantified for each gene as the sum of allelic biases across all tissues where imprinting was detected (max(p)<0.01 from the two reciprocal crosses in mouse with consistent parent-of-origin bias and CS>0 in humans; see sections above for more details). Each tissue contributes a value between 0 (100% biallelic) and 1 (100% monoallelic).

Enrichment for maternal/paternal expressed genes among similar imprinting patterns

Gene expression was quantified as arcsinh (similar to the natural log but allowing for values of zero) of FPKM (Fragments Per Kilobase per Million uniquely aligned reads that overlap gene) and was quantile-normalized in each species separately. These data were used to compute distances between all unique pairwise combinations of genes. The distance similarity metric (dist.sim) used for all comparisons was standard Euclidean distance. In the comparisons of numbers of maternal/paternal interactions, an interaction was defined between two genes if they did not exceed the maximum dist.sim threshold. Interactions between genes less than 1 MB apart were not considered to avoid effects from shared cis-regulation (e.g. two genes affected by the same DMR). When multiple interactions existed between genes within a cluster (within 1 MB of each other) and one other gene, only the lowest dist.sim value was considered. Two interaction types were considered: mm/pp (maternal-maternal or paternal-paternal), and mp/pm (maternal-paternal or paternal-maternal). Significance of deviation from the null was tested with a binomial test (cumulative), where the null expectation was set to the ratio of all possible mm/pp pairs to mp/pm pairs (after removing interactions within the same genomic regions).

Divergence of gene expression among imprinted genes

Gene expression was quantified for all human/mouse orthologs[42] as described above in 15 pairs of matching tissues. Mean FPKM was used when multiple individuals were sequenced for the same tissue. Stomach was excluded since the mouse-human counterparts did not cluster next to each other, possibly due to the human dissection including muscular tissue (), leaving 14 pairs for analysis. Data were median-subtracted (for each gene: median across all tissues subtracted from its value) and quantile-normalized within each species separately, then merged and quantile-normalized again to minimize species-specific biases. Imprinting strength was measured as the sum of ASE across all tissues, such that each tissue can contribute a value of 0 (no ASE) to 1 (100% monoallelic). Divergence was measured using Euclidean distance between each set of orthologs and was quantified as a z-score (number of standard deviations from mean) relative to a background set matched for degree of expression. A gene was included in the background if it was expressed in the same number of tissues (non-zero expression post-median subtraction) with the highest FPKM value being within 10% of the highest FPKM of the imprinted gene; the background was further trimmed such that the same number of genes was above and below the imprinted maximal expression. There was no association between background scores and imprinting breadth. Imprinting breadth is the sum of allelic bias across all tissues. The sum of Euclidean distances between matched tissue pairs was also used to quantify divergence, where the inputs were groups of genes and matched background sets comprised randomly substituted genes from all orthologs as described above. Human-human comparisons () were run on mean expression across two random subsets of individuals for the top 15, 20, and 25 genes sorted on strength of ASE (see above). For comparison, random sets of genes were also analyzed in the same way.

Comparison of imprinted gene expression to non-imprinted orthologs

Normalized gene expression data from six tissues from 10 species[45] was averaged across biological replicates (disregarding sex). Among 41 genes imprinted in both human and mouse, 10 were represented among 10-way 1:1:..:1 orthologs. For each gene, an average log2 ratio of mean(human,mouse)/mean(chicken,platypus) RPKM was computed across all tissues where the gene was imprinted in both human and mouse (43 total gene-tissue pairs). To quantify significance, this was repeated 10,000 times on randomly selected non-imprinted genes using expression from the same tissues.
  43 in total

1.  Genomic landscape of human allele-specific DNA methylation.

Authors:  Fang Fang; Emily Hodges; Antoine Molaro; Matthew Dean; Gregory J Hannon; Andrew D Smith
Journal:  Proc Natl Acad Sci U S A       Date:  2012-04-20       Impact factor: 11.205

2.  Improved whole-chromosome phasing for disease and population genetic studies.

Authors:  Olivier Delaneau; Jean-Francois Zagury; Jonathan Marchini
Journal:  Nat Methods       Date:  2013-01       Impact factor: 28.547

Review 3.  Genomic imprinting in mammals.

Authors:  Denise P Barlow; Marisa S Bartolomei
Journal:  Cold Spring Harb Perspect Biol       Date:  2014-02-01       Impact factor: 10.005

4.  The Genotype-Tissue Expression (GTEx) project.

Authors: 
Journal:  Nat Genet       Date:  2013-06       Impact factor: 38.330

5.  Site-specific silencing of regulatory elements as a mechanism of X inactivation.

Authors:  J Mauro Calabrese; Wei Sun; Lingyun Song; Joshua W Mugford; Lucy Williams; Della Yee; Joshua Starmer; Piotr Mieczkowski; Gregory E Crawford; Terry Magnuson
Journal:  Cell       Date:  2012-11-21       Impact factor: 41.582

6.  Critical evaluation of imprinted gene expression by RNA-Seq: a new perspective.

Authors:  Brian DeVeale; Derek van der Kooy; Tomas Babak
Journal:  PLoS Genet       Date:  2012-03-29       Impact factor: 5.917

7.  The Reactome pathway knowledgebase.

Authors:  David Croft; Antonio Fabregat Mundo; Robin Haw; Marija Milacic; Joel Weiser; Guanming Wu; Michael Caudy; Phani Garapati; Marc Gillespie; Maulik R Kamdar; Bijay Jassal; Steven Jupe; Lisa Matthews; Bruce May; Stanislav Palatnik; Karen Rothfels; Veronica Shamovsky; Heeyeon Song; Mark Williams; Ewan Birney; Henning Hermjakob; Lincoln Stein; Peter D'Eustachio
Journal:  Nucleic Acids Res       Date:  2013-11-15       Impact factor: 16.971

8.  Mouse phenome database.

Authors:  Stephen C Grubb; Carol J Bult; Molly A Bogue
Journal:  Nucleic Acids Res       Date:  2013-11-15       Impact factor: 16.971

9.  Characterization of the imprinting signature of mouse embryo fibroblasts by RNA deep sequencing.

Authors:  Diana A Tran; Angela Y Bai; Purnima Singh; Xiwei Wu; Piroska E Szabó
Journal:  Nucleic Acids Res       Date:  2013-11-11       Impact factor: 16.971

10.  Quantifying RNA allelic ratios by microfluidic multiplex PCR and sequencing.

Authors:  Rui Zhang; Xin Li; Gokul Ramaswami; Kevin S Smith; Gustavo Turecki; Stephen B Montgomery; Jin Billy Li
Journal:  Nat Methods       Date:  2013-11-24       Impact factor: 28.547

View more
  77 in total

Review 1.  Epigenetic and Cellular Diversity in the Brain through Allele-Specific Effects.

Authors:  Wei-Chao Huang; Kathleen Bennett; Christopher Gregg
Journal:  Trends Neurosci       Date:  2018-08-08       Impact factor: 13.837

Review 2.  Cancer transcriptome profiling at the juncture of clinical translation.

Authors:  Marcin Cieślik; Arul M Chinnaiyan
Journal:  Nat Rev Genet       Date:  2017-12-27       Impact factor: 53.242

Review 3.  New developments in Silver-Russell syndrome and implications for clinical practice.

Authors:  Miho Ishida
Journal:  Epigenomics       Date:  2016-04-12       Impact factor: 4.778

Review 4.  Role of DNA methylation in imprinting disorders: an updated review.

Authors:  Amr Rafat Elhamamsy
Journal:  J Assist Reprod Genet       Date:  2017-03-09       Impact factor: 3.412

Review 5.  A general theory of sexual differentiation.

Authors:  Arthur P Arnold
Journal:  J Neurosci Res       Date:  2017-01-02       Impact factor: 4.164

6.  Maternal H3K27me3 controls DNA methylation-independent imprinting.

Authors:  Azusa Inoue; Lan Jiang; Falong Lu; Tsukasa Suzuki; Yi Zhang
Journal:  Nature       Date:  2017-07-19       Impact factor: 49.962

7.  Diverse Non-genetic, Allele-Specific Expression Effects Shape Genetic Architecture at the Cellular Level in the Mammalian Brain.

Authors:  Wei-Chao Huang; Elliott Ferris; Tong Cheng; Cornelia Stacher Hörndli; Kelly Gleason; Carol Tamminga; Janice D Wagner; Kenneth M Boucher; Jan L Christian; Christopher Gregg
Journal:  Neuron       Date:  2017-02-23       Impact factor: 17.173

8.  Phenotypic subregions within the split-hand/foot malformation 1 locus.

Authors:  Malene B Rasmussen; Sven Kreiborg; Per Jensen; Mads Bak; Yuan Mang; Marianne Lodahl; Esben Budtz-Jørgensen; Niels Tommerup; Lisbeth Tranebjærg; Nanna D Rendtorff
Journal:  Hum Genet       Date:  2016-02-02       Impact factor: 4.132

Review 9.  The importance of having two X chromosomes.

Authors:  Arthur P Arnold; Karen Reue; Mansoureh Eghbali; Eric Vilain; Xuqi Chen; Negar Ghahramani; Yuichiro Itoh; Jingyuan Li; Jenny C Link; Tuck Ngun; Shayna M Williams-Burris
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2016-02-01       Impact factor: 6.237

10.  Sex-specific epigenetic profile of inner cell mass of mice conceived in vivo or by IVF.

Authors:  Elena Ruggeri; Saúl Lira-Albarrán; Edward J Grow; Xiaowei Liu; Royce Harner; Emin Maltepe; Miguel Ramalho-Santos; Annemarie Donjacour; Paolo Rinaudo
Journal:  Mol Hum Reprod       Date:  2020-11-01       Impact factor: 4.025

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.