| Literature DB >> 28515049 |
Yasuaki Takada1, Ryutaro Miyagi2, Aya Takahashi2,3, Toshinori Endo1, Naoki Osada4.
Abstract
Joint quantification of genetic and epigenetic effects on gene expression is important for understanding the establishment of complex gene regulation systems in living organisms. In particular, genomic imprinting and maternal effects play important roles in the developmental process of mammals and flowering plants. However, the influence of these effects on gene expression are difficult to quantify because they act simultaneously with cis-regulatory mutations. Here we propose a simple method to decompose cis-regulatory (i.e., allelic genotype), genomic imprinting [i.e., parent-of-origin (PO)], and maternal [i.e., maternal genotype (MG)] effects on allele-specific gene expression using RNA-seq data obtained from reciprocal crosses. We evaluated the efficiency of method using a simulated dataset and applied the method to whole-body Drosophila and mouse trophoblast stem cell (TSC) and liver RNA-seq data. Consistent with previous studies, we found little evidence of PO and MG effects in adult Drosophila samples. In contrast, we identified dozens and hundreds of mouse genes with significant PO and MG effects, respectively. Interestingly, a similar number of genes with significant PO effect were detect in mouse TSCs and livers, whereas more genes with significant MG effect were observed in livers. Further application of this method will clarify how these three effects influence gene expression levels in different tissues and developmental stages, and provide novel insight into the evolution of gene expression regulation.Entities:
Keywords: cis-regulatory mutation; gene expression; genomic imprinting; maternal effect
Mesh:
Year: 2017 PMID: 28515049 PMCID: PMC5499130 DOI: 10.1534/g3.117.042895
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Schematic representation of the generalized linear model (GLM) design. A hypothetical reciprocal cross between fly strains A and B is assumed. Black and white chromosomes represent the A and B genotype, respectively, and binary code specifies the allelic genotype (AG) effect (A: 0, B: 1). The parent-of-origin (PO) effect is set to 0 when the chromosome is inherited from the mother (left side of diploid chromosomes) and 1 when the chromosome is inherited from the father (right side of diploid chromosomes). The maternal genotype (MG) effect is specified by the maternal genotype (A: 0, B: 1).
Figure 2Evaluation of method using a simulated dataset. For each panel, the rows represent effects given in the simulations and the columns represent effects estimated using the generalized linear model (GLM). Numbers in cells denote the fractions of correctly estimated effect among 1250 simulated genes. (A) d = 5 with two replicates; (B) d = 3 with five replicates; and (C) d = 5 for allelic genotype (AG) and maternal genotype (MG), and d = 2 for parent-of-origin (PO) effects.
Summary of the log-normal generalized linear model (GLM) analysis of Drosophila and mice
| Samples | RAL799/RAL820 | RAL324/RAL852 | TSC | Liver |
|---|---|---|---|---|
| No. of analyzed genes | 6176 | 6971 | 12,963 | 11,995 |
| AG (FDR = 0.05) | 776 | 1570 | 1456 | 1584 |
| PO (FDR = 0.05) | 0 | 0 | 22 | 16 |
| MG (FDR = 0.05) | 0 | 0 | 4 | 304 |
TSC, trophoblast stem cells; AG, allelic genotype effect; FDR, false discovery rate; PO, parent-of-origin effect; MG, maternal genotype effect.
Summary of the negative binomial generalized linear model (GLM) analysis of Drosophila and mice
| Samples | RAL799/RAL820 | RAL324/RAL852 | TSC | Liver |
|---|---|---|---|---|
| No. of analyzed genes | 6536 | 6797 | 12,219 | 11,169 |
| AG (FDR = 0.05) | 922 | 1732 | 2102 | 2104 |
| PO (FDR = 0.05) | 5 | 15 | 64 | 35 |
| MG (FDR = 0.05) | 6 | 12 | 393 | 1355 |
TSC, trophoblast stem cells; AG, allelic genotype effect; FDR, false discovery rate; PO, parent-of-origin effect; MG, maternal genotype effect.
Figure 3Comparison of effect sizes of mouse trophoblast stem cells (TSCs) and livers in the negative binomial generalized linear model (GLM). The estimated effect size of each gene is indicated by a colored circle. The effect sizes of the TSCs and livers are shown on the x- and y-axes, respectively. Red circles represent genes with significant effects in both tissues and blue circles represent genes with significant effects in either tissue. Genes indicated by black circles did not exert significant effects. The allelic genotype (AG), parent-of-origin (PO), and maternal genotype (MG) effects are shown in (A–C), respectively.