| Literature DB >> 31660858 |
Jing Xie1, Tieming Ji2, Marco A R Ferreira3, Yahan Li4, Bhaumik N Patel4, Rocio M Rivera4.
Abstract
BACKGROUND: High-throughput sequencing experiments, which can determine allele origins, have been used to assess genome-wide allele-specific expression. Despite the amount of data generated from high-throughput experiments, statistical methods are often too simplistic to understand the complexity of gene expression. Specifically, existing methods do not test allele-specific expression (ASE) of a gene as a whole and variation in ASE within a gene across exons separately and simultaneously.Entities:
Keywords: Allelic imbalance; Hierarchical generalized linear mixed model; High-throughput sequencing experiments; Single nucleotide polymorphism
Year: 2019 PMID: 31660858 PMCID: PMC6819473 DOI: 10.1186/s12859-019-3141-6
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Percentage of gene expression from maternal allele in brain, liver, kidney, and muscle, respectively. The top panel shows gene AOX1. The second panel shows gene HACL1. The third panel shows gene TMEM50B, and the bottom panel shows gene IGF2r. SNPs are drawn with ascending genomic locations. The bottom of each panel shows distribution of SNPs in exons from all RefSeq annotated transcripts of this gene. Rectangles represent exons (only those with SNPs are shown) with exon numbers indicated under each rectangle. Lengths of exons are not drawn to scale
Fig. 2Venn Diagram of detected ASEs across tissue types. Number of significant genes (estimated FDR=0.05) across four tissue types when testing ASE at the gene level, testing ASE variations across SNPs, and testing ASE gene and ASE variations within a gene simultaneously
Assess of FDR control and TPr when controlling estimated FDR at 0.05
| Method | True FDR | TPr(%) | ||||
|---|---|---|---|---|---|---|
| gene | SNP | gene-SNP | gene | SNP | gene-SNP | |
| BLMRM | 0.053 | 0.028 | 0.059 | 66.37 | 60.82 | 17.51 |
| (0.006) | (0.004) | (0.014) | (0.87) | (1.80) | (1.65) | |
| BLMRM | 0.060 | 0.030 | 0.094 | 68.87 | 56.82 | 17.50 |
| (pure Laplace) | (0.006) | (0.002) | (0.008) | (0.29) | (1.19) | (0.91) |
| GLMM | 0.073 | 0.006 | 0.625 | 68.66 | 57.20 | 86.72 |
| (0.010) | (0.002) | (0.004) | (1.52) | (1.49) | (0.86) | |
| MBASED | 0.358 | 0.032 | - | 91.34 | 64.32 | - |
| (0.006) | (0.005) | - | (0.54) | (1.51) | - | |
| ANOVA | 0.194 | - | - | 82.02 | - | - |
| (0.007) | - | - | (1.04) | - | - | |
| Binomial | 0.314 | - | - | 88.26 | - | - |
| (0.003) | - | - | (0.80) | - | - | |
Fig. 3FDR and ROC comparison. Top row shows results for testing the gene effect; middle row shows results for testing SNP variation within a gene; bottom row shows results for simultaneously testing gene ASE and SNP variation. Left panel shows box plots of true FDR across 10 simulations when controlling estimated FDR = 0.05; right panel presents ROC curves