| Literature DB >> 33035235 |
Li Luo1,2, Huining Kang1,2, Xichen Li3, Scott A Ness1,2, Christine A Stidley1.
Abstract
Changes in gene expression can correlate with poor disease outcomes in two ways: through changes in relative transcript levels or through alternative RNA splicing leading to changes in relative abundance of individual transcript isoforms. The objective of this research is to develop new statistical methods in detecting and analyzing both differentially expressed and spliced isoforms, which appropriately account for the dependence between isoforms and multiple testing corrections for the multi-dimensional structure of at both the gene- and isoform- level. We developed a linear mixed effects model-based approach for analyzing the complex alternative RNA splicing regulation patterns detected by whole-transcriptome RNA-sequencing technologies. This approach thoroughly characterizes and differentiates three types of genes related to alternative RNA splicing events with distinct differential expression/splicing patterns. We applied the concept of appropriately controlling for the gene-level overall false discovery rate (OFDR) in this multi-dimensional alternative RNA splicing analysis utilizing a two-step hierarchical hypothesis testing framework. In the initial screening test we identify genes that have differentially expressed or spliced isoforms; in the subsequent confirmatory testing stage we examine only the isoforms for genes that have passed the screening tests. Comparisons with other methods through application to a whole transcriptome RNA-Seq study of adenoid cystic carcinoma and extensive simulation studies have demonstrated the advantages and improved performances of our method. Our proposed method appropriately controls the gene-level OFDR, maintains statistical power, and is flexible to incorporate advanced experimental designs.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33035235 PMCID: PMC7546511 DOI: 10.1371/journal.pone.0232646
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1Expression patterns of three types of genes with isoform abundance levels in log scale, which correspond to Models 0–2 proposed in Shi et al. [19].
(Model 0) no differentially expressed isoforms; (Model 1) differential expression of isoforms but no differential splicing; and (Model 2) differentially spliced isoforms with differential expression at the isoform level but not necessarily at the gene level.
List of genes that are called significant between 8 patients who are free of cancer vs. 6 patients in the screening tests along with the likelihood ratio test p-values and FDR.
Type 1 screening test identified 11 differentially expressed/spliced genes, and Type 2 screening test identified 4 differentially spliced genes. False discovery rate was controlled at the 0.10 level.
| Type 1 screening | Type 2 screening | |||
|---|---|---|---|---|
| Gene name | P value | FDR | P value | FDR |
| POSTN | 9.14E-06 | 0.00233 | 0.36 | |
| HNRNPA2B1 | 0.00004 | 0.00022 | 0.12 | |
| VEGFA | 0.00012 | 0.43157 | 0.84 | |
| FRMD4A | 0.00017 | 0.06945 | 0.54 | |
| TRIP12 | 0.00020 | 0.00012 | ||
| PRKAA1 | 0.00021 | 0.00012 | ||
| GNPTAB | 0.00023 | 0.01255 | 0.44 | |
| C3orf17 | 0.00030 | 0.00012 | ||
| ASXL1 | 0.00030 | 0.00425 | 0.43 | |
| RRBP1 | 0.00034 | 0.00010 | ||
| FZD6 | 0.00037 | 0.00104 | 0.24 | |
* Genes that passed both Types 1 and 2 screening tests.
List of 12 differentially expressed isoforms between 8 patients who are free of cancer vs. 6 patients in the confirmatory test.
| Isoform ID | gene name | Fold Change | P value (model-based) | P value (t test) | Two step significance threshold |
|---|---|---|---|---|---|
| ENST00000474311 | C3orf17 | 5.50 | 8.50E-07 | 0.00030 | 3.40E-06 |
| ENST00000492155 | FRMD4A | 3.67 | 7.35E-06 | 0.00178 | 2.94E-05 |
| ENST00000358621 | FRMD4A | 2.01 | 9.13E-06 | 0.00137 | 3.65E-05 |
| ENST00000522566 | FZD6 | 8.59 | 8.12E-08 | 0.00028 | 1.62E-07 |
| ENST00000356674 | HNRNPA2B1 | 1.68 | 5.72E-11 | 0.00081 | 3.43E-10 |
| ENST00000541179 | POSTN | 19.31 | 8.16E-08 | 0.00087 | 4.08E-07 |
| NST00000478947 | POSTN | 9.48 | 4.80E-07 | 0.00037 | 2.40E-06 |
| ENST00000379743 | POSTN | 9.15 | 6.16E-05 | 0.00394 | 0.00031 |
| ENST00000511248 | PRKAA1 | 7.34 | 3.03E-08 | 0.00026 | 9.10E-08 |
| ENST00000495501 | RRBP1 | 5.89 | 2.49E-05 | 0.00139 | 7.46E-05 |
| ENST00000428959 | TRIP12 | -14.65 | 4.19E-08 | 0.00249 | 2.93E-07 |
| ENST00000497139 | VEGFA | 7.58 | 7.29E-07 | 0.00052 | 1.46E-06 |
Fig 2The isoform expression profiles for nine differentially expressed/spliced genes between 8 patients who are free of cancer vs. 6 patients who are not.
The distribution of the number of significant genes that passed Type 1 or Type 2 screening tests comparing the isoform expression patterns between 81 cases and 153 controls.
| No. of isoforms | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | ≥16 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| No. of genes that passed Type 1 screening test | 88 | 111 | 105 | 90 | 88 | 67 | 69 | 46 | 27 | 31 | 20 | 14 | 10 | 4 | 12 | 782 |
| No. of genes that passed Type 2 screening test | 100 | 123 | 116 | 98 | 101 | 68 | 75 | 52 | 29 | 34 | 21 | 14 | 10 | 5 | 11 | 857 |
Fig 3Isoform expression profiles for two gene with differentially spliced isoforms that are associated with survival outcome in AML.
Fig 4Evaluation of OFDR and power through the simulation with template gene MDM2 when Wald-test is used in the confirmatory stage.