| Literature DB >> 33720414 |
Abstract
Interactions between biological molecules in a cell are tightly coordinated and often highly dynamic. As a result of these varying signaling activities, changes in gene coexpression patterns could often be observed. The advancements in next-generation sequencing technologies bring new statistical challenges for studying these dynamic changes of gene coexpression. In recent years, methods have been developed to examine genomic information from individual cells. Single-cell RNA sequencing (scRNA-seq) data are count-based, and often exhibit characteristics such as overdispersion and zero inflation. To explore the dynamic dependence structure in scRNA-seq data and other zero-inflated count data, new approaches are needed. In this paper, we consider overdispersion and zero inflation in count outcomes and propose a ZEro-inflated negative binomial dynamic COrrelation model (ZENCO). The observed count data are modeled as a mixture of two components: success amplifications and dropout events in ZENCO. A latent variable is incorporated into ZENCO to model the covariate-dependent correlation structure. We conduct simulation studies to evaluate the performance of our proposed method and to compare it with existing approaches. We also illustrate the implementation of our proposed approach using scRNA-seq data from a study of minimal residual disease in melanoma.Entities:
Keywords: correlated count data; covariate-dependent correlation; dynamic coexpression; liquid association; single-cell RNA sequencing; zero inflation
Mesh:
Year: 2021 PMID: 33720414 PMCID: PMC8477913 DOI: 10.1111/biom.13457
Source DB: PubMed Journal: Biometrics ISSN: 0006-341X Impact factor: 1.701
FIGURE 1Profile plots of with varying X 3 (, , , and )
FIGURE 2Power curves comparing various methods. Both TLA and CNM‐Full approaches are Gaussian‐based models
Coverage probability of 95% credible intervals (CIs) and interval lengths based on 1000 MCMC simulations (, )
| Without zero inflation | With zero inflation | ||||
|---|---|---|---|---|---|
| Parameter | Coverage probability | CI length | Coverage probability | CI length | |
|
| τ0 | 1.000 | 0.237 | 1.000 | 0.246 |
| τ1 | 0.154 | 0.041 | 0.957 | 0.095 | |
|
| τ0 | 1.000 | 0.223 | 1.000 | 0.244 |
| τ1 | 0.006 | 0.022 | 0.961 | 0.059 | |
|
| τ0 | 0.957 | 0.205 | 1.000 | 0.242 |
| τ1 | 0.000 | 0.015 | 0.954 | 0.040 | |
Mean square errors (MSEs) and mean bias errors (MBEs) based on 1000 MCMC simulations (, )
| Without zero inflation | With zero inflation | ||||
|---|---|---|---|---|---|
| Parameter | MSE | MBE | MSE | MBE | |
|
| τ0 | 0.001 | 0.005 | 0.000 | −0.008 |
| τ1 | 0.002 | −0.039 | 0.001 | −0.006 | |
|
| τ0 | 0.002 | 0.024 | 0.000 | −0.009 |
| τ1 | 0.002 | −0.040 | 0.000 | −0.001 | |
|
| τ0 | 0.004 | 0.048 | 0.000 | −0.009 |
| τ1 | 0.002 | −0.041 | 0.000 | 0.000 | |
Top table of dynamic correlations differences. is the difference between τ1 estimates in phase 3 (P3) and phase 1 (P1)
| # | Gene1 | Gene2 |
|
|
|
|---|---|---|---|---|---|
| 1 | PDGFC | FGFR1 | 0.045 (0.021, 0.068) | −0.003 (−0.010, 0.005) | −0.047 (−0.072,−0.023) |
| 2 | AKT1 | BAX | 0.040 (0.008, 0.071) | −0.003 (−0.014, 0.008) | −0.043 (−0.075,−0.010) |
| 3 | AKT1 | PIK3R1 | −0.016 (−0.035, 0.004) | 0.024 (0.009, 0.038) | 0.040 (0.015, 0.062) |
| 4 | PDGFC | MAP2K2 | 0.016 (−0.002, 0.032) | −0.023 (−0.036,−0.006) | −0.039 (−0.059,−0.013) |
| 5 | IGF1R | FGFR1 | −0.024 (−0.048, 0.000) | 0.007 (0.000, 0.014) | 0.032 (0.006, 0.056) |
| 6 | MDM2 | CCND1 | 0.021 (0.007, 0.031) | −0.011 (−0.018,−0.004) | −0.031 (−0.044,−0.017) |
| 7 | AKT1 | ARAF | −0.025 (−0.047, 0.002) | 0.007 (−0.007, 0.018) | 0.031 (0.002, 0.056) |
| 8 | AKT1 | MAP2K1 | 0.025 (0.004, 0.057) | −0.006 (−0.017, 0.009) | −0.030 (−0.063,−0.006) |
| 9 | AKT1 | MAPK1 | −0.003 (−0.012, 0.006) | 0.026 (0.007, 0.055) | 0.029 (0.007, 0.058) |
| 10 | KRAS | PDGFC | 0.012 (−0.005, 0.024) | −0.017 (−0.042, 0.005) | −0.029 (−0.057,−0.002) |
| 11 | IGF1R | MAP2K2 | 0.025 (0.002, 0.056) | −0.004 (−0.011, 0.006) | −0.028 (−0.060,−0.004) |
| 12 | PTEN | PDGFC | −0.022 (−0.036,−0.004) | 0.007 (−0.003, 0.014) | 0.028 (0.008, 0.044) |
| 13 | PTEN | PIK3R1 | 0.031 (0.007, 0.050) | 0.005 (−0.006, 0.014) | −0.027 (−0.048,−0.002) |
| 14 | BAX | POLK | 0.025 (0.006, 0.048) | 0.000 (−0.012, 0.010) | −0.026 (−0.051,−0.003) |
| 15 | KRAS | NRAS | 0.017 (−0.003, 0.034) | −0.008 (−0.015, 0.002) | −0.024 (−0.043,−0.003) |
| 16 | ARAF | RB1 | 0.020 (0.008, 0.032) | −0.004 (−0.009, 0.002) | −0.024 (−0.037,−0.011) |
| 17 | AKT1 | RAF1 | −0.016 (−0.033,−0.003) | 0.007 (−0.004, 0.017) | 0.023 (0.006, 0.042) |
| 18 | NRAS | MAPK1 | 0.017 (0.002, 0.029) | −0.005 (−0.013, 0.006) | −0.021 (−0.037,−0.004) |
| 19 | PIK3R1 | MDM2 | 0.020 (0.004, 0.035) | −0.001 (−0.010, 0.008) | −0.021 (−0.038,−0.002) |
| 20 | IGF1R | TP53 | −0.016 (−0.034, 0.002) | 0.005 (−0.003, 0.011) | 0.020 (0.002, 0.039) |
| 21 | BAK1 | POLK | −0.018 (−0.030,−0.006) | 0.002 (−0.006, 0.010) | 0.020 (0.006, 0.034) |
| 22 | AKT3 | MAP2K2 | 0.016 (0.005, 0.025) | −0.003 (−0.011, 0.007) | −0.018 (−0.030,−0.006) |
| 23 | PTEN | KRAS | −0.005 (−0.016, 0.011) | 0.012 (0.003, 0.020) | 0.017 (0.000, 0.030) |
| 24 | BAD | RAF1 | −0.016 (−0.031,−0.006) | 0.000 (−0.009, 0.008) | 0.016 (0.002, 0.032) |
| 25 | IGF1R | CDK6 | 0.014 (−0.001, 0.026) | −0.002 (−0.008, 0.003) | −0.016 (−0.029,−0.001) |
| 26 | RB1 | CCND1 | 0.011 (0.000, 0.020) | −0.004 (−0.010, 0.004) | −0.014 (−0.025,−0.002) |
| 27 | AKT2 | FGFR1 | −0.003 (−0.015, 0.006) | 0.011 (0.004, 0.017) | 0.014 (0.002, 0.027) |
| 28 | BAD | TP53 | −0.001 (−0.010, 0.007) | 0.013 (0.002, 0.021) | 0.014 (0.001, 0.026) |
| 29 | NRAS | BAK1 | 0.001 (−0.008, 0.008) | 0.014 (0.006, 0.022) | 0.014 (0.002, 0.025) |
| 30 | AKT2 | BAK1 | −0.004 (−0.013, 0.005) | 0.010 (0.000, 0.019) | 0.014 (0.001, 0.026) |