| Literature DB >> 35158906 |
Alice Blandino1, Dominique Scherer1, Trine B Rounge2,3, Sinan U Umu2, Felix Boekstegers1, Carol Barahona Ponce1, Katherine Marcelain4, Valentina Gárate-Calderón1,4, Melanie Waldenberger5, Erik Morales6,7, Armando Rojas7, César Munoz6,7, Javier Retamales8, Gonzalo de Toro9,10, Olga Barajas4,11, María Teresa Rivera12, Analía Cortés12, Denisse Loader13, Javiera Saavedra13, Lorena Gutiérrez14, Alejandro Ortega15, Maria Enriqueta Bertrán16, Fernando Gabler17, Mónica Campos17, Juan Alvarado18, Fabrizio Moisán18, Loreto Spencer18, Bruno Nervi19, Daniel E Carvajal-Hausdorf20, Héctor Losada21, Mauricio Almau22, Plinio Fernández22, Ivan Gallegos4,11, Jordi Olloquequi23,24, Macarena Fuentes-Guajardo25, Rolando Gonzalez-Jose26, Maria Cátira Bortolini27, Carla Gallo28, Andres Ruiz Linares29,30,31, Francisco Rothhammer32, Justo Lorenzo Bermejo1.
Abstract
Long noncoding RNAs (lncRNAs) play key roles in cell processes and are good candidates for cancer risk prediction. Few studies have investigated the association between individual genotypes and lncRNA expression. Here we integrate three separate datasets with information on lncRNA expression only, both lncRNA expression and genotype, and genotype information only to identify circulating lncRNAs associated with the risk of gallbladder cancer (GBC) using robust linear and logistic regression techniques. In the first dataset, we preselect lncRNAs based on expression changes along the sequence "gallstones → dysplasia → GBC". In the second dataset, we validate associations between genetic variants and serum expression levels of the preselected lncRNAs (cis-lncRNA-eQTLs) and build lncRNA expression prediction models. In the third dataset, we predict serum lncRNA expression based on individual genotypes and assess the association between genotype-based expression and GBC risk. AC084082.3 and LINC00662 showed increasing expression levels (p-value = 0.009), while C22orf34 expression decreased in the sequence from gallstones to GBC (p-value = 0.04). We identified and validated two cis-LINC00662-eQTLs (r2 = 0.26) and three cis-C22orf34-eQTLs (r2 = 0.24). Only LINC00662 showed a genotyped-based serum expression associated with GBC risk (OR = 1.25 per log2 expression unit, 95% CI 1.04-1.52, p-value = 0.02). Our results suggest that preselection of lncRNAs based on tissue samples and exploitation of cis-lncRNA-eQTLs may facilitate the identification of circulating noncoding RNAs linked to cancer risk.Entities:
Keywords: eQTLs; gallbladder cancer; genetic association study; lncRNAs; molecular phenotypes
Year: 2022 PMID: 35158906 PMCID: PMC8833674 DOI: 10.3390/cancers14030634
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1Flowchart representing the three-stage approach used in the study.
Figure 2LncRNA preselection. (A) PCA based on normalized log2 expression counts for lncRNAs with a nonzero MAD expression in the preselection dataset. (B) Volcano plot for the lncRNAs with nonzero MAD expression investigated in the lncRNA preselection dataset. The y-axis shows −log10 p-values from J–T tests. The black line represents the applied threshold (FDR = 0.05). The red dots highlight lncRNAs preselected according to both J–T tests and ML, which showed low expression variability (MAD = 0) in serum samples. The blue dots show the six candidates that fulfilled both J–T and ML preselection criteria, with nonzero MAD expression in serum samples. (C) Dot-and-box plots of log2 expression in GS, Dys, and GBC tissue samples for the three preselected lncRNAs.
FFPE tissue expression of the three preselected lncRNAs in the complete dataset and stratified results by gender and age.
| Subgroup | lncRNA | FDR * | log2 Expression in GS Samples | log2 Expression | log2 Expression Difference †
|
|---|---|---|---|---|---|
| All | AC084082.3 | 0.009 | 8.23 (1.45–9.93) | 0.51 (0.04; 0.99) | 0.76 (0.09; 1.44) |
| LINC00662 | 0.009 | 1.48 (0.55–4.38) | 1.09 (0.62; 1.56) | 0.86 (0.30; 1.42) | |
| C22orf34 | 0.04 | 1.44 (0.48–3.68) | −0.24 (−0.49; 0.005) | −0.28 (−0.54; −0.01) | |
| Women | AC084082.3 | 0.04 | 8.23 (1.45–9.78) | 0.67 (0.18; 1.15) | 0.89 (0.15; 1.63) |
| LINC00662 | 0.01 | 1.47 (0.54–4.07) | 1.09 (0.61; 1.56) | 1.01 (0.45; 1.57) | |
| C22orf34 | 0.02 | 1.44 (0.48–3.80) | −0.30 (−0.57; −0.03) | −0.34 (−0.63; −0.04) | |
| Men | AC084082.3 | 0.99 | 10.01 | −0.52 (−1.02; −0.03) | −0.30 (−2.19; 1.59) |
| LINC00662 | 0.99 | 4.53 | −0.52 (−1.24; 0.21) | −1.09 (−2.85; 0.68) | |
| C22orf34 | 0.99 | 0.49 | 0.43 (−0.66; 1.53) | 0.27 (−0.19; 0.72) | |
| Age < 60 | AC084082.3 | 0.43 | 8.23 (1.45–10.19) | 0.73 (0.13; 1.33) | 0.64 (−0.22; 1.50) |
| LINC00662 | 0.51 | 1.81 (0.58–4.33) | 0.93 (0.30; 1.55) | 0.66 (−0.13; 1.45) | |
| C22orf34 | 0.58 | 1.43 (0.47–3.08) | −0.35 (−0.72; 0.02) | −0.29 (−0.67; 0.09) | |
| Age > =60 | AC084082.3 | 0.17 | 8.96 (1.47–9.86) | 0.29 (−0.33; 0.90) | 0.84 (−0.10; 1.77) |
| LINC00662 | 0.05 | 1.46 (0.78–3.84) | 1.24 (0.67; 1.81) | 1.06 (0.36; 1.77) | |
| C22orf34 | 0.17 | 1.46 (0.50–3.44) | −0.18 (−0.52; 0.16) | −0.34 (−0.68; 0.006) |
* FDR: false discovery rate from two-sided Jonckheere–Terpstra test. Small p-values suggest monotonically increasing or decreasing expression levels. † Average log2 expression differences were estimated using robust linear regression.
Figure 3LncRNA-eQTL validation. (A) Genetic PCA based on LD-pruned genotypes from the lncRNA-eQTL validation dataset. (B,C) Measured vs. predicted log2 expression for LINC00662 and C22orf34, respectively.
Identified and subsequently validated cis-lncRNA-eQTLs for the three preselected lncRNAs.
| lncRNA | log2 | Chromosomal | No. of | No. of | No. of | Adjusted |
|---|---|---|---|---|---|---|
| AC084082.3 | 6.59 (1.74; 9.06) | chr8:66112667–66115207 | 161 | - | - | - |
| LINC00662 | 3.40 (0.35; 5.60) | chr19:27684580–27793940 | 1576 | 2 | 2 | 0.26 |
| C22orf34 | 0.58 (0.03; 2.65) | chr22:49414524–49657542 | 395 | 45 | 3 | 0.24 |
Figure 4Predicted genotype-based log2 LINC00662 expression in the lncRNA-GBC association dataset. Rhombuses represent the average genotype-based log2 expression in population-based controls and GBC patients.
Predicted genotype-based log2 expression of LINC00662 and C22orf34 and their association with GBC risk in the lncRNA-GBC association dataset.
| lncRNA | Median Predicted | OR * ( | 95% CI † | |
|---|---|---|---|---|
| LINC00662 | 1.27 | 1.25 | 1.04; 1.52 | 0.02 |
| C22orf34 | 0.39 | 0.90 | 0.61; 1.32 | 0.59 |
* OR: odds ratio, adjusted by age and gender. † CI: confidence interval.