| Literature DB >> 30410588 |
Yi-Xiao Chen1,2, Yan Guo2, Shan-Shan Dong2, Xiao-Feng Chen2, Jia-Bin Chen2, Yu-Jie Zhang2, Shi Yao2, Hlaing Nwe Thynn2, Liqiang Zhi1, Tie-Lin Yang1,2.
Abstract
Although genome-wide association studies (GWASs) have identified some risk single-nucleotide polymorphisms in East Asian never-smoking females, the unexplained missing heritability is still required to be investigated. Runs of homozygosity (ROHs) are thought to be a type of genetic variation acting on human complex traits and diseases. We detected ROHs in 8,881 East Asian never-smoking women. The summed ROHs were used to fit a logistic regression model which noteworthily revealed a significant association between ROHs and the decreased risk of lung cancer (P < 0.05). We identified 4 common ROHs regions located at 2p22.1, which were significantly associated with decreased risk of lung cancer (P = 2.00 × 10-4 - 1.35 × 10-4). Functional annotation was conducted to investigate the regulatory function of ROHs. The common ROHs were overlapped with potential regulatory elements, such as active epigenome elements and chromatin states in lung-derived cell lines. SOS1 and ARHGEF33 were significantly up-regulated as the putative target genes of the identified ROHs in lung cancer samples according to the analysis of differently expressed genes. Our results suggest that ROHs could act as recessive contributing factors and regulatory elements to influence the risk of lung cancer in never-smoking East Asian females.Entities:
Keywords: GWASs; genetic risk factors; lung cancer; regulatory elements; runs of homozygosity
Year: 2018 PMID: 30410588 PMCID: PMC6218761 DOI: 10.7150/jca.22855
Source DB: PubMed Journal: J Cancer ISSN: 1837-9664 Impact factor: 4.207
Basic characteristics of the study samples
| Characteristic | Never-smoking East Asian females | GSE40791 |
|---|---|---|
| Sample Size (n) | 8,881 | 194 |
| Age (year) | 58.2(9.29) | 68.01(10.46) |
| Female (%) | 100% | 42.8% |
| Cases (%) | 55.42% | 48.45% |
| Never-smokers (%) | 100.00% | 7.22% |
| Histology | ||
| Adenocarcinoma (%) | 73.04% | 100% |
| Squamous (%) | 13.41% | 0% |
| Other (%) | 13.55% | 0% |
Age values are presented as mean (standard deviation).
Summary of ROHs Characteristics identified with different length
| Size | Total ROHs size | Total ROHs number | Mean size per ROHs (kb) | ||||
|---|---|---|---|---|---|---|---|
| Mean(kb) | Range | Mean(n) | Range | ||||
| Min(kb) | Max(kb) | Min(n) | Max(n) | ||||
| > 500 kb | 148,270 | 58,436 | 445,666 | 176.90 | 75 | 272 | 837.8 |
| > 1.00 Mb | 51,936 | 13,854 | 378,231 | 34.79 | 11 | 115 | 1,471 |
| > 1.25 Mb | 32,633 | 2,769 | 358,867 | 17.39 | 2 | 81 | 1,808 |
| > 1.50 Mb | 21,650 | - | 350,432 | 9.32 | - | 58 | 2,153 |
| > 1.75 Mb | 14,884 | - | 343,937 | 5.12 | - | 44 | 2,513 |
| > 2.00 Mb | 10,938 | - | 334,826 | 2.99 | - | 33 | 2,760 |
| > 3.00 Mb | 5,947 | - | 335,623 | 0.84 | - | 25 | 2,185 |
Size: the minimum threshold for calling ROHs segments. The mean and range data were calculated by adding together all the values for the sizes of ROHs in each individual.
Effects of genome-wide burden of ROHs on subjects
| Size | |||
|---|---|---|---|
| > 500 kb | 0.007 | -11.50 | 4.24 |
| > 1.00 Mb | 0.010 | -11.09 | 4.33 |
| > 1.25 Mb | 0.007 | -11.81 | 4.38 |
| > 1.50 Mb | 0.009 | -11.58 | 4.42 |
| > 1.75 Mb | 0.006 | -12.20 | 4.48 |
| > 2.00 Mb | 0.007 | -12.13 | 4.52 |
| > 3.00 Mb | 0.009 | -12.08 | 4.65 |
P: P-value for association with the logistic regression model; β: the estimated effects of ROHs on whole genome in units of standard deviations; se: standard error.
Basic characteristics of significant ROHs associated with lung cancer
| ROH | Chr | Position | Freq | Cases: controls | Chi2 | Located gene | iHS max | Fay and Wu's H max | Fst max | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Start | End | ||||||||||||
| ROH_14857 | 2 | 39025631 | 39047154 | 0.048 | 198:230 | 14.47 | 1.43×10-4 | 0.044 | 1.284 | -22.097 | 0.360 | ||
| ROH_14715 | 2 | 39084756 | 39135927 | 0.049 | 200:232 | 14.49 | 1.41×10-4 | 0.044 | 2.623 | -36.364 | 0.433 | ||
| ROH_14344 | 2 | 39198965 | 39226271 | 0.049 | 206:238 | 14.57 | 1.35×10-4 | 0.044 | 1.798 | -39.925 | 0.290 | ||
| ROH_14342 | 2 | 39247288 | 39293530 | 0.050 | 207:237 | 13.85 | 2.00×10-4 | 0.044 | 2.010 | -56.961 | 0.271 | ||
Freq: frequency; Chr: chromosome; Chromosomal positions are shown according to NCBI Build 37 (hg19); P: P-value for testing the differences of homozygosity status between cases and controls with two-sided chi-square test; P: P-value adjusting with the Benjamini and Hochberg (BH) procedure; Located gene: ROHs located gene; iHS max: the maximal absolute values for iHS; Fay and Wu's H max: the maximal absolute values for Fay and Wu's H; and Fst max: the maximal absolute values for Fst; iHS max, Fay and Wu's H max and Fst max were derived for Asian population from Haplotter (http://haplotter.uchicago.edu/) using Phase II HapMap Project data.
P-values for the statistical analyses of inbreeding coefficients between cases and controls
| Chromosome | Total length | FI |
|---|---|---|
| 1 | 249250621 | 0.088905 |
| 2 | 243199373 | |
| 3 | 198022430 | 0.717751 |
| 4 | 191154276 | 0.575738 |
| 5 | 180915260 | 0.171943 |
| 6 | 171115067 | |
| 7 | 159138663 | 0.489741 |
| 8 | 146364022 | |
| 9 | 141213431 | 0.207553 |
| 10 | 135534747 | 0.163951 |
| 11 | 135006516 | 0.223446 |
| 12 | 133851895 | 0.599272 |
| 13 | 115169878 | 0.985238 |
| 14 | 107349540 | 0.543299 |
| 15 | 102531392 | 0.151867 |
| 16 | 90354753 | |
| 17 | 81195210 | 0.75997 |
| 18 | 78077248 | 0.282267 |
| 19 | 59128983 | 0.303959 |
| 20 | 63025520 | 0.561602 |
| 21 | 48129895 | 0.995754 |
| 22 | 51304566 | 0.537312 |
*bold values represent significant differences between cases and controls at P < 0.05
Figure 1Functional annotation for the common ROHs regions. The common ROHs regions overlapped with histone marks which classified as active enhancer in A549 cell line, including H3k4me1, H3k4me2 and H3k27ac, and annotated as enhancers according to 15-state chromatin state segmentation.
Figure 2Regulatory annotation of the common ROHs regions in IMR90 cell line. ROH_14342 and ROH_14857 were annotated with enhancers and interactions within the chromosome regions in IMR90 cell line. SOS1 and ARHGEF33 with active promoters were the target genes of the enhancer region overlapped with ROH_14342. Status of the enhancer regions were annotated according to 15-state chromatin state segmentation data generated by ChromHMM based on the Roadmap histone modification data. The interactions between regions on the chromosome detected by chromatin capture Hi-C are shown in the center.