| Literature DB >> 28539126 |
Dokyoon Kim1,2, Anna O Basile2, Lisa Bang1, Emrin Horgusluoglu3, Seunggeun Lee4, Marylyn D Ritchie1,2, Andrew J Saykin3, Kwangsik Nho5.
Abstract
BACKGROUND: Rapid advancement of next generation sequencing technologies such as whole genome sequencing (WGS) has facilitated the search for genetic factors that influence disease risk in the field of human genetics. To identify rare variants associated with human diseases or traits, an efficient genome-wide binning approach is needed. In this study we developed a novel biological knowledge-based binning approach for rare-variant association analysis and then applied the approach to structural neuroimaging endophenotypes related to late-onset Alzheimer's disease (LOAD).Entities:
Keywords: Alzheimer’s disease; Imaging genomics; Rare variant analysis
Mesh:
Substances:
Year: 2017 PMID: 28539126 PMCID: PMC5444041 DOI: 10.1186/s12911-017-0454-0
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Illustration of rare variant association analysis using Bin-KAT for neuroimaging genomics. First, rare variants were binned/collapsed based on biological knowledge, such as exon, gene, pathway, protein family, evolutionary conversed regions (ECR) or regulatory region, using BioBin. Then, statistical tests including a burden test and a dispersion test (SKAT), were incorporated into BioBin, called Bin-KAT [19]. Bin-KAT provides an option of performing unified rare variant association analysis methods in one tool to identify biologically-informed bins significantly associated with imaging endophenotypes of interest. VCF, variant call format
Demographic characteristics of study population
| CN | EMCI | LMCI | AD | |
|---|---|---|---|---|
| N | 255 | 218 | 232 | 45 |
| Gender (M/F) | 129/126 | 120/98 | 148/84 | 28/17 |
| Age | 74.38 (5.47) | 71.12 (7.46) | 73.16 (7.27) | 74.76 (9.25) |
| Education (mean (SD)) | 16.4 (2.7) | 16.0 (2.7) | 16.1 (3.0) | 15.7 (2.7) |
|
| 185/70 | 131/87 | 113/119 | 12/33 |
CN cognitive normal older subject, EMCI early mild cognitive impairment, LMCI late MCI, SD standard deviation
Fig. 2Manhattan plot of genome-wide gene-based rare variant association analysis for a LOAD-related neuroimaging endophenotype, entorhinal cortex thickness. –log10 p-value was plotted against the chromosomal location of each gene. FANCC exceeded the genome-wide significant threshold (FDR-corrected p-value = 0.05) (red line)
Variant effects of FANCC on entorhinal cortex thickness. P-values from SKAT were obtained by removing a rare variant on FANCC at a time
| Variant |
| Annotation |
|---|---|---|
| rs1800361 | 3.01E-04 | nonsynonymous |
| rs145497019 | 1.20E-05 | nonsynonymous |
| rs1800362 | 9.71E-06 | nonsynonymous |
| rs1800368 | 9.71E-06 | nonsynonymous |
| rs138629441 | 5.79E-06 | nonsynonymous |
| rs143152201 | 3.19E-06 | nonsynonymous |
| 9:97869388 | 2.33E-06 | nonsynonymous |
|
| 2.27E-06b | |
| 9:97887391 | 2.27E-06 | nonsynonymous |
| rs1800367 | 2.27E-06 | nonsynonymous |
| 9:97876956 | 2.20E-06 | nonsynonymous |
| rs140687953 | 1.87E-06 | nonsynonymous |
| rs140781259 | 1.70E-06 | nonsynonymous |
| 9:97934335 | 1.66E-06 | nonsynonymous |
| rs1800366 | 1.65E-06 | nonsynonymous |
| rs121917783 | 1.59E-06 | stop-gain |
| rs1800365 | 1.48E-06 | nonsynonymous |
a p-value from SKAT for FANCC after removing the variant
b p-value from SKAT for FANCC that contains every variant
Top 10 genes associated with entorhinal cortex thickness
| Gene |
| FDR-corrected |
|---|---|---|
|
| 2.27E-06 |
|
|
| 7.22E-06 | 0.052 |
|
| 1.95E-05 | 0.094 |
|
| 3.60E-05 | 0.098 |
|
| 3.75E-05 | 0.098 |
|
| 4.06E-05 | 0.098 |
|
| 5.33E-05 | 0.110 |
|
| 7.08E-05 | 0.128 |
|
| 8.16E-05 | 0.132 |
|
| 1.07E-04 | 0.156 |
Fig. 3Functional networks based on top 5 genes associated with entorhinal cortex thickness. The Integrated Multi-species Prediction (IMP) performs a graphical search of a functional network to identify the genes most likely to participate in similar pathways as query genes including FANCC, RFX1, FAF1, ABCA5 and SORCS2. Nodes represent genes and edges represent the predicted probability that the connected genes are involved in the same biological process. Large nodes represent query genes and the color of the edge signifies the strength of the relationship confidence. Red edge represents higher confidence scores between nodes
Evolutionary conserved regions (ECR) associated with entorhinal cortex thickness
| ECR | Mapped gene |
| FDR-corrected |
|---|---|---|---|
| ucsc_ecr:ecr_placentalMammals_chr1_band514 |
| 1.72E-06 | 0.018 |
| ucsc_ecr:ecr_placentalMammals_chr15_band358 |
| 1.72E-06 | 0.018 |
| ucsc_ecr:ecr_primates_chr15_band292 |
| 5.81E-06 | 0.025 |
| ucsc_ecr:ecr_vertebrate_chr1_band599 |
| 7.22E-06 | 0.025 |
| ucsc_ecr:ecr_vertebrate_chr1_band1917 |
| 7.22E-06 | 0.025 |
| ucsc_ecr:ecr_vertebrate_chr12_band1255 |
| 7.22E-06 | 0.025 |
| ucsc_ecr:ecr_vertebrate_chr15_band428 |
| 8.45E-06 | 0.025 |