| Literature DB >> 31216356 |
Hafdis T Helgadottir1, Pär Lundin2, Emelie Wallén Arzt1, Anna-Karin Lindström3,4, Caroline Graff3,4, Maria Eriksson1.
Abstract
Alzheimer's disease (AD) is the most common neurodegenerative disease worldwide. Familial cases suggest genetic components; however, monogenetic causes are few, and the vast majority of incidences have unknown cause. Sequencing efforts have focused on germline mutations, but improved technology has opened up for studies on somatic mutations in affected brain tissue samples. Here we use ultra-deep sequencing on brain and blood from early-onset AD (EOAD) and late-onset AD (LOAD) patients and non-AD individuals (n = 16). In total, 2.86 Mb of genomic regions, previously associated with AD, were targeted included 28 genes and upstream and downstream regulatory regions. Tailored downstream bioinformatics filtering identified 11 somatic single nucleotide variants in the temporal cortex in AD patients and none in the controls. One variant was validated to be present at 0.4% allele frequency in temporal cortex of a LOAD patient. This variant was predicted to affect transcription factor binding sites upstream of the CD55 gene, contributing to AD pathogenesis by affecting the complement system. Our results suggest that future studies targeting larger portions of the genome for somatic mutation analysis are important to obtain an increased understanding for the molecular basis of both EOAD and LOAD.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31216356 PMCID: PMC6688063 DOI: 10.1093/hmg/ddz085
Source DB: PubMed Journal: Hum Mol Genet ISSN: 0964-6906 Impact factor: 6.150
Figure 1Workflow and bioinformatics overview. (A) Blood and brain (temporal cortex) samples were obtained from EOAD patients (n = 4), LOAD patients (n = 4) and age- and gender-matched non-AD individuals (n = 8). The DNA was extracted and sonicated before library preparation. During the library preparation, index primers were added to the DNA, and every sample was captured on an individual array containing 2.86 Mb of the human genome. The 32 libraries were mixed in 2 pools of 16 samples each and sequenced on HiSeq 2500, to be followed with bioinformatics downstream filtering to reveal brain-specific SNVs and validation by ddPCR. (B) Raw fastq files were aligned using BWA and processed using GATK-best practice that resulted in realigned BAM files. Variants were called using four somatic mutation callers, and downstream filtering was applied to identify tissue-specific somatic SNVs in the brain.
Figure 2Tissue-specific SNVs in the brain detected in AD patients. (A) Bielschowsky-stained frontal cortex from AD and non-AD individuals. AD patients show signs of plaques and tangles. (B) Overview of SNVs called by different somatic callers. SNVs identified by minimum three somatic callers were considered possible brain-specific SNVs. (C) In total, 11 tissue-specific SNVs were detected in the brain of the AD patients while no SNVs were detected in non-AD, and more SNVs were detected in EOAD than in LOAD. (D) Although majority of the regions targeted were intergenic, more SNVs were detected within genes.
Tissue-specific SNVs identified in the brain DNA
| MuTect1 | MuTect2 | Strelka2 | Varscan2 | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Blood | Brain | Blood | Brain | Blood | Brain | Blood | Brain | ||||||||||||
| Group sample | SNV | Locus | Location | Ref/Alt | AAF (%) | Ref/Alt | AAF (%) | Ref/Alt | AAF (%) | Ref/Alt | AAF (%) | Ref/Alt | AAF (%) | Ref/Alt | AAF (%) | Ref/Alt | AAF (%) | Ref/Alt | AAF (%) |
| LOAD 1/2 | chr1:207351003C>A | CR1 | Intergenic | na | na | na | na | 423/0 | 0 | 546/8 | 1.2 | 425/0 | 0 | 586/8 | 1.3 | 424/0 | 0 | 584/7 | 1.2 |
| LOAD 19/20 | chr1:207461994C>T | CR1 | Upstream | 794/0 | 0 | 582/6 | 1.0 | 1472/0 | 0 | 997/10 | 1.0 | 1476/0 | 0 | 1002/10 | 1.0 | 1477/1 | 0 | 1001/10 | 1.0 |
| EOAD 27/28 | chr1:207550332T>C | CR1 | Intergenic | na | na | na | na | 697/0 | 0 | 398/6 | 1.6 | 701/0 | 0 | 400/6 | 1.5 | 700/0 | 0 | 400/5 | 1.2 |
| EOAD 27/28 | chr1:227069718G>T | PSEN2 | Missense | na | na | na | na | 491/0 | 0 | 290/5 | 1.3 | 490/0 | 0 | 289/5 | 1.7 | 490/0 | 0 | 287/5 | 1.7 |
| EOAD 25/26 | chr2:128054946G>T | BIN1 | Upstream | na | na | na | na | 403/0 | 0 | 391/5 | 1.0 | 409/0 | 0 | 397/5 | 1.2 | 409/0 | 0 | 396/5 | 1.2 |
| EOAD 11/12 | chr8:27316070C>A | CLU/PTK2B | 3′UTR | na | na | na | na | 332/0 | 0 | 414/6 | 1.0 | 334/0 | 0 | 417/6 | 1.4 | 330/0 | 0 | 413/6 | 1.4 |
| EOAD 25/26 | chr11:121250381G>T | SORL1 | Intergenic | na | na | na | na | 394/0 | 0 | 467/7 | 1.0 | 394/0 | 0 | 473/7 | 1.5 | 393/0 | 0 | 472/7 | 1.5 |
| EOAD 11/12 | chr11:121363100C>A | SORL1 | Intron | na | na | na | na | 609/1 | 0 | 209/6 | 2.6 | 614/1 | 0 | 222/6 | 2.6 | 612/0 | 0 | 221/6 | 2.6 |
| EOAD 27/28 | chr11:121401561A>G | SORL1 | Intron | na | na | na | na | 880/0 | 0 | 391/6 | 1.1 | 882/0 | 0 | 392/6 | 1.5 | 876/0 | 0 | 389/6 | 1.5 |
| EOAD 11/12 | chr21:27421506G>T | APP | Intron | na | na | na | na | 696/0 | 0 | 371/5 | 1.0 | 704/0 | 0 | 428/5 | 1.1 | 697/0 | 0 | 427/5 | 1.2 |
| EOAD 11/12 | chr21:27489758G>A | APP | Intron | na | na | na | na | 987/0 | 0 | 579/6 | 0.7 | 987/0 | 0 | 594/6 | 1.0 | 979/0 | 0 | 585/6 | 1.0 |
Sample ID is according to Supplementary Material, Table S1; chromosomal position is according to hg19. For the sequencing results, the number of reads for reference (Ref) and alternative (Alt) allele are reported, as well as the AAF for the brain and the blood. na means not available.
Comments from the somatic variant callers on the tissue-specific SNVs in brain
| Group sample | SNV | MuTect1 | MuTect2 | Strelka | Varscan2 |
|---|---|---|---|---|---|
| LOAD 1/2 | chr1:207351003C>A | fstar_tumor_lod, possible_contamination | t_lod_fstar | LowEVS | PASS |
| LOAD 19/20 | chr1:207461994C>T | possible_contamination | PASS | LowEVS | PASS |
| EOAD 27/28 | chr1:207550332T>C | possible_contamination | PASS | LowEVS | PASS |
| EOAD 27/28 | chr1:227069718G>T | fstar_tumor_lod, possible_contamination | t_lod_fstar | LowEVS | PASS |
| EOAD 25/26 | chr2:128054946G>T | fstar_tumor_lod, possible_contamination | t_lod_fstar | LowEVS | PASS |
| EOAD 11/12 | chr8:27316070C>A | fstar_tumor_lod, possible_contamination | t_lod_fstar | LowEVS | PASS |
| EOAD 25/26 | chr11:121250381G>T | fstar_tumor_lod, possible_contamination | t_lod_fstar | LowEVS | PASS |
| EOAD 11/12 | chr11:121363100C>A | fstar_tumor_lod, possible_contamination | t_lod_fstar | PASS | PASS |
| EOAD 27/28 | chr11:121401561A>G | fstar_tumor_lod, possible_contamination | t_lod_fstar | LowEVS | PASS |
| EOAD 11/12 | chr21:27421506G>T | na | t_lod_fstar | LowEVS | PASS |
| EOAD 11/12 | chr21:27489758G>A | fstar_tumor_lod, possible_contamination | t_lod_fstar | LowEVS | PASS |
Sample ID is according to Supplementary Material, Table S1; chromosomal position is according to hg19. For the SNVs, the comment from default setting of every caller is showed.
Validation of tissue-specific SNVs in brain using ddPCR
| Brain Validation | Blood Validation | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Group sample | SNV | Locus | Somatic caller | Fractional abundance (95% CI) [a/(a + b)%] | Total DNA | Total haploid copies | Proportion mutant allele, 1 in X | Fractional abundance (95% CI) [a/(a + b)%] | Total DNA | Total haploid copies | Proportion mutant allele, 1 in X |
| LOAD 19/20 | chr1:207461994C>T | CR1 | a | 0.39 (0.31;0.47) | 151 | 45 740 | 256 | 0 | 167 | 50 725 | 0 |
| LOAD 1/2 | chr8:26930598T>C | CLU/PTK2B | b | 0.106 (0.061;0.145) | 215 | 65 268 | 943 | 0.097 (0.05;0.15) | 96 | 28 988 | 1031 |
| NL_Old 21/22 | chr2:127905153G>T | BIN1 | b | 0 | 88 | 20 287 | na | na | |||
| NL_Old 21/22 | chr11:121034807G>T | SORL1 | c | 0 | 84 | 19 288 | na | na | |||
| EOAD 25/26 | chr11:121332780G>T | SORL1 | b | 0 | 12 | 3681.8 | na | na | |||
| NL_Old 23/24 | chr11:121415560G>T | SORL1 | b | 0 | 34 | 10 060 | na | na | |||
| LOAD 19/20 | chr11:121382085G>T | SORL1 | b | Assay failure | Assay failure | ||||||
| NL_Old 21/22 | chr1:207586559G>T | CR1 | b | Design fail | Design fail | ||||||
| EOAD 25/26 | chr2:128054946G>T | BIN1 | d | Design fail | Design fail | ||||||
| EOAD 25/26 | chr11:120914367G>T | SORL1 | b | Design fail | Design fail | ||||||
| NL_Old 21/22 | chr11:121256750G>T | SORL1 | b | Design fail | Design fail | ||||||
| EOAD 11/12 | chr11:121436267G>T | SORL1 | b | Design fail | Design fail | ||||||
| EOAD 25/26 | chr11:121250381G>T | SORL1 | d | Design fail | Design fail | ||||||
| EOAD 11/12 | chr11:121363100C>A | SORL1 | d | Design fail | Design fail | ||||||
Values are given as fractional abundance with 95% CIs. Total DNA and haploid copies are given, as well as the calculated proportion of the mutant allele in the tissue.
Old non-AD individual (NL_Old). Sample ID is according to Supplementary Material, Table S1, chromosomal position is according to hg19 and locus is according to Supplementary Material, Table S2. Somatic sequencing callers: a, MuTect1, MuTect2, Strelka2, Varscan2; b, MuTect1, MuTect2; c, MuTect1, Varscan2; d, MuTect2, Strelka2, Varscan2.
Figure 3SNV detected by all somatic callers and validated using ddPCR. (A) Overview of the CR1-locus selected on the library, covering five genes. A brain-specific SNV was identified in this region where chr1:207461994C>T was detected in the brain by ultra-deep sequencing. GeneHancer Regulatory Elements and Gene Interactions showed that the SNV is located in a regulatory region of the gene CD55. (B) The identification of chr1:207461994C>T by ultra-deep sequencing was validated with ddPCR to be present in DNA from temporal cortex and was absent in blood. (C) Predicted transcription factor binding sites in the DNA sequence surrounding chr1:207461994C>T for the wild-type allele (C allele) and the mutant allele (T allele). a, JASPAR; b, PROMO. (D) Transcriptional regulation of the CD55 gene is regulated by the FOXO3 transcription factor (data from PathwayNet).