| Literature DB >> 30214067 |
Wei Wei1, Michael J Keogh1, Juvid Aryaman2, Zoe Golder1, Peter J Kullar1, Ian Wilson3, Kevin Talbot4, Martin R Turner4, Chris-Anne McKenzie5, Claire Troakes6, Johannes Attems7, Colin Smith5, Safa Al Sarraj6, Chris M Morris7, Olaf Ansorge4, Nick S Jones2, James W Ironside5, Patrick F Chinnery8,9.
Abstract
PURPOSE: To systematically study somatic variants arising during development in the human brain across a spectrum of neurodegenerative disorders.Entities:
Keywords: brain; embryogenesis; exome sequencing; neurodegenerative disorders; somatic variant
Mesh:
Year: 2018 PMID: 30214067 PMCID: PMC6544539 DOI: 10.1038/s41436-018-0274-3
Source DB: PubMed Journal: Genet Med ISSN: 1098-3600 Impact factor: 8.822
Fig. 1Detection of somatic variants in 1461 postmortem human brains. (a) Brain regions sampled within the study. (b) The proportion and number of individuals in each cohort. (c) Unfiltered variant allele fraction (VAF) with between 10% and 35% against relative exome sequencing depth. Those that were present before and after filtering are shown (red and blue respectively). (d) Variant detection pipeline. Section I: Exons are shown in red, with intergenic and intronic regions as a black line. II: Regions of high genomic complexity and common structural variants (determined from population databases and previous studies) were removed (yellow line/gray box). III: Relative sequencing depth of each exon is shown in blue above the relevant exon. Bases in which the sequencing depth was below 30 (as depicted by the red dashed line) in an individual were removed. These regions are then shown by gray boxes on the schematic exome and were also removed. IV: Finally, regions in which copy-number variants (gains or losses) were called from array genotyping[10] were also removed from the overall panel. An example plot of the array genotyping in which a copy-number gain has been detected is shown. Again the corresponding region was removed from the exome depicted by a gray box on the exome panel. After these steps, remaining regions were subsequently subjected to analysis by deepSNV and a binomial test against the mean VAF for heterozygous variants (47%). (e) Schematic representation of the putative somatic alleles in the data set. A distribution of VAF in the whole data set is shown (pink histogram). Putative somatic alleles were those in which the VAF was greater than base error rate (as determined from deepSNV [green box and linked inset]), and those that also differed from the binomial threshold (<1 × 10−5) compared with an assumed VAF of 47% for heterozygosity. CJD Creutzfeldt–Jakob disease, PD-DLB Parkinson disease and dementia with Lewy bodies
Fig. 2Distribution and mutational profile of the validated somatic variants. (a) Distribution of allele frequencies for the validated variants in the study are shown, with the relative variant allele frequency (VAF) for each allele as detected on both the MiSeq (pink), exome sequencing (blue), and overlapping between two platforms (purple) shown. (b) Probability of a variant occurring in each cohort assuming a uniform prior probability and that each person is a Bernoulli trial with probability p of developing a pathogenic variant. (c) Mutational signature of all validated somatic variants. The mutated allele plus the flanking 3′ and 5′ base are shown. (d) Correlation between the mutational signature of validated somatic variants and the mutational profiles observed in de novo germline variants detected in the population[21] (top orange bar, signature A) and 21 forms of cancer[20] (purple bars). The probable disease associations, or type of cancer in which the signature was detected by Alexandrov[20] are shown next to the signature number. The Pearson correlation coefficient is shown for each signature. (e) Proportion of validated variants within genes grouped by brain proteome expression.[22] (f) Proportion of validated variants based on each quartile of the gene conservation scores within the germline (4th quartile being the most conserved in the germline). CJD Creutzfeldt–Jakob disease, FTD-ALS frontotemporal dementia or amyotrophic lateral sclerosis, PD-DLB Parkinson disease and dementia with Lewy bodies
Validated somatic variants in 1461 human brains
| Variant data | Clinical data | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Chromosome | Base position | Ref allele | Alt allele | Mutation | Gene | AA change | ExAC | SIFT score | SIFT | Human Proteome expression | Conservation quartile | Methylated in ICM | Exome VAF | Sample ID | Gender | Brain region | Disease group |
| chr1 | 248224344 | C | T | Non-syn |
| p.R121C | 4.94E-05 | 0.02 | D | Not detected | 2 | N | 15.2% | 1 | M | Cerebellum | Control |
| chr7 | 150815676 | C | T | Non-syn |
| p.S81L | N/A | 0.02 | D | Expressed in all | 4 | N | 22.3% | 2 | M | Temporal cortex | Control |
| chr11 | 104905100 | T | G | Non-syn |
| p.K37Q | 4.10E-03 | 0.59 | T | Expressed in all | 2 | N | 16.9% | 3 | M | Cerebellum | Control |
| chr17 | 39502849 | T | G | Non-syn |
| p.R316S | 1.10E-03 | 0.67 | T | Not expressed brain | 2 | N | 22.1% | 4 | M | Cerebellum | CJD |
| chr3 | 45837911 | T | C | Start lost |
| p.M1V | 8.77E-05 | 0.43 | T | Not expressed brain | 3 | N | 20.3% | 5 | M | Cerebellum | Alzheimer disease |
| chr3 | 122629742 | T | C | Non-syn |
| p.H59R | N/A | 0.02 | D | Elevated brain | 4 | N | 19.3% | 5 | M | Cerebellum | Alzheimer disease |
| chr19 | 36275201 | G | A | Non-syn |
| p.A517T | N/A | 0.13 | T | Mixed expression | 3 | N | 26.0% | 6 | M | Cerebellum | Other (PSP) |
| chr12 | 6138596 | C | T | Non-syn |
| p.R960P | 8.24E-06 | 0.26 | T | Mixed expression | 3 | N | 19.5% | 7 | F | Cerebellum | Control |
| chr11 | 56344581 | G | T | Non-syn |
| p.T206N | 1.20E-03 | 1 | T | Not detected | 2 | N | 13.0% | 8 | F | Cerebellum | Other (epilepsy) |
| chr16 | 4833750 | A | G | Non-syn |
| p.I131T | 1.68E-05 | 0.01 | D | Not expressed brain | 1 | N | 22.8% | 9 | M | Frontal cortex | Control |
| chr7 | 1535876 | C | T | Non-syn |
| p.D671N | 8.26E-06 | 0 | D | Expressed in all | 3 | N | 14.7% | 9 | M | Frontal cortex | Control |
| chr8 | 144921555 | T | C | Non-syn |
| p.I171V | 3.32E-04 | 0.15 | T | Expressed in all | 2 | N | 30.4% | 9 | M | Frontal cortex | Control |
| chr2 | 85991195 | C | T | Non-syn |
| p.R284W | 8.30E-06 | 0 | D | Mixed expression | 4 | Y | 28.3% | 10 | F | Temporal cortex | Control |
| chr1 | 24125194 | G | A | Non-syn |
| p.R50W | 1.66E-05 | 0.02 | D | Expressed in all | 2 | Y | 21.7% | 11 | F | Cerebellum | Control |
| chr1 | 17570577 | T | C | Non-syn |
| p.C126R | N/A | 0.01 | D | Not expressed brain | 1 | N | 25.3% | 12 | M | Cerebellum | Other (dementia) |
| chr17 | 76499013 | G | A | Syn |
| N/A | 4.05E-03 | N/A | N/A | Not expressed brain | N/A | Y | 28.2% | 12 | M | Cerebellum | Other (dementia) |
| chr11 | 1718844 | T | C | Syn |
| N/A | 1.65E-05 | N/A | N/A | Not detected | N/A | N | 14.9% | 13 | F | Cerebellum | FTD-ALS |
| chr19 | 9361855 | G | A | Non-syn |
| p.A46T | 2.50E-05 | 0 | D | Not expressed brain | 1 | Y | 30.2% | 13 | F | Cerebellum | FTD-ALS |
| chr20 | 60888258 | G | A | Syn |
| N/A | N/A | N/A | N/A | Expressed in all | N/A | N | 23.0% | 13 | F | Cerebellum | FTD-ALS |
| chr16 | 88712548 | G | A | Syn |
| N/A | N/A | N/A | N/A | Expressed in all | 2 | Y | 23.5% | 14 | F | Cerebellum | Control |
| chr22 | 50752254 | G | A | Non-syn |
| p.R398W | 1.66E-05 | 0 | D | Mixed expression | 1 | Y | 20.2% | 15 | M | Cerebellum | FTD-ALS |
| chr6 | 5004177 | G | A | Syn |
| N/A | 4.12E-05 | N/A | N/A | Expressed in all | N/A | N | 22.9% | 15 | M | Cerebellum | FTD-ALS |
Variant data shows the chromosome, base position, and reference and alternate allele (hg19 build), together with the amino acid change, frequency in the ExAC population data set,[13] SIFT annotation score and classification,[24] expression cohort in the Human Proteome Atlas,[21] the quartile of genetic conservation within the human genome,[13] presence of methylation at that base in the ICM of an early developmental human embryo,[18] and the VAF in the exome sequencing (ES) data. Clinical data for each individual comprising sample ID, gender, brain region, and disease group are shown.
AA amino acid, CJD Creutzfeldt–Jakob disease, D deleterious, FTD-ALS frontotemporal dementia–amyotrophic lateral sclerosis, ICM inner cell mass, PD-DLB Parkinson disease–dementia with Lewy bodies, N/A not applicable, Non-syn nonsynonymous, PSP progressive supranuclear palsy, Syn synonymous, T tolerated, VAF variant allele frequency
Fig. 3Early cell division after fertilization. Schematic diagram showing early embryonic development. An example of somatic variant (red) is shown, with the subsequent distribution of this variant within the embryo