| Literature DB >> 26474449 |
Abstract
For biological and statistical reasons it makes sense to combine information from variants at the level of the gene. One may wish to give more weight to variants which are rare and those that are more likely to affect function. A combined weighting scheme, implemented in the SCOREASSOC program, was applied to whole exome sequence data for 1392 subjects with schizophrenia and 982 with obesity from the UK10K project. Results conformed fairly well with null hypothesis expectations and no individual gene was strongly implicated. However, a number of the higher ranked genes appear plausible candidates as being involved in one or other phenotype and may warrant further investigation. These include MC4R, NLGN2, CRP, DONSON, GTF3A, IL36B, ADCYAP1R1, ARSA, DLG1, SIK2, SLAIN1, UBE2Q2, ZNF507, CRHR1, MUSK, NSF, SNORD115, GDF3 and HIBADH. Some individual variants in these genes have different frequencies between cohorts and could be genotyped in additional subjects. For other genes, there is a general excess of variants at many different sites so attempts at replication would be more difficult. Overall, the weighted burden test provides a convenient method for using sequence data to highlight genes of interest.Entities:
Keywords: Association; DNA variant; burden test; exome
Mesh:
Year: 2015 PMID: 26474449 PMCID: PMC4833177 DOI: 10.1111/ahg.12135
Source DB: PubMed Journal: Ann Hum Genet ISSN: 0003-4800 Impact factor: 1.670
Scheme used to assign weights to each variant according to the predicted effect. (In the analyses described in this report, INDEL variants were not in fact used.)
| Predicted effect | Weight |
|---|---|
| NULL_CONSEQUENCE | 1 |
| INTERGENIC | 1 |
| DOWNSTREAM | 1 |
| INTRONIC | 3 |
| 3PRIME_UTR | 5 |
| SYNONYMOUS_CODING | 3 |
| UPSTREAM | 5 |
| 5PRIME_UTR | 5 |
| SPLICE_SITE | 5 |
| STOP_LOST | 5 |
| NON_SYNONYMOUS_CODING | 10 |
| CODINGINDEL | 15 |
| FRAMESHIFT_CODING | 20 |
| STOP_GAINED | 20 |
Figure 1Q:Q plots for SLP obtained from SCOREASSOC compared to expected under null hypothesis. Positive SLPs indicate an excess of rare, functional variants in SZ subjects, negative SLPs indicate an excess in OB subjects. (A) Shows results using broad category of variants, (B) for narrow category.
Highest and lowest ranked genes with corresponding SLPs using SZ and OB definitions of caseness and including broad and narrow categories of variant
| Highest SLPs (SZ cases) | Lowest SLPs (OB cases) | ||||||
|---|---|---|---|---|---|---|---|
| Broad category | Narrow category | Broad category | Narrow category | ||||
| Symbol | SLP | Symbol | SLP | Symbol | SLP | Symbol | SLP |
|
| 4.3 |
| 3.8 |
| −5.5 |
| −4.8 |
|
| 4.1 |
| 3.4 |
| −4.8 |
| −4.6 |
|
| 3.8 |
| 3.3 |
| −4.7 |
| −4.5 |
|
| 3.5 |
| 3.3 |
| −4.7 |
| −4.4 |
|
| 3.4 |
| 3.3 |
| −4.7 |
| −4.3 |
|
| 3.4 |
| 3.2 |
| −4.6 |
| −4.3 |
|
| 3.2 |
| 3.1 |
| −4.6 |
| −4.1 |
|
| 3.1 |
| 3.0 |
| −4.5 |
| −4.0 |
|
| 3.0 |
| 3.0 |
| −4.5 |
| −3.9 |
|
| 2.9 |
| 2.9 |
| −4.3 |
| −3.8 |
|
| 2.9 |
| 2.9 |
| −4.3 |
| −3.8 |
|
| 2.9 |
| 2.9 |
| −4.2 |
| −3.8 |
|
| 2.9 |
| 2.8 |
| −4.1 |
| −3.7 |
|
| 2.9 |
| 2.8 |
| −4.1 |
| −3.6 |
|
| 2.8 |
| 2.8 |
| −4.0 |
| −3.6 |
|
| 2.8 |
| 2.8 |
| −4.0 |
| −3.5 |
|
| 2.8 |
| 2.8 |
| −3.9 |
| −3.5 |
|
| 2.7 |
| 2.7 |
| −3.9 |
| −3.5 |
|
| 2.7 |
| 2.7 |
| −3.9 |
| −3.5 |
|
| 2.7 |
| 2.6 |
| −3.9 |
| −3.4 |
|
| 2.6 |
| 2.6 |
| −3.8 |
| −3.3 |
|
| 2.6 |
| 2.6 |
| −3.8 |
| −3.3 |
|
| 2.6 |
| 2.6 |
| −3.8 |
| −3.2 |
|
| 2.6 |
| 2.5 |
| −3.8 |
| −3.2 |
|
| 2.5 |
| 2.5 |
| −3.8 |
| −3.2 |
|
| 2.5 |
| 2.5 |
| −3.7 |
| −3.2 |
|
| 2.5 |
| 2.5 |
| −3.6 |
| −3.2 |
|
| 2.5 |
| 2.5 |
| −3.6 |
| −3.2 |
|
| 2.4 |
| 2.4 |
| −3.5 |
| −3.2 |
|
| 2.4 |
| 2.4 |
| −3.5 |
| −3.2 |
Output from SCOREASSOC for the analysis of MC4R using the broad category of variants and treating SZ subjects as cases
| OB | SZ | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Position (hg19, chr18) | AA | AB | BB | MAF | AA | AB | BB | MAF | Weight | Variant effect | VCF annotation |
| 58038462 | 982 | 0 | 0 | 0.0000 | 1391 | 1 | 0 | 0.0004 | 9.99 | DOWNSTREAM | T>C |
| 58038470 | 981 | 1 | 0 | 0.0005 | 1391 | 0 | 0 | 0.0000 | 9.99 | DOWNSTREAM | C>G |
| 58038489 | 982 | 0 | 0 | 0.0000 | 1391 | 1 | 0 | 0.0004 | 9.99 | DOWNSTREAM | G>A |
| 58038514 | 982 | 0 | 0 | 0.0000 | 1391 | 1 | 0 | 0.0004 | 9.99 | DOWNSTREAM | C>T |
| 58038524 | 979 | 3 | 0 | 0.0015 | 1386 | 6 | 0 | 0.0022 | 9.93 | DOWNSTREAM | G>A |
| 58038612 | 982 | 0 | 0 | 0.0000 | 1391 | 1 | 0 | 0.0004 | 99.92 | NON_SYNONYMOUS_CODING | C>T:PolyPhen:benign(0.048) |
| 58038826 | 982 | 0 | 0 | 0.0000 | 1391 | 1 | 0 | 0.0004 | 99.92 | NON_SYNONYMOUS_CODING | C>T:PolyPhen:possibly_damaging(0.45) |
| 58038829 | 982 | 0 | 0 | 0.0000 | 1390 | 2 | 0 | 0.0007 | 99.85 | NON_SYNONYMOUS_CODING | C>T:PolyPhen:probably_damaging(1) |
| 58038832 | 968 | 14 | 0 | 0.0071 | 1361 | 31 | 0 | 0.0111 | 96.62 | NON_SYNONYMOUS_CODING | T>G:PolyPhen:benign(0.008) |
| 58038989 | 982 | 0 | 0 | 0.0000 | 1391 | 1 | 0 | 0.0004 | 49.96 | SYNONYMOUS_CODING | G>A |
| 58039013 | 982 | 0 | 0 | 0.0000 | 1391 | 1 | 0 | 0.0004 | 49.96 | SYNONYMOUS_CODING | A>G |
| 58039049 | 981 | 1 | 0 | 0.0005 | 1392 | 0 | 0 | 0.0000 | 49.96 | SYNONYMOUS_CODING | C>T |
| 58039203 | 982 | 0 | 0 | 0.0000 | 1391 | 1 | 0 | 0.0004 | 99.92 | NON_SYNONYMOUS_CODING | G>A:PolyPhen:probably_damaging(0.997) |
| 58039215 | 982 | 0 | 0 | 0.0000 | 1390 | 2 | 0 | 0.0007 | 99.85 | NON_SYNONYMOUS_CODING | T>C:PolyPhen:probably_damaging(0.99) |
| 58039219 | 981 | 1 | 0 | 0.0005 | 1392 | 0 | 0 | 0.0000 | 99.92 | NON_SYNONYMOUS_CODING | C>A:PolyPhen:probably_damaging(0.996) |
| 58039276 | 964 | 18 | 0 | 0.0092 | 1337 | 55 | 0 | 0.0198 | 94.55 | NON_SYNONYMOUS_CODING | C>T:PolyPhen:benign(0.042) |
| 58039301 | 982 | 0 | 0 | 0.0000 | 1391 | 1 | 0 | 0.0004 | 49.96 | SYNONYMOUS_CODING | G>A |
| 58039402 | 982 | 0 | 0 | 0.0000 | 1391 | 1 | 0 | 0.0004 | 99.92 | NON_SYNONYMOUS_CODING | C>T:PolyPhen:probably_damaging(0.985) |
| 58039473 | 982 | 0 | 0 | 0.0000 | 1391 | 1 | 0 | 0.0004 | 99.92 | NON_SYNONYMOUS_CODING | T>A>C:PolyPhen:benign(0) |
| 58039478 | 982 | 0 | 0 | 0.0000 | 1391 | 1 | 0 | 0.0004 | 199.85 | STOP_GAINED | G>T |
| 58039552 | 982 | 0 | 0 | 0.0000 | 1391 | 1 | 0 | 0.0004 | 99.92 | NON_SYNONYMOUS_CODING | T>C:PolyPhen:benign(0) |
| 58039642 | 981 | 1 | 0 | 0.0005 | 1391 | 1 | 0 | 0.0004 | 49.92 | 5PRIME_UTR | G>C |
The table shows genotype counts, frequencies, weights and effects for each variant. The weighted scores were calculated for each subject and the means compared. Mean scores OB = 3.4, SZ = 7.0, t(2372 df) = 3.8, p = 0.00015, SLP = 3.8.
List of some of the highest and lowest ranked genes with explanatory notes
| Symbol | SLP | Analysis | Gene name | Comments |
|---|---|---|---|---|
|
| 3.8 | SZ, broad | Melanocortin 4 receptor | Increased frequency among SZ subjects of two nonsynonymous variants previously reported to be protective against obesity |
|
| 3.5 | SZ, broad | Neuroligin 2 | Codes for postsynaptic protein and regarded as candidate gene for schizophrenia. Overall generally increased numbers of rare variants in SZ subjects with no individual variant strongly associated |
|
| 3.4 | SZ, broad | C‐reactive protein, pentraxin‐related | Involved in immunity and inflammation systems. Variants generally commoner in SZ subjects. Nonsynonymous variant at 1:159683814 is present in 17 SZ and 2 OB subjects. This is rs77832441, which has been reported to be associated with reduced CRP levels |
|
| 3.4 | SZ, narrow | Downstream neighbour of SON | Function unknown. Nonsynonymous variants at 21:34950728 seen in 21 SZ against 7 OB subjects and at 21:34955922 in 10 SZ and 0 OB subjects |
|
| 2.9 | SZ, broad | GABA A receptor, alpha 3 | Some previous reports of involvement of GABA receptors in schizophrenia. However, the gene is on the X chromosome and the result is likely an artifact due to counting hemizygote males as homozygotes |
|
| 4.1 | SZ, broad | General transcription factor IIIA | Result is driven by modest excess of many different variants |
|
| 2.9 | SZ, broad | Interleukin 36, beta | Involved in inflammation. There are stop variants at 2:113785602 and 2:113788694 in 7 and 2 SZ subjects and no OB subjects |
|
| 3.2 | SZ, narrow | Adenylate cyclase activating polypeptide 1 (pituitary) receptor | Previous reports of association with schizophrenia and involvement in adipose tissue expandability. Nonsynonymous variants at 7:31104520 and 7:31124376 commoner in SZ than OB subjects |
|
| 2.9 | SZ, narrow | Arylsulfatase A | Mutations in this gene are the known cause of metachromatic leucodystrophy, which can have features similar to schizophrenia. A few very rare nonsynonymous and splice site variants are seen only in SZ cases and the common nonsynonymous variant at 22:51065361 (rs6151415) has MAF 0.085 in SZ and 0.061 in OB subjects |
|
| 2.4 | SZ, narrow | Discs, large homolog 1 (drosophila) | Involved in synaptogenesis. A number of nonsynonymous variants somewhat commoner in SZ than OB subjects and non‐synonymous variant at 3:196792663 occurs in 10 SZ and no OB subjects |
|
| 2.5 | SZ, narrow | Salt‐inducible kinase 2 | Known to be involved in lipid homeostasis and adipogenesis. A number of nonsynonymous variants occur only in SZ subjects and nonsynonymous variant at 11:111590605 occurs in 14 SZ and 2 OB subjects |
|
| 3.3 | SZ, narrow | SLAIN motif family, member 1 | Involved in neurodevelopment. Nonsynonymous variant at 13:78320801 occurs in 18 SZ subjects and 1 OB subject |
|
| 2.6 | SZ, narrow | Ubiquitin‐conjugating enzyme E2Q family member 2 | Differentially expressed in mice with diet‐induced obesity. Excess of several nonsynonymous variants in SZ versus OB subjects |
|
| 3.3 | SZ, narrow | Zinc finger protein 507 | Disruption associated with neurodevelopmental disorders. Several nonsynonymous variants common in SZ subjects and nonsynonymous variant at 19:32844995 occurs in 9 SZ and no OB subjects |
|
| ‐3.9 | OB, broad | Corticotropin‐releasing hormone receptor 1 | Previously implicated in physiological pathways including obesity and response to stress. A haplotype of several noncoding variants is somewhat commoner in OB than SZ subjects. This haplotype extends through |
|
| −4.2 | OB, broad | Histone cluster 1, H2aj | Downstream variant at 6:27782031 has MAF 0.14 in OB and 0.11 in SZ subjects |
|
| −4.7 | OB, broad | Histone cluster 1, H4a | 3' UTR variant at 6:26022244 has MAF 0.14 in OB and 0.096 in SZ subjects. However, this variant is in LD with the one at 6:27782031 so these signals are not independent |
|
| ‐4.7 | OB, broad | Microtubule‐associated protein tau | A haplotype of several common variants is slightly commoner among OB subjects |
|
| −3.6 | OB, broad | Interacts with | |
|
| −5.5 | OB, broad | N‐ethylmaleimide‐sensitive factor | Interacts with |
|
| −4.6 | OB, broad | Small nucleolar RNA, C/D box 115‐15 | In Prader–Willi region and regulates alternative splicing of |
|
| −3.5 | OB, narrow | Growth differentiation factor 3 | Implicated in regulation of adiposity and energy expenditure. Nonsynonymous variant at 12:7842587 has frequency 0.040 in OB and 0.022 in SZ subjects |
|
| −3.2 | OB, narrow | 3‐hydroxyisobutyrate dehydrogenase | Differentially expressed in T2DM. SLP is driven by splice site or nonsynonymous variants of which 7 are singletons occurring in OB subjects and the other, at 7:27570942, occurs in 8 OB and 3 SZ subjects |