| Literature DB >> 33907183 |
Suqin Guo1,2, Jiewei Liu3, Wenqiang Li1,2, Yongfeng Yang1,2, Luxian Lv1,2, Xiao Xiao4, Ming Li4, Fanglin Guan5, Xiong-Jian Luo6,7,8.
Abstract
Early onset schizophrenia (EOS, defined as first onset of schizophrenia before age 18) is a rare form of schizophrenia (SCZ). Though genome-wide association studies (GWASs) have identified multiple risk variants for SCZ, most of the cases included in these GWASs were not stratified according to their first age at onset. To date, the genetic architecture of EOS remains largely unknown. To identify the risk variants and to uncover the genetic basis of EOS, we conducted a two-stage GWAS of EOS in populations of Han Chinese ancestry in this study. We first performed a GWAS using 1,256 EOS cases and 2,661 healthy controls (referred as discovery stage). The genetic variants with a P < 1.0 × 10-04 in discovery stage were replicated in an independent sample (903 EOS cases and 3,900 controls). We identified four genome-wide significant risk loci for EOS in the combined samples (2,159 EOS cases and 6,561 controls), including 1p36.22 (rs1801133, Pmeta = 4.03 × 10-15), 1p31.1 (rs1281571, Pmeta = 4.14 × 10-08), 3p21.31 (rs7626288, Pmeta = 1.57 × 10-09), and 9q33.3 (rs592927, Pmeta = 4.01 × 10-11). Polygenic risk scoring (PRS) analysis revealed substantial genetic overlap between EOS and SCZ. These discoveries shed light on the genetic basis of EOS. Further functional characterization of the identified risk variants and genes will help provide potential targets for therapeutics and diagnostics.Entities:
Mesh:
Year: 2021 PMID: 33907183 PMCID: PMC8079394 DOI: 10.1038/s41398-021-01360-4
Source DB: PubMed Journal: Transl Psychiatry ISSN: 2158-3188 Impact factor: 6.222
Fig. 1Manhattan plot of EOS GWAS associations.
Associations were from the GWAS meta-analysis of the discovery stage (N = 1,256 EOS cases and 2,661 controls). The dashed black horizontal line indicates the suggestive P threshold (P < 1.0 × 10−04). The genome-wide significant P threshold (P < 5.0 × 10−08) is represented as the dashed red horizontal line. The arrows indicate the four genome-wide significant SNPs in our combined discovery and replication samples.
Genome-wide significant loci identified in this study.
| Chra | Position | Index SNP | A1/A2 | Discovery stage (1256 cases/2661 controls) | Replication stage (903 cases/3900 controls) | Meta-analysis (2159 cases/6561 controls) | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ORb | AFc | OR | AF | OR | ||||||||
| 1 | 11856378 | rs1801133 | A/G | 7.48 × 10−05 | 1.24 | 0.59/0.50 | 1.20 × 10−12 | 1.45 | 0.54/0.45 | 4.03 × 10−15 | 1.35 | 82.65 |
| 1 | 82919925 | rs1281571 | A/G | 6.44 × 10−05 | 1.24 | 0.44/0.40 | 1.70 × 10−04 | 1.22 | 0.45/0.40 | 4.14 × 10−08 | 1.23 | 0 |
| 3 | 46485369 | rs7626288 | A/G | 9.08 × 10−05 | 0.81 | 0.39/0.43 | 3.63 × 10−06 | 0.78 | 0.38/0.44 | 1.57 × 10−09 | 0.80 | 0 |
| 9 | 129673226 | rs592927 | A/G | 4.21 × 10−05 | 1.30 | 0.24/0.20 | 1.56 × 10−07 | 1.38 | 0.25/0.19 | 4.01 × 10−11 | 1.34 | 0 |
aChromosome. bOR: odds ratio (based on allele A1). cAF: allele frequency (case AF/control AF, based on allele A1). dFixed-effect meta-analysis P value. eI2 heterogeneity index of meta-analysis of discovery stage ASA group 1, group 2, and replication stage samples.
Fig. 2Locuszoom plots of the genome-wide significant (LD independent) risk variants.
a rs1801133 is located in the fifth exon of MTHFR. b rs1281571 is located in the downstream of LPHN2. c rs7626288 is located in the intron 12 of LTF. d rs592927 is located in the upstream of RALGPS1.
Fig. 3Genomic location of rs1801133 and the 3D structure of MTHFR.
a rs1801133 is located in the fifth exon of MTHFR. b rs1801133 is located in an evolutionary highly conserved genomic region. The amino acid encoded by rs1801133 was Ala in most of the species. However, a new amino acid (Val) has emerged in humans. c The 3D structure of MTHFR with different amino acids at rs1801133.
Fig. 4Genetic risk prediction accuracy in subgroup 1 EOS cases and controls (discovery stage) using different training sets.
PRSs were computed using GWAS summary statistics from three training sets. The red plot represents PRS result from EAS training set (including 22,778 SCZ cases and 35,362 controls). The green plot represents PRS result from CLOZUK + PGC2 training set (including 40,675 SCZ cases and 64,643 controls). The blue plot represents PRS result from Chinese+PGC2 training set (including 43,175 SCZ cases and 65,166 controls).