| Literature DB >> 32066673 |
Mo Li1, Lu Shen1, Luan Chen1, Cong Huai1, Hailiang Huang2,3, Xi Wu1, Chao Yang1, Jingsong Ma1, Wei Zhou1, Huihui Du1, Lingzi Fan4, Lin He5,6, Chunling Wan7, Shengying Qin8,9.
Abstract
Schizophrenia (SCZ) is a highly heritable psychiatric disorder that affects approximately 1% of population around the world. However, early relevant studies did not reach clear conclusions of the genetic mechanisms of SCZ, suggesting that additional susceptibility loci that exert significant influence on SCZ are yet to be revealed. So, in order to identify novel susceptibility genes that account for the genetic risk of SCZ, we performed a systematic family-based study using whole exome sequencing (WES) in 65 Han Chinese families. The analysis of 51 SCZ trios with both unaffected parents identified 22 exonic and 1 splice-site de novo mutations (DNMs) on a total of 23 genes, and showed that 12 genes carried rare protein-altering compound heterozygous mutations in more than one trio. In addition, we identified 26 exonic or splice-site single nucleotide polymorphisms (SNPs) on 18 genes with nominal significance (P < 5 × 10-4) using a transmission disequilibrium test (TDT) in all the families. Moreover, TDT result confirmed a SCZ susceptibility locus on 3p21.1, encompassing the multigenetic region NEK4-ITIH1-ITIH3-ITIH4. Through several different strategies to predict the potential pathogenic genes in silico, we revealed 4 previous discovered susceptibility genes (TSNARE1, PBRM1, STAB1 and OLIG2) and 4 novel susceptibility loci (PSEN1, TLR5, MGAT5B and SSPO) in Han Chinese SCZ patients. In summary, we identified a list of putative candidate genes for SCZ using a family-based WES approach, thus improving our understanding of the pathology of SCZ and providing critical clues to future functional validation.Entities:
Mesh:
Year: 2020 PMID: 32066673 PMCID: PMC7026419 DOI: 10.1038/s41398-020-0708-y
Source DB: PubMed Journal: Transl Psychiatry ISSN: 2158-3188 Impact factor: 6.222
Fig. 1Study design.
Our study design consisted of looking for novel susceptibility genes that account for the genetic risk of schizophrenia through a whole-exome sequencing method. First, we recruited 65 schizophrenia families. In our recruited samples, there were 51 trios (one affected proband and their healthy parents), 13 families with one schizophrenic parent and 1 family with both schizophrenic parents. Next, we used multiple strategies to identify variants that are associated with susceptibility to SCZ including de novo mutation, compound heterozygous mutation and transmission disequilibrium test. Then, we performed validation study for each class of variants. Finally, confirmed a list of putative candidate genes that were found to be associated with schizophrenia.
Overview of whole exome sequencing data.
| Average | Percent (%) | |
|---|---|---|
| Total bases(bp) | 8,481,712,437 | 100 |
| Q30 bases(bp) | 7,101,724,855 | 85.09 |
| Total reads | 72,234,480 | 100 |
| Aligned paired reads | 72,018,643 | 99.7 |
| Standard Deviation | ||
| Read depth | 47.84 | 11.7078 |
| 1× coverage | 0.9839 | 0.0009 |
| 10× coverage | 0.9416 | 0.0400 |
| 20× coverage | 0.8177 | 0.1057 |
| 30× coverage | 0.6599 | 0.1415 |
List of the validated de novo mutations in Chinese trios.
| Proband ID | Chromosome | Position | Gene Name | Transcript ID | Mutation type | Nucleotide change | AA change | Polyphen2_HVAR_score | Polyphen2_HVAR_pred |
|---|---|---|---|---|---|---|---|---|---|
| Sample_1079-3 | chr17 | 42458249 | NM_000419 | nonsynonymous | c.C1391T | p.P464L | 1 | D | |
| Sample_1848-3 | chr17 | 74944111 | NM_198955 | nonsynonymous | c.G2150A | p.G717E | 1 | D | |
| Sample_1874-3 | chr21 | 32617835 | NM_003253 | nonsynonymous | c.G1553A | p.S518N | 0.998 | D | |
| Sample_846-3 | chr11 | 65486372 | NM_001206833 | nonsynonymous | c.A1297G | p.K433E | 0.95 | D | |
| Sample_1173-3 | chr18 | 74154775 | NM_014643 | nonsynonymous | c.T236G | p.I79S | 0.939 | D | |
| Sample_1097-3 | chr11 | 47819409 | NM_015231 | nonsynonymous | c.A3211G | p.R1071G | 0.798 | P | |
| Sample_1049-3 | chr1 | 223284633 | NM_003268 | nonsynonymous | c.T1741C | p.F581L | 0.584 | P | |
| Sample_1097-3 | chr2 | 32449750 | NM_001302504 | nonsynonymous | c.C872A | p.A291D | 0.398 | B | |
| Sample_689-3 | chr21 | 34399472 | NM_005806 | nonsynonymous | c.T302C | p.M101T | 0.295 | B | |
| Sample_1861-3 | chr17 | 74533618 | NM_134268 | nonsynonymous | c.A7G | p.K3E | 0.124 | B | |
| Sample_841-3 | chr2 | 73495972 | NM_001080410 | nonsynonymous | c.G787A | p.E263K | 0.112 | B | |
| Sample_1874-3 | chr19 | 54723009 | NM_001081450 | nonsynonymous | c.A1415G | p.H472R | 0.032 | B | |
| Sample_1056-3 | chr10 | 121658253 | NM_007190 | nonsynonymous | c.C478G | p.P160A | 0.009 | B | |
| Sample_841-3 | chr2 | 128712785 | NM_024545 | nonsynonymous | c.A2170G | p.I724V | 0.007 | B | |
| Sample_1488-3 | chr1 | 23763715 | NM_001143778 | nonsynonymous | c.A1223T | p.H408L | 0 | B | |
| Sample_1128-3 | chr4 | 52765485 | NM_001040402 | nonsynonymous | c.T556C | p.F186L | 0 | B | |
| Sample_975-3 | chr8 | 27507253 | NM_016240 | stop gain | c.C42A | p.C14X | . | . | |
| Sample_641-3 | chr14 | 92469762 | NM_004239 | splicing | . | . | . | . | |
| Sample_549-3 | chr14 | 73640385 | NM_000021 | synonymous | c.G450T | p.L150L | . | . | |
| Sample_1067-3 | chr8 | 59728197 | NM_014729 | synonymous | c.G1092A | p.Q364Q | . | . | |
| Sample_769-3 | chr7 | 107013182 | NM_001161520 | synonymous | c.T786G | p.A262A | . | . | |
| Sample_939-3 | chr9 | 99797910 | NM_001201575 | synonymous | c.A687T | p.A229A | . | . | |
| Sample_1173-3 | chr17 | 40275186 | NM_033194 | synonymous | c.C318T | p.L106L | . | . |
Fig. 2TDT analysis Manhattan and Q-Q plot.
a We performed a transmission disequilibrium test (TDT) for 65 schizophrenia families. The blue line means P-value is 1 × 10−4. The most significant SNP is rs1051823, which was located in replication initiator 1 (REPIN1) gene downstream region (OR: 2.86; CI 95%: 1.59–5.16; P-value = 1.495 × 10−5). b This figure shows the expected distribution of P-values plotted against the observed distribution of combined asymptotic test P-value (because of the heterogeneity brought by the 14 SCZ families with affected parents and the impact of parental illness states on transmission disequilibrium) for our transmission disequilibrium test. Our study did not show any signs of global inflation of test statistics with 1.002.
Fig. 3Biological network integration of candidate genes by GeneMania.
Each node indicates a gene and each edge denotes an interaction between a pair of genes. Larger gray nodes represent the candidate genes entered, and smaller yellow nodes represent the derived interaction genes. A purple edge denotes a co-expression are reported in previous studies, gray edge denotes paired genes shared protein domains, pink denotes paired genes had physical interactions, whereas the orange edge denotes a predicted gene co-expression (PubMed: 12934013). The thickness of the edge indicates the weight in whole interaction network.