| Literature DB >> 33837273 |
Xiaolong Cao1,2, Yeting Zhang1,2, Mohamed Abdulkadir1,2,3, Li Deng1,2, Thomas V Fernandez4,5, Blanca Garcia-Delgar6, Julie Hagstrøm7, Pieter J Hoekstra3, Robert A King4, Justin Koesterich1,2, Samuel Kuperman8, Astrid Morer6,9,10, Cara Nasello1,2, Kerstin J Plessen7,11, Joshua K Thackray1,2, Lisheng Zhou1, Andrea Dietrich3, Jay A Tischfield1,2, Gary A Heiman1,2, Jinchuan Xing12,13.
Abstract
Tourette's Disorder (TD) is a neurodevelopmental disorder (NDD) that affects about 0.7% of the population and is one of the most heritable NDDs. Nevertheless, because of its polygenic nature and genetic heterogeneity, the genetic etiology of TD is not well understood. In this study, we combined the segregation information in 13 TD multiplex families with high-throughput sequencing and genotyping to identify genes associated with TD. Using whole-exome sequencing and genotyping array data, we identified both small and large genetic variants within the individuals. We then combined multiple types of evidence to prioritize candidate genes for TD, including variant segregation pattern, variant function prediction, candidate gene expression, protein-protein interaction network, candidate genes from previous studies, etc. From the 13 families, 71 strong candidate genes were identified, including both known genes for NDDs and novel genes, such as HtrA Serine Peptidase 3 (HTRA3), Cadherin-Related Family Member 1 (CDHR1), and Zinc Finger DHHC-Type Palmitoyltransferase 17 (ZDHHC17). The candidate genes are enriched in several Gene Ontology categories, such as dynein complex and synaptic membrane. Candidate genes and pathways identified in this study provide biological insight into TD etiology and potential targets for future studies.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33837273 PMCID: PMC8501157 DOI: 10.1038/s41380-021-01094-1
Source DB: PubMed Journal: Mol Psychiatry ISSN: 1359-4184 Impact factor: 15.992
Summary of the multiplex families
| Family | Total Ind | TD | TD with OCD | TD with ADHD | WES (TD) | SNV/Indel | SNV/Indel Filtered | pVAAST (d) | pVAAST (r) | Candidate | Genotyping (TD) | CNV | CNV exonic |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| FAM1 | 9 | 8 | 5 | 2 | 6 (5) | 103,621 | 14,911 | 35 | 5 | 34 | 6 (5) | 276 | 134 |
| FAM2 | 9 | 7 | 4 | 5 | 6 (6) | 101,590 | 14,917 | 199 | 8 | 81 | 7 (6) | 170 | 56 |
| FAM3 | 15 | 7 | 1 | 0 | 12 (7) | 136,028 | 27,180 | 89 | 10 | 26 | 12 (6) | 261 | 85 |
| FAM4 | 18 | 11 | 5 | 2 | 17 (11) | 167,676 | 43,915 | 10 | 25 | 33 | 0 | NA | NA |
| FAM5 | 12 | 8 | 4 | 1 | 10 (7) | 140,006 | 28,164 | 1 | NA | 1 | 11 (7) | 240 | 44 |
| FAM6 | 9 | 5 | 1 | 0 | 6 (4) | 124,489 | 22,345 | 50 | NA | 40 | 6 (4) | 142 | 30 |
| FAM7 | 8 | 4 | 0 | 0 | 8 (4) | 124,813 | 22,209 | 35 | NA | 32 | 8 (4) | 184 | 37 |
| FAM8 | 5 | 3 | 3 | 1 | 4 (3) | 114,025 | 17,741 | 136 | NA | 125 | 4 (3) | 121 | 62 |
| FAM9 | 4 | 3 | 1 | 3 | 4 (3) | 105,502 | 15,191 | 93 | 11 | 91 | 4 (3) | 116 | 45 |
| FAM10 | 5 | 3 | 2 | 1 | 5 (3) | 110,102 | 17,159 | 51 | 10 | 50 | 0 | NA | NA |
| FAM11 | 18 | 9 | 3 | 2 | 13 (8) | 133,278 | 27,134 | 18 | NA | 16 | 14 (9) | 289 | 57 |
| FAM12 | 16 | 8 | 2 | 0 | 11 (6) | 129,089 | 23,991 | 14 | NA | 13 | 0 | NA | NA |
| FAM13 | 23 | 6 | 0 | 0 | 8 (5) | 123,730 | 21,873 | 218 | NA | 50 | 0 | NA | NA |
| Total | 151 | 82 | 31 | 17 | 110 (72) | 280,363 | 139,092 | 949 | 69 | 543 | 72 (47) | 1,799 | 550 |
Total Ind: individuals surveyed with phenotypes in each family.
TD: individuals diagnosed as TD (Tourette Disorder, Chronic Tic Disorder-Motor subtype, Chronic Tic Disorder-Vocal subtype, Chronic Tic Disorder-Combined subtype, transient tic disorder or tic disorder-Not Otherwise Specified. see Methods for diagnosis detail).
TD with OCD: individuals with both TD and OCD diagnosis.
TD with ADHD: individuals with both TD and ADHD diagnosis.
WES (TD): individuals subjected to whole exome sequencing in each family, the number of individuals diagnosed with TD are shown in parenthesis.
SNV/Indel: SNVs (single nucleotide variants) or Indels (insertion and deletions) in each family in the variant call results.
SNV/Indel Filtered: variants after removing those with AF ≥ 10% in the 1000 Genomes project or ≥ 5% in ExAC.
pVAAST genes (d)/ pVAAST genes (r): candidate gene counts reported by pVAAST running in the dominant/recessive (d/r) mode.
Candidate: candidate gene counts after filtering.
Genotyping (TD): the number of individuals subjected to microarray analysis for CNV detection and the number of individuals diagnosed with TD are shown in parenthesis.
CNV: count of candidate CNVs in the copy number variants analysis.
CNV exonic: count of candidate CNVs with overlap with exon regions of gene models.
Figure 1.Overall study design.
WES: whole-exome sequencing; SNP: single nucleotide polymorphism; AF: allele frequency; CNV: copy number variant; GO: gene ontology; PPI: protein-protein interaction.
Top candidate genes in each family
| Variant Annotation | Gene Annotation | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Family | Gene | Confidence | Chr | Position | AF | mutation | Effect | Segregation | pLI | mis_z | Other NDDs | TPM1 | TPM2 | TPM3 | IMPC | Interaction |
| FAM1 |
| H | 2 | 202,134,324 | 1.33E-05 | I->V | DT | 0|6 | 0.00 | 1.01 | 3 | 4 | 7 | 59 | ||
| FAM1 |
| H | 4 | 862,363 | 7.30E-03 | D->Y | DD | 0|6 | 0.01 | 0.02 | 145 | 44 | 22 | 103 | ||
| FAM1 |
| H | 2 | 219,204,814 | 3.33E-05 | Q->stop | .. | 0|6 | 0.00 | 0.32 | 167 | 254 | 52 | 4 | ||
| FAM1 |
| M | 19 | 1,048,950 | 6.40E-05 | G->R | DD | 0|5 | 0.00 | −1.47 | ASD | 66 | 45 | 19 | F | 5 |
| FAM2 |
| H | 16 | 57,413,660 | 5.10E-03 | A->V | TD | 1|6 | 0.61 | 0.84 | TD_simplex | 124 | 1165 | 43 | T | 36 |
| FAM2 |
| H | 2 | 196,866,433 | 1.07E-02 | M->R | DD | 1|6 | 0.00 | −1.09 | 6 | 6 | 30 | 5 | ||
| FAM2 |
| H | 16 | 19,194,929 | 4.00E-04 | E->D | DD | 1|6 | 0.00 | 1.18 | ASD | 47 | 150 | 24 | 3 | |
| FAM2 |
| M | 7 | 91,631,542 | 9.57E-05 | A->T | TT | 1|6 | 0.00 | −0.13 | ASD | 42 | 99 | 86 | 88 | |
| FAM2 |
| M | 15 | 23,048,912 | 1.10E-05 | V->M | .. | 1|6 | 0.01 | 1.94 | TD_CNV, ASD | 149 | 293 | 49 | F | 4 |
| FAM2 |
| L | 16 | 55,725,894 | 9.55E-05 | T->M | TD | 1|6 | 0.08 | 2.02 | ADHD | 0 | 1 | 9 | F | 27 |
| FAM3 |
| H | 10 | 73,270,925 | 3.00E-04 | A->T | TD | 2|12 | 0.00 | 0.71 | ADHD | 42 | 97 | 38 | T | 10 |
| FAM3 |
| H | 10 | 85,972,932 | 4.50E-03 | N->S | DT | 1|12 | 0.00 | −0.70 | 79 | 150 | 45 | 1 | ||
| FAM3 |
| H | 6 | 152,265,352 | 5.00E-04 | R->C | DD | 2|12 | 1.00 | 1.61 | 3 | 9 | 4 | 163 | ||
| FAM3 |
| H | 6 | 154,411,110 | 3.60E-03 | S->C | DD | 2|12 | 0.00 | −0.62 | ADHD | 3 | 13 | 7 | 51 | |
| FAM3 |
| L | 16 | 16,138,519 | 8.88E-05 | S->F | DT | 2|12 | 0.00 | 1.77 | 19 | 36 | 37 | 12 | ||
| FAM3 |
| L | 16 | 30,128,265 | 1.90E-03 | E->K | DD | 3|12 | 0.04 | 1.74 | ASD | 404 | 531 | 64 | 180 | |
| FAM4 |
| H | 13 | 25,458,540 | 3.18E-05 | E->stop | DD | 2|12 | 0.00 | 0.22 | 42 | 41 | 105 | T | 48 | |
| FAM4 |
| H | 12 | 77,243,235 | 3.20E-05 | I->T | TD | 1|12 | 1.00 | 2.57 | 91 | 134 | 165 | 31 | ||
| FAM4 |
| M | 11 | 10,327,330 | . | S->W | DD | 1|12 | 0.04 | −0.11 | 58 | 332 | 117 | 38 | ||
| FAM4 |
| M | 3 | 51,475,633 | 8.03E-06 | K->R | .T | 0|12 | 1.00 | 4.84 | 36 | 87 | 47 | 46 | ||
| FAM5 |
| H | 3 | 9,831,246 | 1.00E-03 | R->C | DT | 1|10 | 0.00 | 2.12 | 142 | 218 | 72 | 20 | ||
| FAM6 |
| H | 14 | 63,246,495 | 1.00E-04 | A->G | DD | 0|5 | 0.02 | 2.51 | 9 | 83 | 11 | 20 | ||
| FAM6 |
| H | 16 | 569,754 | 3.00E-04 | E->Q | TD | 0|5 | 1.00 | 1.27 | TD_simplex | 182 | 158 | 86 | T | 12 |
| FAM6 |
| H | 6 | 73,017,069 | 2.00E-04 | S->Y | DD | 1|6 | 0.99 | 2.01 | ASD | 169 | 282 | 34 | 19 | |
| FAM6 |
| L | 7 | 44,144,407 | 3.19E-05 | E->G | DT | 1|6 | 0.00 | 1.00 | 86 | 182 | 106 | F | 22 | |
| FAM7 |
| H | 7 | 122,033,277 | 2.00E-04 | S->L | DT | 1|8 | 0.37 | 1.09 | ASD | 332 | 679 | 83 | F | 4 |
| FAM7 |
| H | 20 | 25,457,679 | 6.00E-04 | A->S | DD | 0|8 | 0.00 | −0.98 | ASD | 63 | 61 | 68 | F | 40 |
| FAM7 |
| H | 20 | 30,816,172 | 3.18E-05 | E->K | TT | 0|8 | 0.96 | 0.95 | 18 | 46 | 35 | 4 | ||
| FAM7 |
| L | 20 | 32,880,242 | 3.60E-03 | G->R | DD | 0|8 | 0.04 | 1.55 | 109 | 361 | 263 | F | 51 | |
| FAM8 |
| H | 10 | 61,833,414 | 3.40E-03 | S->S | DD | 0|4 | 1.00 | 2.79 | TD_simplex, ASD, ADHD | 107 | 218 | 96 | 24 | |
| FAM8 |
| H | 5 | 13,807,706 | 9.56E-05 | M->I | TT | 0|4 | 0.00 | −0.77 | TD_simplex | 1 | 6 | 20 | F | 8 |
| FAM8 |
| H | 15 | 27,018,841 | 3.20E-03 | P->S | TT | 0|4 | 0.95 | 3.39 | TD_CNV, ASD, OtherNeuro | 58 | 182 | 81 | 15 | |
| FAM8 |
| H | 6 | 24,429,341 | 4.69E-03 | Q deletion | .. | 0|4 | 0.00 | −0.30 | TD_simplex | 36 | 42 | 14 | F | 17 |
| FAM8 |
| H | 17 | 47,117,430 | 6.70E-03 | H->Q | TT | 0|4 | 1.00 | 3.52 | TD_simplex | 0 | 113 | 290 | 27 | |
| FAM8 |
| H | X | 70,389,354 | 1.90E-03 | T->A | TT | 0|4 | 0.98 | 4.21 | ASD | 90 | 315 | 109 | T | 21 |
| FAM8 |
| M | 2 | 8,928,870 | . | A->V | DD | 0|4 | 0.07 | 2.26 | 99 | 792 | 376 | 20 | ||
| FAM8 |
| M | 14 | 92,583,927 | 4.30E-03 | M->L | TT | 0|4 | 0.00 | −0.21 | 249 | 667 | 218 | 12 | ||
| FAM8 |
| M | 9 | 79,938,036 | 3.80E-03 | R->C | DT | 0|4 | 0.00 | 1.76 | 22 | 33 | 26 | F | 26 | |
| FAM8 |
| L | X | 110,495,596 | 9.12E-05 | K->M | DD | 0|4 | 0.82 | 1.52 | 0 | 13 | 151 | 1 | ||
| FAM8 |
| L | 19 | 19,414,217 | . | R->G | DD | 0|4 | 1.00 | 1.21 | 76 | 132 | 39 | F | 26 | |
| FAM9 |
| H | 1 | 979,748 | 3.21E-02 | E->V | DD | 1|4 | 0.00 | 0.23 | 127 | 185 | 149 | 53 | ||
| FAM9 |
| H | 3 | 132,221,143 | 2.20E-03 | R->H | DD | 0|4 | 1.00 | 2.28 | 28 | 47 | 50 | 16 | ||
| FAM9 |
| H | 15 | 54,306,012 | 3.20E-03 | D->E | DD | 0|4 | 0.00 | −0.14 | 77 | 327 | 16 | F | 1 | |
| FAM9 |
| H | 7 | 100,806,657 | . | P->A | DD | 0|3 | 0.57 | 0.25 | 455 | 801 | 83 | T | 21 | |
| FAM9 |
| M | 1 | 109,910,100 | 2.20E-03 | I->V | TT | 0|4 | 1.00 | 2.10 | 440 | 544 | 102 | F | 28 | |
| FAM9 |
| M | 6 | 158,492,660 | 3.40E-03 | T->M | DD | 0|4 | 0.00 | 0.72 | 230 | 125 | 47 | F | 34 | |
| FAM9 |
| L | 5 | 125,880,710 | 4.50E-03 | T->A | DD | 0|4 | 0.00 | 0.37 | 96 | 138 | 139 | 9 | ||
| FAM9 |
| L | 1 | 26,764,735 | 2.50E-03 | R->Q | TT | 0|3 | 0.25 | 1.09 | 185 | 145 | 25 | F | 1 | |
| FAM10 |
| H | 16 | 87,432,453 | 7.10E-03 | E->Q | TD | 1|5 | 0.00 | 0.29 | 291 | 511 | 180 | 42 | ||
| FAM10 |
| H | 17 | 28,538,374 | 6.00E-04 | I->V | TT | 1|5 | 0.25 | 1.93 | ASD, ADHD | 1 | 8 | 19 | 25 | |
| FAM10 |
| M | 7 | 124,404,617 | 2.00E-04 | N->K | TT | 1|5 | 0.00 | 0.84 | ASD | 377 | 541 | 56 | 44 | |
| FAM10 |
| M | 11 | 124,740,144 | 3.80E-03 | D->N | DD | 1|5 | 0.00 | 0.61 | 26 | 38 | 688 | 2 | ||
| FAM10 |
| M | 7 | 49,951,764 | 4.00E-04 | E->Q | DD | 1|5 | 0.97 | 1.67 | 43 | 137 | 19 | 1 | ||
| FAM11 |
| M | 2 | 125,504,806 | 4.00E-04 | D->G | TT | 2|13 | 0.99 | 0.39 | ASD | 3 | 94 | 35 | F | 0 |
| FAM11 |
| M | 4 | 8,293,193 | 7.40E-03 | V->M | DD | 1|13 | 0.00 | 0.50 | 3 | 44 | 43 | F | 14 | |
| FAM11 |
| M | 12 | 50,189,151 | . | G->V | TT | 1|13 | 0.63 | 0.74 | 49 | 40 | 33 | 3 | ||
| FAM11 |
| L | 7 | 21,737,777 | . | S->R | DD | 3|13 | 0.00 | −5.61 | 1 | 15 | 33 | 4 | ||
| FAM11 |
| L | 7 | 107,836,262 | 3.00E-04 | N->S | TT | 3|13 | 0.02 | 1.91 | ASD | 243 | 675 | 309 | T | 21 |
| FAM12 |
| H | 1 | 175,097,769 | 2.50E-03 | V->I | TT | 0|11 | 0.00 | −0.54 | 0 | 11 | 13 | 8 | ||
| FAM12 |
| M | 13 | 111,082,914 | 4.00E-04 | H->L | TT | 1|11 | 0.00 | 2.19 | 19 | 925 | 439 | F | 62 | |
| FAM12 |
| M | 2 | 232,325,406 | 6.09E-03 | DE deletion | .. | 0|11 | 1.00 | 0.85 | 386 | 660 | 1028 | 95 | ||
| FAM12 |
| L | 11 | 103,339,392 | 1.16E-05 | V->M | TD | 0|11 | 0.00 | 0.91 | 9 | 19 | 70 | F | 13 | |
| FAM12 |
| L | 11 | 89,896,570 | 6.00E-04 | R->W | DD | 0|11 | 0.00 | 0.47 | 22 | 24 | 53 | 3 | ||
| FAM12 |
| L | 15 | 42,983,572 | 1.10E-03 | G->S | TT | 0|11 | 0.00 | 2.50 | 36 | 26 | 30 | F | 1 | |
| FAM13 |
| H | 16 | 21,098,323 | 5.70E-03 | R->S | DD | 1|8 | 0.00 | 0.03 | ASD | 1 | 5 | 22 | 9 | |
| FAM13 |
| H | 11 | 128,426,219 | . | E->K | DD | 1|8 | 0.78 | 2.52 | TD_simplex | 16 | 70 | 219 | 32 | |
| FAM13 |
| H | 16 | 9,858,173 | 3.70E-03 | N->K | TD | 1|8 | 1.00 | 2.83 | ASD, ADHD, OtherNeuro | 45 | 281 | 17 | F | 38 |
| FAM13 |
| H | 15 | 22,846,897 | 7.50E-03 | V->I | .. | 1|8 | TD_CNV, ASD | 23 | 32 | 24 | ||||
| FAM13 |
| M | 1 | 23,233,346 | 2.90E-03 | D->N | DT | 1|8 | 1.00 | 2.45 | ASD | 11 | 200 | 161 | 50 | |
| FAM13 |
| M | 1 | 233,802,497 | 7.10E-03 | R->R | TD | 1|8 | 0.03 | 0.72 | 261 | 501 | 154 | 22 | ||
| FAM13 |
| L | 5 | 141,336,635 | 3.30E-03 | T->M | DD | 1|8 | 0.00 | 0.78 | 13 | 24 | 20 | F | 1 | |
Confidence: confidence of genes as causal for TD based on evidence collected. H, high; M, medium; L: low.
AF: allele frequency in gnomAD. If a variant does not exist in gnomAD, the ExAC AF is used. Variants not found in both databases are empty.
Effect: deleterious effect predicted by SIFT and Pholyphen-2. D for deleterious, T for tolerated, “.” for data not available.
Segregation: the inconsistency between the observation and the expected mutation segregation in individuals. The text is: (false positive + false negative) | (total - unknown). Genes with empty values are candidate genes from pVAAST run using recessive mode.
pLI: probability of being loss-of-function intolerant score. Typically, pLI ≥ 0.9 is a cutoff for extremely loss of function intolerant.
mis_z: Z score for the deviation of observed counts from the expected number of missense mutations. Positive Z scores indicate increased intolerance to variation.
Other NDDs: overlap with other neurodevelopmental disorder gene lists in Table S1.
TPM1, TPM2, TPM3: max TPM values of brain tissues in GTEx, BrainSpan, and HDBR, respectively.
IMPC: mouse knock-out model results from IMPC. T: genes with “behavior/neurological phenotype” or “nervous system phenotype”; F: genes without “behavior/neurological phenotype” or “nervous system phenotype”; Empty: data not available.
Interaction: Number of interactions with NDD_all genes (excluding risk genes from the same family) defined by STRING, GIANT_v2 or ConsensusPathDB.
Figure 2.Information used for variant and gene prioritization.
Detailed description of the “Annotation” fields can be found in the Supplemental Methods and Table 2. PPI: Protein-protein interaction; GO: Gene Ontology; pLI: probability of being loss-of-function intolerant score; mis_z: Z score for the deviation of observed counts from the expected number of missense mutations; TPM: Transcript Per Million.
Figure 3.Protein-protein interaction (PPI) networks.
(A) PPI network of the 71 TD top candidate genes. Only genes that can be connected are shown. (B) PPI networks of the 71 TD top candidate genes not in (A). Other NDD_all genes were added as intermediate nodes if they interact with more than one TD top candidate genes. For intermediate nodes, only interactions with top candidate genes were included. (C) PPI network formed by NDD_all genes identified in axoneme (GO:0005930). (D) PPI networks formed by NDD_all genes in synaptic membrane (GO:0097060). To simplify the network, interactions between non-candidate genes were removed. PPI networks were defined by three databases, ConsensusPathDB, STRING, and GIANT_v2. Genes were colored by the gene lists (see Methods for details).