| Literature DB >> 29986017 |
Igor B Rogozin1, E Michael Gertz1, Pasha V Baranov2, Eugenia Poliakov3, Alejandro A Schaffer1.
Abstract
We previously proposed that changes in the efficiency of protein translation are associated with autism spectrum disorders (ASDs). This hypothesis connects environmental factors and genetic factors because each can alter translation efficiency. For genetic factors, we previously tested our hypothesis using a small set of ASD-associated genes, a small set of ASD-associated variants, and a statistic to quantify by how much a single nucleotide variant (SNV) in a protein coding region changes translation speed. In this study, we confirm and extend our hypothesis using a published set of 1,800 autism quartets (parents, one affected child and one unaffected child) and genome-wide variants. Then, we extend the test statistic to combine translation efficiency with other possibly relevant variables: ribosome profiling data, presence/absence of CpG dinucleotides, and phylogenetic conservation. The inclusion of ribosome profiling abundances strengthens our results for male-male sibling pairs. The inclusion of CpG information strengthens our results for female-female pairs, giving an insight into the significant gender differences in autism incidence. By combining the single-variant test statistic for all variants in a gene, we obtain a single gene score to evaluate how well a gene distinguishes between affected and unaffected siblings. Using statistical methods, we compute gene sets that have some power to distinguish between affected and unaffected siblings by translation efficiency of gene variants. Pathway and enrichment analysis of those gene sets suggest the importance of Wnt signaling pathways, some other pathways related to cancer, ATP binding, and ATP-ase pathways in the etiology of ASDs.Entities:
Mesh:
Year: 2018 PMID: 29986017 PMCID: PMC6086092 DOI: 10.1093/gbe/evy146
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
The Number of Rare and Moderately Common SNVs (10% MAF threshold) that Have Positive (POS) and Negative (NEG) Values of the Translation Shift Score
| Data Set | #Families | Affected | Unaffected | |||
|---|---|---|---|---|---|---|
| POS | NEG | POS | NEG | |||
| All SNVs | ||||||
| 1800 | ||||||
| Ma–Mu | 744 | 417012 | 322569 | 415560 | 324143 | 0.0059 |
| Ma–Fu | 828 | 476502 | 378133 | 477754 | 379383 | 0.4136 |
| 105 | ||||||
| Fa–Fu | 123 | 70472 | 55199 | 71263 | 55826 | 0.4949 |
| All synonymous SNVs | ||||||
| 1800 | ||||||
| Ma–Mu | 744 | 214452 | 124876 | 213574 | 124709 | 0.2934 |
| 828 | ||||||
| Fa–Mu | 105 | 34065 | 19332 | 33543 | 19382 | 0.0798 |
| 123 | ||||||
| Synonymous SNVs with absolute values of codon shift score ≥ 0.5 | ||||||
| 1800 | ||||||
| 744 | ||||||
| 828 | ||||||
| Fa–Mu | 105 | 18310 | 14456 | 17840 | 14421 | 0.0687 |
| 123 | ||||||
| Synonymous SNVs with absolute values of codon shift score ≥ 0.5 and all non-synonymous SNVs | ||||||
| 1800 | ||||||
| 744 | ||||||
| 828 | ||||||
| 105 | ||||||
| Fa–Fu | 123 | 53658 | 49888 | 54092 | 50458 | 0.3548 |
Note.—One-tail Fisher exact tests (http://www.langsrud.com/fisher.htm) were used to test whether SNVs in affected individuals tend to have relatively more SNVs with a positive shift than unaffected individuals. Ma–Mu is affected male-unaffected male siblings, Ma–Fu is affected male—unaffected female siblings, Fa–Mu is affected female—unaffected male siblings, Fa–Fu is affected female—unaffected female siblings. Significant deviations according to the Fisher exact test from the homogeneous 2 × 2 tables are bold and underlined.
. 1.—Differences between affected and unaffected siblings using median translation shift scores. Scores were calculated in each individual for (a) all siblings and (b) affected male-unaffected male siblings.
Differences between Affected and Unaffected Siblings Using Median Translation Shift Scores Calculated in Each Individual
| Data Set | Affected | Unaffected | Paired | Paired Wilcoxon Z ( | ||
|---|---|---|---|---|---|---|
| Mean | SD | Mean | SD | |||
| All | 0.012 | 0.041 | 0.006 | 0.042 | ||
| Ma–Mu | 0.011 | 0.042 | 0.004 | 0.042 | ||
| Ma–Fu | 0.014 | 0.039 | 0.009 | 0.041 | ||
| Fa–Mu | 0.021 | 0.043 | 0.009 | 0.037 | ||
| Fa–Fu | −0.002 | 0.046 | 0.001 | 0.046 | −0.6 (0.52963) | 0.4 (0.69766) |
Two-tailed paired tests were used to compare median values of translation shift scores calculated in each individual. Ma–Mu is affected male-unaffected male siblings, Ma–Fu is affected male—unaffected female siblings, Fa–Mu is affected female—unaffected male siblings, Fa–Fu is affected female—unaffected female siblings. Codon usage frequencies were taken from (Semon et al. 2006), as used in (Poliakov et al. 2014). Results for other codon usage sets (supplementary table S5, Supplementary Material online) are similar to the (Semon et al. 2006) codon usage data.
. 2.—Differences in scores between affected and unaffected siblings. Scores were computed using (a) sum of +1 and −1 indicating a positive or negative sign of translation shift scores calculated in each individual for all siblings, (b) median signed ribosome profiling scores for normal brain samples for all siblings (G14n, supplementary table S2, Supplementary Material online) calculated in each individual, (c) median conservation scores calculated in each individual for all siblings, (d) the fraction of SNVs in the CpG context calculated in each individual for all siblings.
Differences between Affected and Unaffected Siblings Using Sum of +1 and −1 Indicating a Positive or Negative Sign of Translation Shift Scores Aggregated Over Each Individual
| Mean | SD | Mean | SD | |||
|---|---|---|---|---|---|---|
| All | 40.87 | 33.51 | 35.63 | 33.64 | ||
| Ma–Mu | 37.84 | 33.37 | 33.01 | 33.63 | ||
| Ma–Fu | 43.78 | 32.95 | 37.7 | 33.59 | ||
| Fa–Mu | 50.19 | 38.72 | 44.3 | 34.8 | 1.6 (0.1209) | 1.6 (0.1201) |
| Fa–Fu | 31.37 | 29.33 | 30.2 | 30.99 | 0.4 (0.6967) | 0.3 (0.7968) |
Note.—Two-tailed paired tests were used to compare the sum of the sign of the translation shift scores calculated in each individual. Ma–Mu is affected male-unaffected male siblings, Ma–Fu is affected male—unaffected female siblings, Fa–Mu is affected female—unaffected male siblings, Fa–Fu is affected female—unaffected female siblings. Codon usage frequencies were taken from (Semon et al. 2006).
Differences between Affected and Unaffected Siblings Using Median Signed Ribosome Profiling Scores for Normal Brain Samples (G14n, supplementary table S2, Supplementary Material online) Calculated in Each Individual
| Data Set | Affected | Unaffected | Paired | Paired Wilcoxon Z ( | ||
|---|---|---|---|---|---|---|
| Mean | SD | Mean | SD | |||
| All | 1.21 | 1.42 | 1.00 | 1.45 | 4 | |
| Ma–Mu | 1.33 | 1.18 | 1.18 | 1.27 | ||
| Ma–Fu | 1.31 | 1.4 | 1.06 | 1.42 | ||
| Fa–Mu | 1.34 | 1.42 | 1.09 | 1.28 | 1.5 (0.14894) | 1.4 (0.16684) |
| Fa–Fu | 0.92 | 1.57 | 0.88 | 1.52 | 0.2 (0.84266) | 0.3 (0.77661) |
Note.—Two-tail tests were used to compare values of ribosome profiling scores calculated in each individual. Ma–Mu is affected male-unaffected male siblings, Ma–Fu is affected male—unaffected female siblings, Fa–Mu is affected female—unaffected male siblings, Fa–Fu is affected female—unaffected female siblings.
Differences between Affected and Unaffected Siblings Using Median Conservation Scores Calculated in Each Individual
| Data Set | Affected | Unaffected | Paired | Paired Wilcoxon Z ( | ||
|---|---|---|---|---|---|---|
| Mean | SD | Mean | SD | |||
| All | 0.052 | 0.061 | 0.045 | 0.062 | ||
| Ma–Mu | 0.048 | 0.061 | 0.04 | 0.063 | ||
| Ma–Fu | 0.058 | 0.061 | 0.048 | 0.061 | ||
| Fa–Mu | 0.057 | 0.06 | 0.053 | 0.06 | 0.5 (0.60275) | 0.1 (0.88814) |
| Fa–Fu | 0.03 | 0.058 | 0.044 | 0.064 | −1.7 (0.09702) | 1.4 (0.16052) |
Note.—Two-tail tests were used to compare values of signed PhyloP conservation scores calculated in each individual. Ma–Mu is affected male-unaffected male siblings, Ma–Fu is affected male—unaffected female siblings, Fa–Mu is affected female—unaffected male siblings, Fa–Fu is affected female—unaffected female siblings.
Differences between Affected and Unaffected Siblings Using the Fraction of SNVs in the CpG Context and the Methylation Shift Score (Ms) Calculated in Each Individual
| Data Set | Affected | Unaffected | Paired | Paired Wilcoxon Z ( | ||
|---|---|---|---|---|---|---|
| Mean | SD | Mean | SD | |||
| Fraction of SNVs in the CpG context | ||||||
| All | 0.447 | 0.020 | 0.447 | 0.021 | −0.2 (0.816) | 0.6 (0.529) |
| Ma–Mu | 0.448 | 0.02 | 0.448 | 0.022 | −0.4 (0.718) | 0.5 (0.618) |
| Ma–Fu | 0.447 | 0.019 | 0.447 | 0.020 | −0.7 (0.489) | 1.1 (0.277) |
| Fa–Mu | 0.448 | 0.019 | 0.449 | 0.018 | −0.5 (0.621) | 0.5 (0.644) |
| Fa–Fu | 0.447 | 0.021 | 0.442 | 0.028 | ||
| Methylation shift score | ||||||
| All | 83.6 | 24.7 | 81.7 | 24.6 | ||
| Ma–Mu | 82.6 | 26.4 | 80.2 | 27.0 | 1.9 (0.062) | 1.7 (0.081) |
| Ma–Fu | 84.5 | 21.3 | 83.6 | 21.5 | 1.0 (0.312) | 0.9 (0.352) |
| Fa–Mu | 76.5 | 26.6 | 79.9 | 24.6 | −1.0 (0.297) | 0.9 (0.364) |
| Fa–Fu | 88.7 | 31.3 | 80.0 | 27.3 | ||
Note.—Two-tail tests were used to compare fractions/methylation shift scores calculated in each individual. Ma–Mu is affected male-unaffected male siblings, Ma–Fu is affected male—unaffected female siblings, Fa–Mu is affected female—unaffected male siblings, Fa–Fu is affected female—unaffected female siblings.
. 3.—Network of functionally connected genes. The network was reconstructed using the STRING program obtained by intersection of LASSO male–male list and downregulated modules in ASD patients as the input (Gupta et al. 2014; Voineagu et al. 2011). GO: 0045202 synapse-classified genes are shown in red.
Count of the Sign of the Coefficients in the Best Model Generated by Glmnet, for Those Genes that Also Passed Quality Filters
| Data Set | Score Used | Positive | Negative | Close to Zero | Total Tested |
|---|---|---|---|---|---|
| Ma–Mu | Sum of signs of translation shift scores | 597 | 569 | 58 | 16,942 |
| Fa–Fu | Sum of signs of translation shift scores | 48 | 60 | 5 | 15,045 |
| Fa–Fu | Count of CpG | 84 | 85 | 15 | 12,491 |
Note.—Models were produced separately for the Ma–Mu and Fa–Fu data sets, using either the sum of signs of the translation shift scores or the count of SNVs in CpG dinucleotides, as indicated in the second column. The “Positive” and “Negative” columns show the count of positive and negative coefficients with absolute value of at least 0.005. The “Close to Zero” column shows the count of coefficients with nonzero, but smaller, absolute value. The “Total Tested” column counts the number of genes considered by glmnet when producing the corresponding model, namely those genes for which at least one individual in the respective data set had a nonzero score.