| Literature DB >> 29686828 |
Thanit Saeliw1, Chayanin Tangsuwansri1, Surangrat Thongkorn1, Weerasak Chonchaiya2, Kanya Suphapeetiporn3,4, Apiwat Mutirangura5, Tewin Tencomnao6, Valerie W Hu7, Tewarit Sarachana6.
Abstract
Background: Alu elements are a group of repetitive elements that can influence gene expression through CpG residues and transcription factor binding. Altered gene expression and methylation profiles have been reported in various tissues and cell lines from individuals with autism spectrum disorder (ASD). However, the role of Alu elements in ASD remains unclear. We thus investigated whether Alu elements are associated with altered gene expression profiles in ASD.Entities:
Keywords: Alu elements; Autism spectrum disorder; DNA methylation; Epigenetic regulation; Gene expression profiles; Lymphoblastoid cell lines; Neuroinflammation; Retrotransposon; Sex bias; Subgrouping
Mesh:
Year: 2018 PMID: 29686828 PMCID: PMC5902935 DOI: 10.1186/s13229-018-0213-9
Source DB: PubMed Journal: Mol Autism Impact factor: 7.509
Fig. 1Schematic diagram of experimental workflow. Our workflow initiated with the acquisition of blood-based gene expression profiles from GEO DataSets and human Alu-inserted gene lists. Fisher’s exact test was then used to identify differentially expressed genes (DEGs) with Alu insertions. A total of 320 overlapping genes among the selected study results were used to predict biological functions, diseases, and gene regulatory networks. Fifty-six LCLs were used as a model to investigate the association between the Alu methylation status and Alu expression profiles in LCLs
Details of the gene expression profiles obtained from GEO DataSets
| GSE DataSets | Titles | Sample type | Sample information | Number of transcripts | References | |||
|---|---|---|---|---|---|---|---|---|
| Sample matching | Sample size | Total | Cutoff filter at 70% | Available transcripts with Alu insertion | ||||
| GSE15402 | Gene expression profiling differentiates autism case-controls and phenotypic variants of autism spectrum disorders | Lymphoblastoid cell lines (LCLs) | Sex (male) and age-matched | 87 ASD and 29 controls | 116 | 14,834 | 6118 | Hu VW et al. [ |
| GSE18123 | Blood gene expression signatures distinguish autism spectrum disorders from controls | Whole blood | Sex (male) and age-matched | 170 ASD and 115 controls | 285 | 42,150 | 8575 | Kong SW et al. [ |
| GSE25507 | Autism and increased paternal age-related changes in global levels of gene expression regulation | Peripheral blood lymphocytes | Sex (male) matched | 82 ASD and 64 controls | 146 | 43,735 | 9437 | Alter MD et al. [ |
| GSE42133 | Disrupted functional networks in autism underlie early brain mal-development and provide accurate classification | Whole blood | Sex (male) matched | 91 ASD and 56 controls | 147 | 24,933 | 7873 | Pramparo T et al. [ |
| GSE6575 | Gene expression in blood of children with autism spectrum disorder | Whole blood | Sex (male) and age-matched | 35 ASD and 12 controls | 47 | 43,745 | 9437 | Gregg JP et al. [ |
Total number of the Alu-inserted gene lists from TranspoGene database
| Insertion type | Alu-inserted genes ( |
|---|---|
| All insertion types* | 13,534 |
| Exonized type | 812 |
| Exonic type | 1593 |
| Intronic type | 13,245 |
| Promoter type | 557 |
We obtained the lists of human genes with at least one Alu elements from the TranspoGene database. The lists can be categorized into five types of Alu insertions: exonized, exonic, intronic, promoter, and all insertion. Note that multiple Alu elements can be inserted within a single gene with different insertion types. Therefore, the total number of Alu-inserted genes is less than the sum of the exonic, exonized, intronic, and promoter types. These lists of Alu-inserted genes were used for subsequent overlap analyses with differentially expressed genes (DEGs) in ASD
*List of genes with at least one instance of one of the four types of Alu insertions
Fig. 2Alu element structure and illustration of COBRA for determining AluS methylation levels and patterns. a Alu elements are approximately 300 bp in length and have a dimeric structure that is separated by an A-rich region (A5TACA6) and ends with a poly-A tail. The left half of the Alu contains the A and B boxes, which are internal promoters for RNA polymerase III. b Illustration of the COBRA method designed to assess methylation of two CpGs at the internal promoter of AluS subfamilies. The four different methylation patterns of AluS were calculated from the percentages of differently digested products of 133, 75, 58, 43, and 32 bp. c Representative gel image from the COBRA for AluS subfamilies
Association analyses between the differentially expressed genes (DEGs) in ASD and the human Alu-inserted gene lists
| Insertion type | Comparison | GEO datasets | All differentially expressed genes | Upregulated genes | Downregulated genes | |||
|---|---|---|---|---|---|---|---|---|
| Genes ( | Genes ( | Genes ( | ||||||
| All insertion | ASD vs. control | GSE15402 | 0.799 | 215 | 0.926 | 100 | 0.799 | 116 |
| GSE18123 | < 0.00005 | 1492 | 1.000 | 255 | < 0.00005 | 1245 | ||
| GSE25507 | 0.012 | 869 | 0.256 | 522 | < 0.00005 | 355 | ||
| GSE42133 | < 0.00005 | 1001 | 0.015 | 387 | < 0.00005 | 624 | ||
| GSE6575 | < 0.00005 | 388 | 0.955 | 123 | < 0.00005 | 266 | ||
| Intronic | ASD vs. control | GSE15402 | 0.784 | 212 | 0.926 | 99 | 0.784 | 114 |
| GSE18123 | < 0.00005 | 1476 | 0.970 | 252 | < 0.00005 | 1231 | ||
| GSE25507 | 0.007 | 860 | 0.322 | 516 | < 0.00005 | 352 | ||
| GSE42133 | < 0.00005 | 985 | 0.012 | 382 | < 0.00005 | 613 | ||
| GSE6575 | < 0.00005 | 382 | 0.882 | 119 | < 0.00005 | 264 | ||
| Exonized | ASD vs. control | GSE15402 | 1.000 | 15 | 0.933 | 6 | 0.933 | 9 |
| GSE18123 | 0.009 | 102 | 0.926 | 14 | 0.003 | 88 | ||
| GSE25507 | 0.825 | 45 | 0.008 | 17 | 0.012 | 28 | ||
| GSE42133 | 0.008 | 75 | 0.306 | 27 | 0.015 | 49 | ||
| GSE6575 | 0.006 | 33 | 1.000 | 7 | < 0.00005 | 26 | ||
| Exonic | ASD vs. control | GSE15402 | 0.450 | 19 | 0.904 | 10 | 0.426 | 9 |
| GSE18123 | 0.001 | 177 | 0.306 | 21 | < 0.00005 | 158 | ||
| GSE25507 | 0.426 | 78 | 0.136 | 46 | 0.662 | 33 | ||
| GSE42133 | < 0.00005 | 148 | 0.799 | 43 | < 0.00005 | 106 | ||
| GSE6575 | 0.034 | 48 | 0.715 | 16 | 0.013 | 33 | ||
| Promoter | ASD vs. control | GSE15402 | 0.933 | 6 | 0.436 | 1 | 0.799 | 5 |
| GSE18123 | 0.240 | 37 | 0.898 | 8 | 0.268 | 29 | ||
| GSE25507 | 0.135 | 22 | 0.033 | 11 | 0.937 | 11 | ||
| GSE42133 | 0.831 | 35 | 0.447 | 19 | 0.255 | 16 | ||
| GSE6575 | 0.466 | 16 | 0.716 | 3 | 0.123 | 13 | ||
The list of DEGs from each gene expression profiling study was overlapped with the lists of Alu-inserted genes. Alu-inserted genes were categorized into five types of Alu insertions which included exonized, exonic, intronic, promoter, and combined insertion types. Fisher’s exact test with Benjamini-Hochberg correction (FDR = 0.05) was used to determine the association the DEGs and Alu-inserted genes, and P values of less than 0.05 were considered significant. The number of DEGs and adjusted P values are shown
Fig. 3Venn diagram of genes containing Alu that are differentially expressed in ASD. The significant DEGs with Alu insertions from each study based on Fisher’s exact test overlapped. The diagram shows the reproducibility of Alu-inserted genes that were differentially expressed in peripheral blood and blood-derived cell lines from ASD individuals. A total of 320 genes were selected to identify biological functions and gene regulatory networks through an Ingenuity Pathway Analysis (IPA)
Diseases and biological functions associated with reproducible DEGs with Alu insertion predicted by the Ingenuity Pathway Analysis (IPA)
| Disease or function annotation | Benjamini-Hochberg | No. of genes | Gene symbol |
|---|---|---|---|
| Autism or intellectual disability | 2.19E−04 | 21 | ABCB1, ADNP, ANKRD11, ARID1A, ATP6V1A, CAMTA1, CASP2, CDC42, CHD4, COL4A3BP, CREBBP, GNB1, OPA1, PTEN, SLC35A3, SMARCA2, SON, TRIO, UBE3A, YY1, ZMYND11 |
| Neuromuscular disease | 5.70E−04 | 34 | ABCB1, ADAM10, ALCAM, ANKRD11, ATP2A2, ATP6V1A, ATXN1, CANX, CASP2, CFLAR, GSK3B, HBP1, HMGCR, HSPA5, IFNAR2, IL7R, KIF1B, LDLR, MAP2K4, MBP, MBTPS1, NOTCH2, OSBPL8, PPP3CB, PTPRC, PTPRE, RUNX3, SSX2IP, TLR2, TOMM20, TRIO, USP13, WNK1, XRCC6 |
| Synthesis of reactive oxygen species | 7.52E−04 | 14 | CANX, CDC42, CYBB, ETS1, FCER1A, HGF, ITGB1, MAP2K4, MAPK1, PIK3CG, PTEN, SHC1, TLR2, TXNRD1 |
| Disorder of basal ganglia | 9.21E−04 | 29 | ABCB1, ANKRD11, ATP2A2, ATP6V1A, ATXN1, CA2, CASP2, CFLAR, GSK3B, HBP1, HMGCR, HSPA5, KIF1B, LDLR, MAP2K4, MBP, MBTPS1, NOTCH2, OSBPL8, PTPRE, RUNX3, SAMHD1, SSX2IP, TOMM20, TRIO, USP13, WNK1, XPR1, XRCC6 |
| Dyskinesia | 9.44E−04 | 23 | ABCB1, ANKRD11, ATP2A2, ATP6V1A, ATXN1, CASP2, CFLAR, HBP1, HMGCR, HSPA5, LDLR, MAP2K4, MBTPS1, NOTCH2, OSBPL8, PTPRE, RUNX3, SSX2IP, TOMM20, TRIO, USP13, WNK1, XRCC6 |
| Mental retardation | 1.13E−03 | 18 | ADNP, ANKRD11, ARID1A, ATP6V1A, CAMTA1, CASP2, CDC42, CHD4, COL4A3BP, CREBBP, GNB1, OPA1, SLC35A3, SMARCA2, SON, TRIO, YY1, ZMYND11 |
| Brain lesion | 1.36E−03 | 33 | ANKRD11, ANXA7, APC, ARCN1, ARID1A, ATP6V1A, CA2, CBL, CREBBP, CTBP2, DICER1, DOCK5, EHD4, HGF, HMGCR, IRS2, LDLR, LYST, NCOA1, NF1, PABPC1, PIK3R1, PRKCSH, PTEN, PTPN11, SAP130, SON, TBK1, TOP1, TRIM33, TRIP11, TRRAP, ZCCHC6 |
| Cognitive impairment | 1.44E−03 | 20 | ADNP, ANKRD11, ARID1A, ATP6V1A, CA2, CAMTA1, CASP2, CDC42, CHD4, COL4A3BP, CREBBP, GNB1, HMGCR, OPA1, SLC35A3, SMARCA2, SON, TRIO, YY1, ZMYND11 |
| Dementia | 1.62E−03 | 27 | ADAM10, APLP2, ATXN1, CA2, CANX, CASP2, DICER1, GSK3B, HMGCR, HSPA5, LDLR, LIMS1, NFE2L2, OPA1, PIK3R1, PTEN, PTPRE, RUNX3, SLC6A6, SMPD1, SPG21, SRPK2, TBK1, TFCP2, TRIO, UBQLN1, WDR7 |
Neurological diseases and functions are significantly associated with 320 overlapping genes that were identified in multiple studies. P values calculated by Fisher’s exact test with Benjamini-Hochberg correction (FDR = 0.05) and the number of genes for each function are shown
Canonical pathways associated with reproducible DEGs with Alu insertion predicted by the Ingenuity Pathway Analysis (IPA)
| Ingenuity canonical pathways | Benjamini-Hochberg | Gene symbol |
|---|---|---|
| ILK signaling | 6.39E−08 | PPP2R5E, GSK3B, CDC42, PTEN, IRS2, CREB1, PIK3R4, RHOQ, PIK3R1, MYH9, PIK3CG, PTPN11, MAP2K4, ITGB1, MAPK1, CREBBP, LIMS1, FNBP1, NACA |
| Neurotrophin/TRK signaling | 3.20E−07 | CDC42, MAP3K5, IRS2, CREB1, PTPN11, MAP2K4, PIK3R4, MAPK1, CREBBP, PIK3R1, SHC1, PIK3CG |
| NGF signaling | 5.11E−07 | CDC42, MAP3K5, IRS2, CREB1, PIK3R4, SMPD1, PIK3R1, PIK3CG, PTPN11, MAP2K4, TRIO, MAPK1, CREBBP, SHC1 |
| Reelin signaling in neurons | 1.02E−06 | GSK3B, ITGAL, IRS2, FYN, PTPN11, MAP2K4, ITGB1, PIK3R4, PIK3R1, YES1, LYN, PIK3CG |
| HGF signaling | 1.24E−06 | CDC42, MAP3K5, IRS2, PIK3R4, ELF2, PIK3R1, PIK3CG, PTPN11, MAP2K4, ITGB1, MAPK1, ETS1, HGF |
| ERK/MAPK signaling | 2.73E−06 | PPP2R5E, PAK2, IRS2, CREB1, PIK3R4, ELF2, PRKAG2, PIK3R1, PIK3CG, FYN, PTPN11, ITGB1, MAPK1, CREBBP, ETS1, SHC1 |
| Insulin receptor signaling | 5.09E−06 | GSK3B, PTEN, IRS2, PIK3R4, PRKAG2, RHOQ, PIK3R1, CBL, PIK3CG, FYN, PTPN11, MAPK1, SHC1 |
| Axonal guidance signaling | 2.47E−05 | SEMA4D, GSK3B, CDC42, PAK2, IRS2, PIK3R4, PLXNC1, GNB1, PPP3CB, PRKAG2, ADAM10, PIK3R1, RASSF5, PIK3CG, GNAI2, FYN, PPP3R1, PTPN11, ITGB1, MAPK1, PLCL2, SHC1 |
| IL-6 signaling | 4.05E−05 | ABCB1, MAP4K4, IRS2, PTPN11, MAP2K4, PIK3R4, MAPK1, PIK3R1, SHC1, IL6ST, PIK3CG |
| Neuroinflammation signaling pathway | 4.51E−05 | GSK3B, IRS2, CREB1, PIK3R4, PPP3CB, PIK3R1, TBK1, PIK3CG, CFLAR, PPP3R1, PTPN11, MAP2K4, MAPK1, CREBBP, CYBB, TLR2, NFE2L2 |
| Glucocorticoid receptor signaling | 4.78E−05 | IRS2, CREB1, NCOA1, PIK3R4, SMARCA2, PPP3CB, PRKAG2, PIK3R1, PIK3CG, TAF4, PPP3R1, PTPN11, MAP2K4, HSPA5, MAPK1, CREBBP, ARID1A, SHC1 |
| PI3K/AKT signaling | 1.22E−04 | PPP2R5E, GSK3B, MAP3K5, PTEN, ITGB1, MAPK1, LIMS1, PIK3R1, SHC1, PIK3CG |
| CREB signaling in neurons | 1.49E−04 | IRS2, CREB1, PIK3R4, GNB1, PRKAG2, PIK3R1, PIK3CG, GNAI2, PTPN11, MAPK1, CREBBP, PLCL2, SHC1 |
| Synaptic long-term potentiation | 6.29E−03 | PPP3R1, CREB1, PPP3CB, MAPK1, CREBBP, PRKAG2, PLCL2 |
| Estrogen receptor signaling | 9.05E−03 | TAF4, NCOA1, CTBP2, MAPK1, CREBBP, TRRAP, SHC1 |
| Androgen signaling | 1.19E−02 | GNAI2, NCOA1, GNB1, MAPK1, CREBBP, PRKAG2, SHC1 |
Canonical pathways are significantly associated with 320 overlapping genes that were identified in multiple studies. P values calculated by Fisher’s exact test with Benjamini-Hochberg correction (FDR = 0.05) for each function are shown
Fig. 4Predicted gene regulatory network of the overlapping genes associated with neurological disease. This network revealed interactions or relationships among the overlapping molecules (gray background) and with other molecules from the IPA database (white background) that play a role in several mechanisms associated with neurological disease and estrogen receptor and androgen signaling, which is known to be associated with sex bias in ASD (labeled pink)
COBRA-derived percentages of AluS methylation and patterns in LCLs from ASD individuals and sex- and age-matched controls
| Comparison | Sample groups | Age (mean ± SD) | Percentages of AluS methylation patterns | ||||
|---|---|---|---|---|---|---|---|
| %mC | %mCmC | %uCmC | %mCuC | %uCuC | |||
| ASD vs. control (sex- and age-matched) | Control ( | 15 ± 6.97 | 37.98 ± 1.36 | 25.70 ± 2.36 | 18.71 ± 1.15 | 21.76 ± 1.03 | 33.83 ± 1.69 |
| ASD ( | 13.4 ± 4.55 | 37.86 ± 2.07 | 25.92 ± 2.69 | 18.72 ± 1.66 | 21.22 ± 1.76 | 34.13 ± 3.00 | |
| 0.866 | 0.866 | 0.974 | 0.315 | 0.845 | |||
| Subgroup M vs. control (sex- and age-matched) | Control ( | 12.1 ± 3.81 | 38.07 ± 1.52 | 25.57 ± 2.31 | 18.76 ± 0.87 | 22.06 ± 0.69 | 33.62 ± 1.85 |
| ASD subgroup M ( | 12.1 ± 3.73 | 39.03 ± 1.16 | 25.17 ± 1.15 | 20.06 ± 0.92 | 22.99 ± 0.82 | 31.77 ± 1.76 | |
| 0.293 | 0.845 |
| 0.089 | 0.168 | |||
| Subgroup L vs. control (sex- and age-matched) | Control ( | 13.7 ± 1.64 | 37.84 ± 1.51 | 24.88 ± 2.94 | 19.10 ± 1.60 | 22.25 ± 1.03 | 33.77 ± 1.84 |
| ASD subgroup L ( | 13.5 ± 1.97 | 36.024 ± 1.43 | 23.37 ± 2.63 | 19.32 ± 1.69 | 20.93 ± 2.52 | 36.38 ± 2.29 | |
| 0.168 | 0.569 | 0.866 | 0.502 | 0.168 | |||
| Subgroup S vs. control (sex- and age-matched) | Control ( | 15 ± 6.97 | 37.98 ± 1.36 | 25.70 ± 2.36 | 18.71 ± 1.15 | 21.76 ± 1.03 | 33.83 ± 1.69 |
| ASD subgroup S ( | 15 ± 5.41 | 37.83 ± 2.23 | 27.06 ± 2.67 | 17.88 ± 1.44 | 20.43 ± 1.17 | 34.64 ± 3.00 | |
| 0.866 | 0.242 | 0.168 |
| 0.502 | |||
The percentages of AluS methylation patterns were determined based on four patterns: the hypermethylated pattern (mCmC), two partially methylated patterns (mCuC, uCmC), and the hypomethylated pattern (uCuC). Comparisons of the methylation status between ASD and sex- and age-matched unaffected control groups and between ASD phenotypic subgroups and the matched unaffected controls were also performed. Statistically significant P values < 0.05 with Benjamini-Hochberg correction (FDR = 0.05) are shown in italics
Fig. 5Box plot of the Alu methylation patterns in the LCLs of ASD subgroup M. In ASD subgroup M, the percentage of the partially methylated pattern uCmC (20.06% ± 0.92%) was significantly increased. In ASD subgroup S, the partially methylated pattern mCuC was significantly decreased. *adjusted P value < 0.05
Quantitative RT-PCR analyses of AluS expression levels in the LCLs of ASD and control groups
| Group | Fold change (FC) | Log2 (FC) | |
|---|---|---|---|
| ASD vs. control | 1.75 | 0.81 | 0.316 |
| Subgroup M vs. control | 1.05 | 0.06 | 0.953 |
| Subgroup L vs. control | 0.29 | − 1.77 | 0.032 |
| Subgroup S vs. control | 3.68 | 1.88 | 0.038 |
The levels of Alu transcripts were normalized to the housekeeping gene GAPDH. The AluS expression levels were calculated using the 2−ΔΔCt method, and differences with a P value < 0.05, as determined by two-tailed t tests with Benjamini-Hochberg correction, were considered significant
Fig. 6Correlation analysis between AluS methylation and expression level for all LCL samples. The AluS expression for each LCL was normalized with the average GAPDH dCt of the control group. The Alu expression levels were then calculated using the 2−ΔΔCt method
Fig. 7Correlation analysis between AluS methylation and expression levels in ASD subgroup M and sex- and age-matched controls. The AluS expression of each LCL was normalized to the average GAPDH dCt of the control group. The Alu expression levels were then calculated using the 2−ΔΔCt method
Fig. 8Schematic diagram illustrating a possible mechanism of Alu elements in ASD. Our model suggests that exposure to environmental factors or dysregulation of other DNA methylation regulatory mechanisms lead to changes in CpG methylation patterns in Alu elements. Such changes alter transcription factor binding and, possibly in combination with other Alu regulatory mechanisms, cause the dysregulation of the expression and retrotransposition of Alu elements. Disrupted Alu retrotransposition results in changes in target genes via cis-/trans-regulatory mechanisms, which, in turn, dysregulate gene expression and gene regulatory networks known to be negatively impacted in ASD