| Literature DB >> 19492049 |
Valerie W Hu1, AnhThu Nguyen, Kyung Soon Kim, Mara E Steinberg, Tewarit Sarachana, Michele A Scully, Steven J Soldin, Truong Luu, Norman H Lee.
Abstract
Despite the identification of numerous autism susceptibility genes, the pathobiology of autism remains unknown. The present "case-control" study takes a global approach to understanding the molecular basis of autism spectrum disorders based upon large-scale gene expression profiling. DNA microarray analyses were conducted on lymphoblastoid cell lines from over 20 sib pairs in which one sibling had a diagnosis of autism and the other was not affected in order to identify biochemical and signaling pathways which are differentially regulated in cells from autistic and nonautistic siblings. Bioinformatics and gene ontological analyses of the data implicate genes which are involved in nervous system development, inflammation, and cytoskeletal organization, in addition to genes which may be relevant to gastrointestinal or other physiological symptoms often associated with autism. Moreover, the data further suggests that these processes may be modulated by cholesterol/steroid metabolism, especially at the level of androgenic hormones. Elevation of male hormones, in turn, has been suggested as a possible factor influencing susceptibility to autism, which affects approximately 4 times as many males as females. Preliminary metabolic profiling of steroid hormones in lymphoblastoid cell lines from several pairs of siblings reveals higher levels of testosterone in the autistic sibling, which is consistent with the increased expression of two genes involved in the steroidogenesis pathway. Global gene expression profiling of cultured cells from ASD probands thus serves as a window to underlying metabolic and signaling deficits that may be relevant to the pathobiology of autism.Entities:
Mesh:
Year: 2009 PMID: 19492049 PMCID: PMC2685981 DOI: 10.1371/journal.pone.0005775
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Differentially expressed genes between autistic and control siblings (FDR = 13.5%).
| Genbank # | Gene Symbol | log2(ratio)* | SEM |
| AI421603 | ATP2B2 | −0.34 | −0.09 |
| N80619 | ATRN | −0.32 | −0.08 |
| H90147 | BCL7A | 0.36 | 0.08 |
| AA973009 | C16ORF44 | 0.32 | 0.07 |
| AI291693 | C21ORF34 | −0.48 | −0.11 |
| AA932364 | CCDC102B | 0.31 | 0.07 |
| AA412053 | CD9 | 0.35 | 0.08 |
| N51674 | COL24A1 | −0.65 | −0.14 |
| AA458486 | COMMD4 | 0.33 | 0.08 |
| T62491 | CXCR4 | 0.47 | 0.1 |
| AA410291 | FGD6 | 0.36 | 0.08 |
| AA025380 | GATA3 | 0.38 | 0.08 |
| AA487739 | GOT2 | 0.34 | 0.09 |
| AA884151 | GPR175 | 0.34 | 0.09 |
| AI283902 | HIST1H1A | 0.34 | 0.09 |
| AA007370 | HKR1 | 0.37 | 0.9 |
| AA984306 | HMBOX1 | 0.3 | 0.07 |
| AA609471 | IER5L | 0.54 | 0.13 |
| AA115054 | KCTD12 | −0.32 | −0.07 |
| H10192 | LIFR | 0.51 | 0.11 |
| AA625666 | LITAF | 0.34 | 0.07 |
| AA644559 | LMO4 | −0.33 | −0.07 |
| R83847 | LOC388335 | 0.37 | 0.83 |
| AA256157 | MIRH1 | 0.29 | 0.06 |
| H69786 | NFKBIZ | 0.38 | 0.08 |
| H15535 | PDE4DIP | 0.33 | 0.07 |
| AI221690 | PRKCZ | −0.3 | −0.07 |
| AI198213 | RNU12P | −0.55 | −0.12 |
| AA443899 | SCARB1 | 0.29 | 0.06 |
| AA044664 | SCN5A | −0.61 | −0.16 |
| AA148736 | SDC4 | 0.87 | 0.21 |
| R36874 | SRD5A1 | 0.29 | 0.06 |
| AI291307 | SVIL | −0.36 | −0.08 |
| N73575 | TRIM25 | 0.41 | 0.09 |
| AA429572 | WASF2 | 0.29 | 0.07 |
| R28287 | unknown | 0.41 | 0.09 |
| AI061421 | unknown | 0.35 | 0.08 |
| R32996 | unknown | 0.33 | 0.08 |
| H20826 | unknown | 0.31 | 0.07 |
| AI217709 | unknown | 0.3 | 0.07 |
| R12679 | unknown | 0.29 | 0.07 |
| AA424531 | unknown | −0.31 | −0.07 |
| AI141767 | unknown | −0.32 | −0.08 |
| AI122714 | unknown | −0.5 | −0.13 |
| AA939238 | unknown | −0.65 | −0.14 |
Figure 1A relational gene network constructed using Pathway Studio 5 from the dataset of significant genes identified by SAM analysis with 70% data filtering (see Table 1).
Red denotes genes with increased expression in the autistic sibling while green indicates decreased expression. Note that inflammation, epilepsy, liver disease, diabetes, and schizophrenia are among the pathological processes associated with this gene network while apoptosis, differentiation, and regulation of action potential are among the cellular processes that are influenced by this set of genes.
Biological functions identified by Ingenuity Pathway Analysis of significant differentially expressed genes (log2 ratio>±0.3) identified by SAM analysis (FDR = 13.5%).
| Category | Function Annotation | p-value | Molecules |
| Endocrine System Development and Function | biosynthesis of androgen/steroidogenesis | 4.89E-05 | SCARB1, SRD5A1 |
| proliferation of pancreatic duct cells | 3.69E-03 | CXCR4 | |
| quantity of 4-androstene-3,17-dione | 1.29E-02 | SRD5A1 | |
| Small Molecule Biochemistry | endocytosis of cholesterol | 1.85E-03 | SCARB1 |
| breakdown of progesterone | 1.85E-03 | SRD5A1 | |
| biosynthesis of norepinephrine | 5.53E-03 | GATA3 | |
| synthesis of ganglioside GM3 | 7.37E-03 | CD9 | |
| uptake of taurocholic acid | 1.83E-02 | PRKCZ | |
| Nervous System Development and Function | morphology of neurons | 5.49E-04 | CD9, GATA3 |
| morphology of Purkinje cells | 1.85E-03 | ATP2B2 | |
| morphology of serotonergic neurons | 1.85E-03 | GATA3 | |
| fusion of vagus cranial nerve ganglion | 1.85E-03 | LMO4 | |
| polarization of astrocytes | 1.85E-03 | PRKCZ | |
| development of cerebellum | 1.98E-03 | ATP2B2, CXCR4 | |
| branching of sympathetic neuron | 3.69E-03 | LIFR | |
| differentiation/quantity of central nervous system cells | 5.43E-03 | ATP2B2, LIFR | |
| morphology of central nervous system | 5.53E-03 | ATRN | |
| development of Purkinje cells | 1.10E-02 | CXCR4 | |
| migration of motor neurons | 1.83E-02 | GATA3 | |
| biogenesis of synapse | 2.74E-02 | ATP2B2 | |
| guidance of motor axons | 2.74E-02 | CXCR4 |
Significance calculated for each function is an indicator of the likelihood of that function being associated with the dataset by random chance. The range of p-values was calculated using the right-tailed Fisher's Exact Test, which compares the number of user-specified genes to the total number of occurrences of these genes in the respective functional/pathway annotations stored in the Ingenuity Pathways Knowledge Base.
Common regulators and targets of differentially expressed genes (from Table 1) identified by Pathway Studio 5 analysis.
| Regulators | Targets |
|
|
|
| androgen | androgen |
| Ca2+ | androstenedione |
| cAMP | Ca2+ |
| cholesterol | cAMP |
| dexamethasone | ceramide |
| estradiol | cholesterol |
| estrogen | cortisol |
| fatty acids | estradiol |
| glucocorticoid | estrogen |
| glucose | fatty acids |
| norepinephrine | glutamate |
| phospholipids | lipid |
| progesterone | melanin |
| RA | NO |
| siRNA | progesterone |
| steroids | testosterone |
| testosterone | tyrosine |
|
|
|
| diabetes mellitus | Crohn disease |
| fetal development | dementia |
| inflammation | diabetes mellitus |
| neural tube malformation | digestion |
| neuroblastoma | embryonic cell viability |
| endocrine abnormality | |
| endocrine function | |
| epilepsy | |
| fetal development | |
| hyperandrogenemia | |
| hyperinsulinemia | |
| inflammation | |
| long-term potentiation | |
| muscular dystrophy | |
| neural tube malformation | |
| neuron dysfunction | |
| neuron toxicity | |
| peripheral nerve function |
Gene ontology analysis using DAVID of significant differentially expressed genes with (log2 ratio>±0.3) identified by SAM analysis (FDR = 13.5%).
| Process | p-value | Genes |
| GO:0008092∼cytoskeletal protein binding | 3.46E-03 | SVIL, PDE4DIP, WASF2, CXCR4, SDC4, |
| GO:0030036∼actin cytoskeleton organization and biogenesis | 4.18E-03 | SVIL, PDE4DIP, WASF2, FGD6, |
| GO:0030029∼actin filament-based process | 5.06E-03 | SVIL, PDE4DIP, WASF2, FGD6, |
| GO:0007010∼cytoskeleton organization and biogenesis | 8.38E-03 | PRKCZ, SVIL, PDE4DIP, WASF2, FGD6, |
| GO:0003779∼actin binding | 9.61E-03 | SVIL, PDE4DIP, WASF2, CXCR4, |
| GO:0006996∼organelle organization and biogenesis | 3.47E-02 | PRKCZ, SVIL, PDE4DIP, HIST1H1A, WASF2, FGD6, |
| GO:0016043∼cellular component organization and biogenesis | 4.89E-02 | PRKCZ, ATP2B2, SVIL, PDE4DIP, HIST1H1A, WASF2, CXCR4, CD9, FGD6, |
| GO:0032502∼developmental process | 4.01E-04 | PRKCZ, SRD5A1, SVIL, HKR1, CD9, SCARB1, FGD6, LITAF, GPR175, ATP2B2, LMO4, CXCR4, ATRN, GATA3, |
| GO:0000003∼reproduction | 9.35E-03 | SRD5A1, ATP2B2, HIST1H1A, CXCR4, CD9, |
| GO:0065007∼biological regulation | 1.28E-02 | PRKCZ, BCL7A, SVIL, HKR1, CD9, LIFR, HMBOX1, FGD6, SCN5A, LITAF, ATP2B2, LMO4, CXCR4, ATRN, GATA3, |
| GO:0009653∼anatomical structure morphogenesis | 2.83E-02 | ATP2B2, LMO4, CXCR4, CD9, FGD6, GATA3, |
| GO:0007399∼nervous system development | 3.12E-02 | ATP2B2, LMO4, CXCR4, CD9, GATA3, |
| GO:0048646∼anatomical structure formation | 3.15E-02 | ATP2B2, CXCR4, CD9, |
Significance by Fisher's Exact Test.
Figure 2Confirmation of select differentially expressed genes by qRT-PCR analyses.
Five representative samples were analyzed per group for each gene, with each sample run in triplicate. The graph shows the average log2 ratios obtained for each gene for the 5 samples analyzed by qRT-PCR, for the same 5 samples analyzed by DNA microarrays, and for all 21 samples analyzed by DNA microarrays. *p-value<0.05; **p-value<0.004; The p-values for the qRT-PCR analysis of GATA3 was 0.069, and for the microarray analysis of SRD5A1 based on only 5 samples was 0.156. However, the p-values for GATA3 and SRD5A1 based on the microarray analysis of all 21 paired samples were <0.00006 and <0.002, respectively.
Figure 3Common regulators (identified by Pathway Studio 5) associated with the dataset of significant differentially expressed qRT-PCR confirmed genes.
Color coding of entities associated with gene network: Red – upregulated genes; pink – other genes which are part of the regulatory network constructed by the pathway analysis program; green - small molecules; orange - functional class; purple - disorders.
Figure 4Common targets (identified by Pathway Studio 5) associated with the dataset of significant differentially expressed qRT-PCR confirmed genes.
Colored entities are defined in legend to Fig. 3. Yellow entities describe cellular processes.
Figure 5A bionetwork that shows the relationships and interactions between SCARB1 and SRD5A1 at the gene, protein, and metabolite levels.
Briefly, SCARB1 is responsible for the uptake of cholesterol into cells while SRD5A1 converts testosterone to 5-α-dihydrotestosterone (DHT), a more potent form of the male hormone. We propose that increases in the expression of these genes may lead to an overall increase in the production of androgens. It is also of interest that bile acid synthesis is linked to this same pathway, thereby suggesting that altered expression of these genes in ASD may lead to disturbances of bile acid synthesis in some tissues as well.
Concentration of testosterone in LCL extracts from 3 pairs of autistic-nonautistic siblings as determined by HPLC-MS/MS analyses.
| Sample | Age | Status | Testosterone (ng/dL) | Ratio (autistic/normal) |
| HI0366 | 18 | autistic | 241 | 1.14 |
| HI0365 | 20 | normal | 212 | |
| HI0355 | 12 | autistic | 218 | ≥218 |
| HI0354 | 14 | normal | <1 | |
| HI2769 | 10 | autistic | 251 | 1.22 |
| HI2772 | 13 | normal | 206 |
Below level of detection.
Figure 6Separation of 1351 autistic probands (each represented by a point) into phenotypic groups on the basis of principal components analysis (PCA) of 123 scored items on the Autism Diagnostic Interview-Revised (ADIR) questionnaires for each individual which was obtained from the AGRE phenotypic database.
PCA divided the autistic individuals into 2 main groups. Hierarchical clustering of the ADIR data (data not shown) revealed that individuals in the smaller group were characterized by higher severity scores on spoken language items on the ADIR. These individuals are represented by the red points in the PCA. Hierarchical clustering also suggested 3 other phenotypic groups that were characterized by lower severity scores across all items (individuals coded blue), higher frequency of savant skills (individuals coded yellow), and intermediate severity across all items (individuals coded green). To restrict sample heterogeneity, this study used LCL only from individuals with severe language impairment (coded red) as identified by these 2 cluster analyses. Detailed methods used for the identification of distinct ASD behavioral phenotypes based on cluster analyses of ADIR scores are described by Hu and Steinberg (Autism Research (2009) in press).