| Literature DB >> 33272211 |
Zongtao Yu1,2,3, Yuanyuan Fu4, Junmei Ai3, Jicai Zhang2, Gang Huang5, Youping Deng6.
Abstract
BACKGROUND: Evaluating the toxicity of chemical mixture and their possible mechanism of action is still a challenge for humans and other organisms. Microarray classifier analysis has shown promise in the toxicogenomic area by identifying biomarkers to predict unknown samples. Our study focuses on identifying gene markers with better sensitivity and specificity, building predictive models to distinguish metals from non-metal toxicants, and individual metal from one another, and furthermore helping understand underlying toxic mechanisms.Entities:
Keywords: Biomarker; Classification; Microarray; Toxic heavy metals
Mesh:
Substances:
Year: 2020 PMID: 33272211 PMCID: PMC7712572 DOI: 10.1186/s12859-020-3525-7
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Development of predicitve models for distinguishing metals from non-metal toxicants. a. SVM-RFE and InfoGain methods reached their highest accuracy at the feature size 50 and 150 respectively in 2007 dataset (Dataset 1). b SVM-RFE and InfoGain methods reached their highest accuracy at the feature size 25 and 100 respectively in 2007 dataset (Dataset 1). c SVM-RFE and InfoGain methods reached their highest accuracy at the feature size 25 and 10 respectively in 2008 dataset (Dataset 2). D Both SVM-RFE and InfoGain methods reached their highest accuracy at the feature size 25 in 2008 dataset (Dataset 2)
Fig. 2Prediction of metal from non-metal toxicants using independent datasets. a SVM-RFE feature selection method was overall better prediction accuracy than InfoGain in 2007 dataset (Dataset 1). b A higher prediciton accuracy was observed when using combined features trained model to predict 2007 dataset (Dataset 1). c A higher predcition accuracy was observed for predicting 2008 dataset (Dataset 2) using Feature 1. d A higher predcition accuracy was observed for predicting 2008 dataset (Dataset 2) using combined feaures
The best predictive model and gene signature for discrimination of metal from non-metal toxicants
| No. of probe sets | Cross validation error | Prediction error (D2 to D1) | Probe set ID | |
|---|---|---|---|---|
| D2 | C | |||
| 1 | 8 | 8 | 9 | A_44_P915194(FAM174B) |
| 5 | 4 | 5 | 8 | + A_42_P546708(KHDRBS3), A_44_P1034910(RTN2), A_42_P537091(FAM12B), A_43_P11261(AHNAK2) |
| 7 | 3 | 4 | 5 | + A_44_P593735(TC632928), A_42_P829301(SLC1A5) |
| 8 | 3 | 3 | 4 | + A_44_P427814(IGH-6) |
| 10 | 2 | 3 | 2 | + A_42_P537051(FAM70B), A_44_P1005988(CDIG2) |
| 14 | 0 | 1 | 1 | + A_43_P11561(ARNT), A_44_P1011716(ADORA2B), A_43_P11444(S100G), A_44_P608892(TC596871) |
| 15 | 0 | 0 | 0 | + A_43_P11861(DIO3) |
| 16 | 0 | 1 | 1 | + A_44_P175654 |
| 18 | 2 | 1 | 1 | + A_44_P1040207, A_44_P426107 |
| 25 | 2 | 2 | 1 | + A_43_P21000, A_44_P299835,A_44_P1040926 A_44_P751206 A_44_P961496 A_44_P471440 A_43_P21816 |
Fig. 3Heatmap shows the gene expression pattern and functional analysis of the gene signature that discriminates metal from non-metal toxicants
Pathway analysis of 15 gene markers
| Ingenuity Canonical Pathways | -Log( | Ratio | Molecules |
|---|---|---|---|
| Hypoxia Signaling in the Cardiovascular System | 1.45E00 | 1.43E-02 | ARNT |
| Renal Cell Carcinoma Signaling | 1.41E00 | 1.37E-02 | ARNT |
| VDR/RXR Activation | 1.36E00 | 1.25E-02 | S100G |
| TR/RXR Activation | 1.33E00 | 1.06E-02 | DIO3 |
| VEGF Signaling | 1.33E00 | 1.03E-02 | ARNT |
| HIF1α Signaling | 1.28E00 | 9.52E-03 | ARNT |
| Aryl Hydrocarbon Receptor Signaling | 1.12E00 | 6.37E-03 | ARNT |
| cAMP mediated Signaling | 1.07E00 | 6.1E-03 | ORA2B |
| G-Protein Coupled Receptor Signaling | 9.61E-01 | 4.59E-03 | ORA2B |
| Xenobiotic Metabolism Signaling | 8.56E-01 | 3.4E-03 | ARNT |
Physiological functional anlaysis of 15 gene markers
| Category | Molecules | |
|---|---|---|
| Cardiovascular System Development and Function | 5.75E-04-2.57E-02 | ADORA2B, ARNT |
| Tissue Morphology | 5.75E-04-1.15E-03 | ADORA2B, ARNT |
| Embryonic Development | 1.15E-03-5.17E-03 | S100G, ARNT |
| Hematological System Development and Function | 2.3E-03-2.3E-03 | ADORA2B |
| Organ Morphology | 2.3E-03-7.46E-03 | ARNT |
| Reproductive System Development and Function | 2.87E-03-3.9E-02 | S100G, ARNT |
| Endocrine System Development and Function | 3.45E-03-3.35E-02 | DIO3, ARNT |
| Tissue Development | 5.17E-03-5.17E-03 | ARNT |
| Skeletal and Muscular System Development and Function | 6.89E-03-6.89E-03 | ADORA2B |
| Digestive System Development and Function | 7.46E-03-7.46E-03 | ARNT |
| Hepatic System Development and Function | 7.46E-03-3.12E-02 | ARNT |
| Organismal Survival | 1.95E-02-1.95E-02 | DIO3, ARNT |
| Organismal Development | 3.4E-02-3.4E-02 | DIO3 |
| Organ Development | 3.9E-02-3.9E-02 | ARNT |
Fig. 4Identification of gene markers to separate individual metals. Both the training accuracy and prediction accuracy for 2007 dataset (Dataset 1) were increased with the the feature number increase. When the feature number was up to 7 and above, the training accuracy reached 100%
Fig. 5Heatmap shows the gene expression pattern and functional analysis of the gene markers that distinguish individual metals
Physiological functional anlaysis of 15 gene markers
| Category | Molecules | |
|---|---|---|
| Connective Tissue Development and Function | 4.32E-04-3.53E-02 | HSD11B2, AREG |
| Endocrine System Development and Function | 4.32E-04-4.31E-03 | HSD11B2 |
| Organ Morphology | 4.32E-04-2.59E-03 | HSD11B2, AREG |
| Reproductive System Development and Function | 4.32E-04-1.8E-02 | AREG |
| Tissue Development | 4.32E-04-3.53E-02 | HSD11B2, AREG |
| Organ Development | 8.63E-04-7.74E-03 | AREG |
| Skeletal and Muscular System Development and Function | 1.29E-03-3.53E-02 | HSD11B2, AREG |
| Tissue Morphology | 1.29E-03-4.74E-03 | HSD11B2, AREG |
| Embryonic Development | 3.02E-03-3.02E-03 | AREG |
| Tumor Morphology | 2.69E-02-4.61E-02 | AREG |
| Cardiovascular System Development and Function | 2.73E-02-2.73E-02 | HSD11B2 |
| Nervous System Development and Function | 2.86E-02-4.41E-02 | HSD11B2, AREG |
| Hair and Skin Development and Function | 2.98E-02-2.98E-02 | AREG |
Pathway analysis of 15 gene markers
| Ingenuity Canonical Pathways | -Log( | Ratio | Molecules |
|---|---|---|---|
| C21-Steroid Hormone Metabolism | 2.07E00 | 1.41E-02 | HSD11B2 |
| Complement System | 1.84E00 | 2.78E-02 | C8B |
| Glutathione Metabolism | 1.65E00 | 1.02E-02 | GSTM2 |
| PXR/RXR Activation | 1.54E00 | 1.16E-02 | GSTM2 |
| Androgen and Estrogen Metabolism | 1.51E00 | 6.99E-03 | HSD11B2 |
| Neuregulin Signaling | 1.43E00 | 1E-02 | AREG |
| Metabolism of Xenobiotics by Cytochrome P450 | 1.27E00 | 4.76E-03 | GSTM2 |
| Aryl Hydrocarbon Receptor Signaling | 1.24E00 | 6.37E-03 | GSTM2 |
| NRF2-mediated Oxidative Stress Response | 1.14E00 | 5.41E-03 | GSTM2 |
| LPS/IL-1 Mediated Inhibition of RXR Function | 1.1E00 | 4.88E-03 | GSTM2 |
| Xenobiotic Metabolism Signaling | 9.73E-01 | 3.4E-03 | GSTM2 |