| Literature DB >> 29159232 |
Andrew Williams1, Sabina Halappanavar1.
Abstract
This article contains data related to the research article 'Application of bi-clustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials' (Williams and Halappanavar, 2015) [1]. The presence of diverse types of nanomaterials (NMs) in commerce has grown significantly in the past decade and as a result, human exposure to these materials in the environment is inevitable. The traditional toxicity testing approaches that are reliant on animals are both time- and cost- intensive; employing which, it is not possible to complete the challenging task of safety assessment of NMs currently on the market in a timely manner. Thus, there is an urgent need for comprehensive understanding of the biological behavior of NMs, and efficient toxicity screening tools that will enable the development of predictive toxicology paradigms suited to rapidly assessing the human health impacts of exposure to NMs. In an effort to predict the long term health impacts of acute exposure to NMs, in Williams and Halappanavar (2015) [1], we applied bi-clustering and gene set enrichment analysis methods to derive essential features of altered lung transcriptome following exposure to NMs that are associated with lung-specific diseases. Several datasets from public microarray repositories describing pulmonary diseases in mouse models following exposure to a variety of substances were examined and functionally related bi-clusters showing similar gene expression profiles were identified. The identified bi-clusters were then used to conduct a gene set enrichment analysis on lung gene expression profiles derived from mice exposed to nano-titanium dioxide, carbon black or carbon nanotubes (nano-TiO2, CB and CNTs) to determine the disease significance of these data-driven gene sets. The results of the analysis correctly identified all NMs to be inflammogenic, and only CB and CNTs as potentially fibrogenic. Here, we elaborate on the details of the statistical methods and algorithms used to derive the disease relevant gene signatures. These details will enable other investigators to use the gene signature in future Gene Set Enrichment Analysis studies involving NMs or as features for clustering and classifying NMs of diverse properties.Entities:
Keywords: Bi-clustering; Nanomaterials; Predictive toxicology; Toxicogenomics
Year: 2017 PMID: 29159232 PMCID: PMC5683856 DOI: 10.1016/j.dib.2017.10.060
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Workflow employed for the discovery phase of the analysis.
Studies used for Gene Set Exploration Phase.
| Geo accession | Phenotype/model | Microarray platform (GEO GPL ID) | Reference |
|---|---|---|---|
| Lung inflammation | UCSF 10Mm Mouse v.2 Oligo Array (GPL1089); UCSF GS Operon Mouse v.2 Oligo Array (GPL3330); UCSF 11Mm Mouse v.2 Oligo Array (GPL3331); UCSF 7Mm Mouse v.2 Oligo Array (GPL3359) | ||
| Lung tumors | Affymetrix Mouse Genome 430 2.0 Array (GPL1261) | ||
| Asthma | Affymetrix Mouse Genome 430 2.0 Array (GPL1261) | ||
| Emphysema | Affymetrix Mouse Genome 430 2.0 Array (GPL1261) | ||
| Emphysema | Agilent-011978 Mouse Microarray G4121A (GPL891) | ||
| Small cell lung cancer | Affymetrix Mouse Genome 430 2.0 Array (GPL1261) | ||
| Lung carcinogenesis | Illumina MouseRef-8 v2.0 Expression Beadchip (GPL6885) | ||
| Pulmonary fibrosis | Affymetrix Mouse Genome 430 2.0 Array (GPL1261) | ||
| Spontaneous lung tumors | Affymetrix Mouse Genome 430 2.0 Array (GPL1261) | ||
| Idiopathic pulmonary fibrosis | Affymetrix Mouse Genome 430 2.0 Array (GPL1261) | ||
| Lung cancer | Illumina Mouse WG-6 v2.0 expression beadchip (GPL6887) | ||
| Chronic obstructive pulmonary disease (COPD) | Illumina MouseRef-8 v2.0 Expression Beadchip (GPL6885) |
NM Studies used for the Gene Set Enrichment Analysis.
| Geo Accession | Nanomaterial | Doses | Time Points | Reference |
|---|---|---|---|---|
| CNT: MWCNT-7 | 10 µg, 20 µg, 40 µg and 80 µg | 1, 3, 28 and 56 days | ||
| CB: Printex 90 | 18 µg, 54 µg and 162 µg | 1, 3 and 28 days | ||
| TiO2: UV-Titan L181 | 18 µg, 54 µg and 162 µg | 1, 3 and 28 days | ||
| CNT: Mitsui7 | 18 µg, 54 µg and 162 µg | 1 and 28 days | ||
| TiO2: NRCWE-025, NRCWE-030 | 18 µg, 54 µg and 162 µg | 1, 3 and 28 days | ||
| TiO2 Sanding dust: Indoor-R, IndoornanoTiO2 | 18 µg, 54 µg and 162 µg | 1 and 28 days | ||
| TiO2: Sanding dust NRCWE-032, Sanding dust NRCWE-033 | 18 µg, 54 µg and 162 µg | 1 and 28 days | ||
| TiO2: NRCWE 001 (neutral), NRCWE 002 (positively charged) | 18 µg, 54 µg and 162 µg | 1 and 28 days | ||
| CNT: NRCWE-26, NM-401 | 18 µg, 54 µg and 162 µg | 1, 3 and 28 days |
Fig. 2A heatmap of Bi-cluster 7 for GSE61366 is presented. Biological replicates were averaged using the median. Group medians were clustered using average linkage with the 1-correlation dissimilarity metric estimated using spearman correlations.
| Organism/cell line/tissue | |
| Sequencer or array type | |
| Data format | |
| Experimental factors | |
| Experimental features | |
| Sample source location | |
| Data accessibility |