| Literature DB >> 20676069 |
W Shi1, M Bessarabova, D Dosymbekov, Z Dezso, T Nikolskaya, M Dudoladova, T Serebryiskaya, A Bugrim, A Guryanov, R J Brennan, R Shah, J Dopazo, M Chen, Y Deng, T Shi, G Jurman, C Furlanello, R S Thomas, J C Corton, W Tong, L Shi, Y Nikolsky.
Abstract
Gene expression signatures of toxicity and clinical response benefit both safety assessment and clinical practice; however, difficulties in connecting signature genes with the predicted end points have limited their application. The Microarray Quality Control Consortium II (MAQCII) project generated 262 signatures for ten clinical and three toxicological end points from six gene expression data sets, an unprecedented collection of diverse signatures that has permitted a wide-ranging analysis on the nature of such predictive models. A comprehensive analysis of the genes of these signatures and their nonredundant unions using ontology enrichment, biological network building and interactome connectivity analyses demonstrated the link between gene signatures and the biological basis of their predictive power. Different signatures for a given end point were more similar at the level of biological properties and transcriptional control than at the gene level. Signatures tended to be enriched in function and pathway in an end point and model-specific manner, and showed a topological bias for incoming interactions. Importantly, the level of biological similarity between different signatures for a given end point correlated positively with the accuracy of the signature predictions. These findings will aid the understanding, and application of predictive genomic signatures, and support their broader application in predictive medicine.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20676069 PMCID: PMC2920075 DOI: 10.1038/tpj.2010.35
Source DB: PubMed Journal: Pharmacogenomics J ISSN: 1470-269X Impact factor: 3.550
MAQCII data sets and classification end points
| Hamner Institutes[ | A | Chemical tumorigenesis in mouse lung | 70 | 26 | 44 |
| Entelos[ | B | Chemical nongenotoxic carcinogenesis in rat liver | 216 | 73 | 143 |
| NIEHS[ | C | Necrosis in rat liver | 214 | 79 | 135 |
| MD Anderson Cancer Center[ | D | Clinical breast cancer treatment response | 130 | 33 | 97 |
| E | Breast cancer estrogen receptor status | 130 | 80 | 50 | |
| University of Arkansas[ | F | Overall survival milestone outcome in multiple myeloma | 340 | 51 | 289 |
| G | Event-free survival in multiple myeloma | 340 | 84 | 256 | |
| H | Control parameter S1 (gender) | 340 | 194 | 146 | |
| I | Control parameter R1 (random) | 340 | 200 | 140 | |
| University of Cologne[ | J | Overall survival milestone outcome in neuroblastoma | 238 | 22 | 216 |
| K | Event-free survival milestone outcome in neuroblastoma | 239 | 49 | 190 | |
| L | Control parameter S (gender) | 246 | 145 | 101 | |
| M | Control parameter R (random) | 246 | 145 | 101 |
Analysis teams[25]
| ABT | Abbott Laboratories |
| Almac | Almac Diagnostics, UK |
| CAS | Chinese Academy of Sciences, China |
| CBC | CapitalBio Corporation, China |
| CDRH | Center for Devices and Radiological Health, FDA |
| CDRH2 | Center for Devices and Radiological Health, FDA |
| CIPF | Centro de Investigacion Principe Felipe, Spain |
| Cornell | Weill Medical College of Cornell University |
| Cornell2 | Cornell University |
| DKFZ | German Cancer Research Center, Germany |
| EPA | US Environmental Protection Agency |
| FBK | Fondazione Bruno Kessler, Italy |
| GeneGo | GeneGo Inc. |
| GHI | Golden Helix Inc. |
| GT | Georgia Institute of Technology, Emory University |
| JHSPH | Johns Hopkins Bloomberg School of Public Health |
| KU | University of Kansas |
| Ligand | Ligand Pharmaceuticals |
| NCTR | National Center for Toxicological Research, FDA |
| NIEHS | National Institute of Environmental Health Sciences |
| NWU | Northwestern University |
| Princeton | Princeton University |
| Roche | Roche Palo Alto LLC |
| SAI | Systems Analytics Inc. |
| SAS | SAS Institute Inc. |
| SDSU | South Dakota State University |
| SIB | Swiss Institute of Bioinformatics, Switzerland |
| SA | SuperArray Bioscience Corporation |
| Tsinghua | Tsinghua University, China |
| UAMS | University of Arkansas for Medical Sciences |
| UCLA | Cedars-Sinai Medical Center of UCLA |
| UIUC | University of Illinois at Urbana-Champaign |
| UML | University of Massachusetts Lowell |
| USM | University of Southern Mississippi |
| ZJU | Zhejiang University, China |
| GSK | GlaxoSmithKline |
Figure 1Protein function enrichment in signatures and unions for all end points. (a) Protein class distribution for end point A, individual signatures. (b) Protein class distribution for end point E, individual signatures. (c) Protein class distribution for union signatures for all 13 end points. (d) Protein class distribution for all end points; all signatures.
Figure 2Network topology enrichment for signatures and unions for all end points. (a) Enrichment in ‘degree'—average number of interactions per gene in signatures for end points A, E and median signatures for all end points. (b) Distribution of degrees IN and degrees OUT for end points A and E and for all end points.
Figure 3Signature similarity for end point A based on (a) feature content; (b) ontology enrichment; (c) network closeness. Using the κ statistics, we generated hierarchical trees using z-scores as a measurement of distance. Trees for the 12 other end points are given in Supplementary File 9.
Figure 4Correlation between signature similarity and model performance. (a) Correlation between signature intersection and ontological enrichment similarity (five ontologies) for 13 end points. (b) Correlation between signature similarity and model performance. The similarity was calculated by κ statistics as P-values among all submitted gene lists for each end point. MCC was used to evaluate model performance.
Figure 5Interactome analysis of signature gene lists. The most frequent upstream and downstream genes for each signature generated for end points A (a), E (b), H (c) and J (d).