| Literature DB >> 30221212 |
Vincenzo Belcastro1, Carine Poussin1, Yang Xiang1, Maurizio Giordano2, Kumar Parijat Tripathi2, Akash Boda1, Stéphanie Boué1, Mario Guarracino2, Florian Martin1, Manuel C Peitsch1, Julia Hoeng1, Roberto Romero3,4,5,6, Adi L Tarca7,8, Zhongqu Duan9,10, Hao Yang9,11, Xiaofeng Gong9,10, Peixuan Wang9,10, Chenfang Zhang9,10, Wenxin Yang9,11, Omer Sinan Sarac12, Ismail Bilgen12, Ali Tugrul Balci12, Rahul Kumar13, Sandeep Kumar Dhanda14.
Abstract
Cigarette smoking entails chronic exposure to a mixture of harmful chemicals that trigger molecular changes over time, and is known to increase the risk of developing diseases. Risk assessment in the context of 21st century toxicology relies on the elucidation of mechanisms of toxicity and the identification of exposure response markers, usually from high-throughput data, using advanced computational methodologies. The sbv IMPROVER Systems Toxicology computational challenge (Fall 2015-Spring 2016) aimed to evaluate whether robust and sparse (≤40 genes) human (sub-challenge 1, SC1) and species-independent (sub-challenge 2, SC2) exposure response markers (so called gene signatures) could be extracted from human and mouse blood transcriptomics data of current (S), former (FS) and never (NS) smoke-exposed subjects as predictors of smoking and cessation status. Best-performing computational methods were identified by scoring anonymized participants' predictions. Worldwide participation resulted in 12 (SC1) and six (SC2) final submissions qualified for scoring. The results showed that blood gene expression data were informative to predict smoking exposure (i.e. discriminating smoker versus never or former smokers) status in human and across species with a high level of accuracy. By contrast, the prediction of cessation status (i.e. distinguishing FS from NS) remained challenging, as reflected by lower classification performances. Participants successfully developed inductive predictive models and extracted human and species-independent gene signatures, including genes with high consensus across teams. Post-challenge analyses highlighted "feature selection" as a key step in the process of building a classifier and confirmed the importance of testing a gene signature in independent cohorts to ensure the generalized applicability of a predictive model at a population-based level. In conclusion, the Systems Toxicology challenge demonstrated the feasibility of extracting a consistent blood-based smoke exposure response gene signature and further stressed the importance of independent and unbiased data and method evaluations to provide confidence in systems toxicology-based scientific conclusions.Entities:
Keywords: Systems toxicology; blood biomarkers; computational challenge; gene signature; smoking biomarker
Year: 2017 PMID: 30221212 PMCID: PMC6136260 DOI: 10.1016/j.comtox.2017.07.004
Source DB: PubMed Journal: Comput Toxicol ISSN: 2468-1113