Literature DB >> 20624779

Dealing with sparse data in predicting outcomes of HIV combination therapies.

Jasmina Bogojeska1, Steffen Bickel, André Altmann, Thomas Lengauer.   

Abstract

MOTIVATION: As there exists no cure or vaccine for the infection with human immunodeficiency virus (HIV), the standard approach to treating HIV patients is to repeatedly administer different combinations of several antiretroviral drugs. Because of the large number of possible drug combinations, manually finding a successful regimen becomes practically impossible. This presents a major challenge for HIV treatment. The application of machine learning methods for predicting virological responses to potential therapies is a possible approach to solving this problem. However, due to evolving trends in treating HIV patients the available clinical datasets have a highly unbalanced representation, which might negatively affect the usefulness of derived statistical models.
RESULTS: This article presents an approach that tackles the problem of predicting virological response to combination therapies by learning a separate logistic regression model for each therapy. The models are fitted by using not only the data from the target therapy but also the information from similar therapies. For this purpose, we introduce and evaluate two different measures of therapy similarity. The models are also able to incorporate phenotypic knowledge on the therapy outcomes through a Gaussian prior. With our approach we balance the uneven therapy representation in the datasets and produce higher quality models for therapies with very few training samples. According to the results from the computational experiments our therapy similarity model performs significantly better than training separate models for each therapy by using solely their examples. Furthermore, the model's performance is as good as an approach that encodes therapy information in the input feature space with the advantage of delivering better results for therapies with very few training samples. AVAILABILITY: Code of the efficient logistic regression is available from http://www.mpi-inf.mpg.de/%7Ejasmina/fastLogistic.zip.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20624779     DOI: 10.1093/bioinformatics/btq361

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  3 in total

1.  Can linear regression modeling help clinicians in the interpretation of genotypic resistance data? An application to derive a lopinavir-score.

Authors:  Alessandro Cozzi-Lepri; Mattia C F Prosperi; Jesper Kjær; David Dunn; Roger Paredes; Caroline A Sabin; Jens D Lundgren; Andrew N Phillips; Deenan Pillay
Journal:  PLoS One       Date:  2011-11-16       Impact factor: 3.240

2.  Leveraging domain information to restructure biological prediction.

Authors:  Xiaofei Nan; Gang Fu; Zhengdong Zhao; Sheng Liu; Ronak Y Patel; Haining Liu; Pankaj R Daga; Robert J Doerksen; Xin Dang; Yixin Chen; Dawn Wilkins
Journal:  BMC Bioinformatics       Date:  2011-10-18       Impact factor: 3.169

3.  The individualized genetic barrier predicts treatment response in a large cohort of HIV-1 infected patients.

Authors:  Niko Beerenwinkel; Hesam Montazeri; Heike Schuhmacher; Patrick Knupfer; Viktor von Wyl; Hansjakob Furrer; Manuel Battegay; Bernard Hirschel; Matthias Cavassini; Pietro Vernazza; Enos Bernasconi; Sabine Yerly; Jürg Böni; Thomas Klimkait; Cristina Cellerai; Huldrych F Günthard
Journal:  PLoS Comput Biol       Date:  2013-08-29       Impact factor: 4.475

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.