Victor E Kuz'min, Pavel G Polishchuk, Anatoly G Artemenko, Sergey A Andronati.
Abstract
A new algorithm for interpreting Random Forest models has been developed. It calculates the contribution of each descriptor to the calculated property value. When a molecular structure is described by the simplex representation, contributions of individual atoms can also be calculated, which makes it possible to estimate the influence of separate molecular fragments on the investigated property. Such information can be used to design new compounds with a predefined property value. The proposed measure of descriptor contributions is not an alternative to Breiman's variable importance; rather, it characterizes the contribution of a particular explanatory variable to the calculated response value.
Keywords: Random Forest interpretation; Simplex representation of molecular structure
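The abstract does not spell out how the contributions are computed, so the following is only an illustrative sketch of one common way to obtain per-descriptor contributions from a regression forest, and it is not necessarily the authors' exact procedure. It assumes that each tree node stores the mean response of the training samples that reach it, and that the change in this mean at every split along a sample's path is credited to the descriptor used for that split. The nested-dictionary tree layout, the function names `tree_contributions` and `forest_contributions`, and the toy trees are all hypothetical.

```python
from typing import List, Tuple

def tree_contributions(root: dict, x: List[float], n_features: int) -> Tuple[float, List[float]]:
    """Split one tree's prediction for x into a root mean plus per-descriptor terms.

    Assumed (hypothetical) node layout: every node has a "mean"; internal nodes
    also have "feature", "threshold", "left" and "right".
    """
    contribs = [0.0] * n_features
    node = root
    while "feature" in node:  # descend until a leaf is reached
        go_left = x[node["feature"]] <= node["threshold"]
        child = node["left"] if go_left else node["right"]
        # Credit the change in node mean to the descriptor used for this split.
        contribs[node["feature"]] += child["mean"] - node["mean"]
        node = child
    return root["mean"], contribs

def forest_contributions(trees: List[dict], x: List[float], n_features: int) -> Tuple[float, List[float]]:
    """Average the root means and the per-descriptor contributions over all trees."""
    bias = 0.0
    total = [0.0] * n_features
    for tree in trees:
        tree_bias, tree_contribs = tree_contributions(tree, x, n_features)
        bias += tree_bias
        total = [acc + c for acc, c in zip(total, tree_contribs)]
    n_trees = len(trees)
    return bias / n_trees, [t / n_trees for t in total]

# Two toy trees over two descriptors (made-up numbers, for illustration only).
tree_a = {
    "mean": 5.0, "feature": 0, "threshold": 1.0,
    "left": {"mean": 3.0},
    "right": {
        "mean": 7.0, "feature": 1, "threshold": 0.5,
        "left": {"mean": 6.0},
        "right": {"mean": 8.0},
    },
}
tree_b = {
    "mean": 5.0, "feature": 1, "threshold": 0.5,
    "left": {"mean": 4.0},
    "right": {"mean": 6.0},
}

bias, contribs = forest_contributions([tree_a, tree_b], x=[2.0, 0.9], n_features=2)
prediction = bias + sum(contribs)  # equals the forest's averaged leaf value
```

Under these assumptions the forest prediction for a sample is the average root mean plus the sum of the descriptor contributions, so each contribution is specific to that sample. This is what distinguishes such a measure from a global variable importance. In a fragment-based setting, descriptor contributions could in principle be redistributed to the atoms involved in each descriptor, but that step is not sketched here.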
Year: 2011 PMID: 27467159 DOI: 10.1002/minf.201000173
Source DB: PubMed Journal: Mol Inform ISSN: 1868-1743 Impact factor: 3.353