| Literature DB >> 18795887 |
Jean-Philippe Vert1, Laurent Jacob.
Abstract
Support vector machines and kernel methods belong to the same class of machine learning algorithms that has recently become prominent in both computational biology and chemistry, although both fields have largely ignored each other. These methods are based on a sound mathematical and computationally efficient framework that implicitly embeds the data of interest, respectively proteins and small molecules, in high-dimensional feature spaces where various classification or regression tasks can be performed with linear algorithms. In this review, we present the main ideas underlying these approaches, survey how both the "biological" and the "chemical" spaces have been separately constructed using the same mathematical framework and tricks, and suggest different avenues to unify both spaces for the purpose of in silico chemogenomics.Entities:
Mesh:
Year: 2008 PMID: 18795887 PMCID: PMC2748698 DOI: 10.2174/138620708785739899
Source DB: PubMed Journal: Comb Chem High Throughput Screen ISSN: 1386-2073 Impact factor: 1.339
A Typology of Kernels for Proteins
| Strategy | Input Data | Examples |
|---|---|---|
| Define a list of descriptors | Sequence | Physico-chemical kernels [ |
| 3D Structure | Kernel based on 3D descriptors [ | |
| Derive a kernel from a generative model | Sequence | Fisher, TOP kernels [ |
| 3D Structure | Random walk kernels [ | |
| Derive a kernel from a measure of similarity | Sequence | Local alignment kernel [ |
| 3D Structure | Structure alignment kernel [ |
A Typology of Kernels for Small Molecules
| Strategy | Input Data | Examples |
|---|---|---|
| Use classical fingerprints of molecular descriptors | 1D to 4D structure | Tanimoto or inner products between fingerprints [ |
| Use an infinite number of descriptors and a computational trick | 2D structure | Walk kernels [ |
| 3D Structure | Pharmacophore kernel [ | |
| Use a measure of similarity | 2D structure | Optimal alignment kernel [ |