| Literature DB >> 24957895 |
Massimiliano Zanin1, David Papo2, José Luis González Solís3, Juan Carlos Martínez Espinosa4, Claudio Frausto-Reyes5, Pascual Palomares Anda6, Ricardo Sevilla-Escoboza7, Rider Jaimes-Reategui7, Stefano Boccaletti8, Ernestina Menasalvas9, Pedro Sousa10.
Abstract
In the last decade, complex networks have widely been applied to the study of many natural and man-made systems, and to the extraction of meaningful information from the interaction structures created by genes and proteins. Nevertheless, less attention has been devoted to metabonomics, due to the lack of a natural network representation of spectral data. Here we define a technique for reconstructing networks from spectral data sets, where nodes represent spectral bins, and pairs of them are connected when their intensities follow a pattern associated with a disease. The structural analysis of the resulting network can then be used to feed standard data-mining algorithms, for instance for the classification of new (unlabeled) subjects. Furthermore, we show how the structure of the network is resilient to the presence of external additive noise, and how it can be used to extract relevant knowledge about the development of the disease.Entities:
Year: 2013 PMID: 24957895 PMCID: PMC3901251 DOI: 10.3390/metabo3010155
Source DB: PubMed Journal: Metabolites ISSN: 2218-1989
Figure 1Example of calculation of the weight of a link. (Left) Lineal fit of data corresponding to control subjects and patients; (right) classification of an unlabeled subject (marked as X) into one of the two groups.
Figure 2Four examples of network representation of spectral data. Upper (bottom) networks represent control subjects (patients suffering from Glomerulonephritis).
Figure 3Analysis of the structural characteristics of networks for control subjects (green) and patients (blue). The three plots represent the histograms for (left) link density, (center) clustering coefficient, and (right) efficiency-see text for definitions.
Figure 4Histograms of the eigenvector centrality of nodes of networks represented in Figure 2, i.e., two control subjects (Left) and two GN patients (Right).
Figure 5Mean classification scores obtained by four algorithms for data sets polluted with additive noise; the proposed network-based representation is represented by black squares.
Figure 6Classification of control and leukemia subjects. (Left) Representation of the position of control subjects (green squares) and leukemia patients (blue points) in the space of network features. (Right) Classification score as a function of the binning size.
Figure 7Evolution through time of the link density of networks representing a leukemia patient under chemotherapy treatment.