| Literature DB >> 26531245 |
Hiromi Motegi1, Yuuri Tsuboi2, Ayako Saga1, Tomoko Kagami1, Maki Inoue1, Hideaki Toki1, Osamu Minowa1, Tetsuo Noda1,3, Jun Kikuchi2,4,5.
Abstract
There is an increasing need to use multivariate statistical methods for understanding biological functions, identifying the mechanisms of diseases, and exploring biomarkers. In addition to classical analyses such as hierarchical cluster analysis, principal component analysis, and partial least squares discriminant analysis, various multivariate strategies, including independent component analysis, non-negative matrix factorization, and multivariate curve resolution, have recently been proposed. However, determining the number of components is problematic. Despite the proposal of several different methods, no satisfactory approach has yet been reported. To resolve this problem, we implemented a new idea: classifying a component as "reliable" or "unreliable" based on the reproducibility of its appearance, regardless of the number of components in the calculation. Using the clustering method for classification, we applied this idea to multivariate curve resolution-alternating least squares (MCR-ALS). Comparisons between conventional and modified methods applied to proton nuclear magnetic resonance ((1)H-NMR) spectral datasets derived from known standard mixtures and biological mixtures (urine and feces of mice) revealed that more plausible results are obtained by the modified method. In particular, clusters containing little information were detected with reliability. This strategy, named "cluster-aided MCR-ALS," will facilitate the attainment of more reliable results in the metabolomics datasets.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26531245 PMCID: PMC4632111 DOI: 10.1038/srep15710
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Flow chart of cluster-aided multivariate curve resolution-alternating least squares (MCR-ALS).
The process of cluster-aided MCR-ALS is roughly illustrated. Details are described in the Results section.
Figure 2Results of cluster-aided multivariate curve resolution-alternating least squares (MCR-ALS) and conventional MCR-ALS.
Concentration profiles of the results of urinary data analysis. In the bar graph, the order of the samples is indicated at the bottom of the figure. B6, C57BL/6J; C3, C3H/HeJ; D2, DBA/2J; Cont, control group; HFD, high-fat-diet-fed group; Aged, aged group. Typical concentration profiles in 21 identified reliable clusters analyzed by cluster-aided MCR-ALS are shown on the left side of the figure. Six components analyzed by conventional MCR-ALS are shown on the right side. The number in parentheses indicates the cluster size. Colored clusters/components indicate that the component belongs to the same color cluster. Scales of bar graphs are in arbitrary units. The colors of the bars correspond to coefficients of variation.
Figure 3Details of the selected results of cluster-aided multivariate curve resolution-alternating least squares.
(A) Typical concentration profile and spectral profile of cluster 144_5 in urine analysis. (B) Profiles of clusters 92_1_2 and 117_4 in urine analysis. The colors of the bars correspond to coefficients of variation.
Figure 4Color-coding bar graph representation.
(A) Signals of 2.88 ppm in urine. (B) Signals of 0.94 ppm (feces). Right panels show spectral profiles of each cluster.