Literature DB >> 15488750

Mapping high-dimensional data onto a relative distance plane--an exact method for visualizing and characterizing high-dimensional patterns.

R L Somorjai1, B Dolenko, A Demko, M Mandelzweig, A E Nikulin, R Baumgartner, N J Pizzi.   

Abstract

We introduce a distance (similarity)-based mapping for the visualization of high-dimensional patterns and their relative relationships. The mapping preserves exactly the original distances between points with respect to any two reference patterns in a special two-dimensional coordinate system, the relative distance plane (RDP). As only a single calculation of a distance matrix is required, this method is computationally efficient, an essential requirement for any exploratory data analysis. The data visualization afforded by this representation permits a rapid assessment of class pattern distributions. In particular, we can determine with a simple statistical test whether both training and validation sets of a 2-class, high-dimensional dataset derive from the same class distributions. We can explore any dataset in detail by identifying the subset of reference pairs whose members belong to different classes, cycling through this subset, and for each pair, mapping the remaining patterns. These multiple viewpoints facilitate the identification and confirmation of outliers. We demonstrate the effectiveness of this method on several complex biomedical datasets. Because of its efficiency, effectiveness, and versatility, one may use the RDP representation as an initial, data mining exploration that precedes classification by some classifier. Once final enhancements to the RDP mapping software are completed, we plan to make it freely available to researchers.

Mesh:

Year:  2004        PMID: 15488750     DOI: 10.1016/j.jbi.2004.07.005

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  5 in total

Review 1.  Creating robust, reliable, clinically relevant classifiers from spectroscopic data.

Authors:  R L Somorjai
Journal:  Biophys Rev       Date:  2009-11-25

Review 2.  Deriving biomedical diagnostics from NMR spectroscopic data.

Authors:  Ian C P Smith; Ray L Somorjai
Journal:  Biophys Rev       Date:  2011-03-08

Review 3.  MRS-based Metabolomics in Cancer Research.

Authors:  Tedros Bezabeh; Omkar B Ijare; Alexander E Nikulin; Rajmund L Somorjai; Ian Cp Smith
Journal:  Magn Reson Insights       Date:  2014-02-13

4.  Mapping lung cancer epithelial-mesenchymal transition states and trajectories with single-cell resolution.

Authors:  Loukia G Karacosta; Benedict Anchang; Nikolaos Ignatiadis; Samuel C Kimmey; Jalen A Benson; Joseph B Shrager; Robert Tibshirani; Sean C Bendall; Sylvia K Plevritis
Journal:  Nat Commun       Date:  2019-12-06       Impact factor: 14.919

5.  Spectral embedding finds meaningful (relevant) structure in image and microarray data.

Authors:  Brandon W Higgs; Jennifer Weller; Jeffrey L Solka
Journal:  BMC Bioinformatics       Date:  2006-02-16       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.