Literature DB >> 10551499

Visualisation of biomedical datasets by use of growing cell structure networks: a novel diagnostic classification technique.

A J Walker1, S S Cross, R F Harrison.   

Abstract

BACKGROUND: Medical research produces large multivariable datasets that are difficult to visualise and interpret intuitively. We describe a novel growing cell structure (GCS) technique that compresses multidimensional datasets into two dimensional maps with colour overlays that can be visually interpreted.
METHODS: The two-dimensional map is self-discovered from the training set by distribution of cases to different nodes according to similarity between the cases at each node. Nodes are added to the map until there is no further significant reduction in error. The Parzen window method is used to estimate the probability distribution of the training cases, and this probability is converted to posterior class probabilities by use of Bayes' theorem. Classification performance can be assessed by means of receiver operating characteristic (ROC) curves. Colour maps of the values of each input variable at each node are constructed, which illustrate the relation between each input variable and the overall distribution of cases in the network map.
FINDINGS: From a dataset of 11 input variables from 692 fine-needle aspirate samples from breast lesions, a 32-node network produced an area under the ROC curve of 0.96, which was not significantly different from that for logistic regression (0.98, z=1.09, p>0.05). Colour maps of the input variables showed that some variables had discrete distributions over exclusively benign or malignant areas of the network, and were thus discriminant, whereas others, such as foamy macrophages, covered both benign and malignant regions.
INTERPRETATION: This technique produces dimensional compression that allows multidimensional data to be displayed as two-dimensional colour images. This envisioning of information allows the highly developed visuospatial abilities of human observers to perceive subtle inter-relations in the dataset.

Entities:  

Mesh:

Year:  1999        PMID: 10551499     DOI: 10.1016/S0140-6736(99)02186-8

Source DB:  PubMed          Journal:  Lancet        ISSN: 0140-6736            Impact factor:   79.321


  4 in total

1.  Observer accuracy in estimating proportions in images: implications for the semiquantitative assessment of staining reactions and a proposal for a new system.

Authors:  S S Cross
Journal:  J Clin Pathol       Date:  2001-05       Impact factor: 3.411

2.  Discriminant histological features in the diagnosis of chronic idiopathic inflammatory bowel disease: analysis of a large dataset by a novel data visualisation technique.

Authors:  S S Cross; R F Harrison
Journal:  J Clin Pathol       Date:  2002-01       Impact factor: 3.411

3.  d-matrix - database exploration, visualization and analysis.

Authors:  Dominik Seelow; Raffaello Galli; Siegrun Mebus; Hans-Peter Sperling; Hans Lehrach; Silke Sperling
Journal:  BMC Bioinformatics       Date:  2004-10-28       Impact factor: 3.169

4.  Evaluation of the diagnostic power of thermography in breast cancer using Bayesian network classifiers.

Authors:  Cruz-Ramírez Nicandro; Mezura-Montes Efrén; Ameca-Alducin María Yaneli; Martín-Del-Campo-Mena Enrique; Acosta-Mesa Héctor Gabriel; Pérez-Castro Nancy; Guerra-Hernández Alejandro; Hoyos-Rivera Guillermo de Jesús; Barrientos-Martínez Rocío Erandi
Journal:  Comput Math Methods Med       Date:  2013-05-22       Impact factor: 2.238

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.