Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Emergent unsupervised clustering paradigms with potential application to bioinformatics.

Literature DB >> 17981579

Emergent unsupervised clustering paradigms with potential application to bioinformatics.

David J Miller¹, Yue Wang, George Kesidis.

Abstract

In recent years, there has been a great upsurge in the application of data clustering, statistical classification, and related machine learning techniques to the field of molecular biology, in particular analysis of DNA microarray expression data. Clustering methods can be used to group co-expressed genes, shedding light on gene function and co-regulation. Alternatively, they can group samples or conditions to identify phenotypical groups, disease subgroups, or to help identify disease pathways. A rich variety of unsupervised techniques have been applied, including partitional, hierarchical, graph-based, model-based, and biclustering methods. While a number of machine learning problems and tools have found mainstream applications in bioinformatics, in this article we identify some challenging problems which, though clearly relevant to bioinformatics, have not been extensively investigated in this domain. These include i) unsupervised clustering with unsupervised feature selection, ii) semisupervised learning, iii) unsupervised learning (and supervised learning) in the presence of confounding variables, and iv) stability of clustering solutions. We review recent methods which address these problems and take the position that these methods are well-suited to addressing some common scenarios that occur in bioinformatics.

Mesh：

Year: 2008 PMID： 17981579 DOI： 10.2741/2711

Source DB: PubMed Journal: Front Biosci ISSN： 1093-4715

Keyword Cloud
Cited

10 in total

Review 1. The properties of high-dimensional data spaces: implications for exploring gene and protein expression data.

Authors: Robert Clarke; Habtom W Ressom; Antai Wang; Jianhua Xuan; Minetta C Liu; Edmund A Gehan; Yue Wang
Journal: Nat Rev Cancer Date: 2008-01 Impact factor: 60.716

2. Heirarchical clustering and beyond in PCOS endometrium: brave new world.

Authors: Richard S Legro; Jan M McAllister
Journal: J Clin Endocrinol Metab Date: 2009-04 Impact factor: 5.958

Review 3. Biomarker discovery for Alzheimer's disease, frontotemporal lobar degeneration, and Parkinson's disease.

Authors: William T Hu; Alice Chen-Plotkin; Steven E Arnold; Murray Grossman; Christopher M Clark; Leslie M Shaw; Leo McCluskey; Lauren Elman; Jason Karlawish; Howard I Hurtig; Andrew Siderowf; Virginia M-Y Lee; Holly Soares; John Q Trojanowski
Journal: Acta Neuropathol Date: 2010-07-22 Impact factor: 17.088

4. Serum angiogenic profile of patients with glioblastoma identifies distinct tumor subtypes and shows that TIMP-1 is a prognostic factor.

Authors: Matthew Crocker; Sue Ashley; Ian Giddings; Vladimir Petrik; Anthea Hardcastle; Wynne Aherne; Andy Pearson; B Anthony Bell; Stergios Zacharoulis; Marios C Papadopoulos
Journal: Neuro Oncol Date: 2010-12-16 Impact factor: 12.300

5. m⁶A Regulator-Mediated Methylation Modification Patterns and Characterisation of Tumour Microenvironment Infiltration in Non-Small Cell Lung Cancer.

Authors: Yongfei Fan; Yong Zhou; Ming Lou; Xinwei Li; Xudong Zhu; Kai Yuan
Journal: J Inflamm Res Date: 2022-03-23

Emergent unsupervised clustering paradigms with potential application to bioinformatics.

Review 1. The properties of high-dimensional data spaces: implications for exploring gene and protein expression data.

2. Heirarchical clustering and beyond in PCOS endometrium: brave new world.

Review 3. Biomarker discovery for Alzheimer's disease, frontotemporal lobar degeneration, and Parkinson's disease.

4. Serum angiogenic profile of patients with glioblastoma identifies distinct tumor subtypes and shows that TIMP-1 is a prognostic factor.

5. m⁶A Regulator-Mediated Methylation Modification Patterns and Characterisation of Tumour Microenvironment Infiltration in Non-Small Cell Lung Cancer.

Review 6. Gene module level analysis: identification to networks and dynamics.

7. Discriminant analysis of Raman spectra for body fluid identification for forensic purposes.

8. caBIG VISDA: modeling, visualization, and discovery for cluster analysis of genomic data.

9. Dual Transcriptomic and Molecular Machine Learning Predicts all Major Clinical Forms of Drug Cardiotoxicity.

Review 10. Approaches to working in high-dimensional data spaces: gene expression microarrays.

Emergent unsupervised clustering paradigms with potential application to bioinformatics.

Review 1. The properties of high-dimensional data spaces: implications for exploring gene and protein expression data.

2. Heirarchical clustering and beyond in PCOS endometrium: brave new world.

Review 3. Biomarker discovery for Alzheimer's disease, frontotemporal lobar degeneration, and Parkinson's disease.

4. Serum angiogenic profile of patients with glioblastoma identifies distinct tumor subtypes and shows that TIMP-1 is a prognostic factor.

5. m6A Regulator-Mediated Methylation Modification Patterns and Characterisation of Tumour Microenvironment Infiltration in Non-Small Cell Lung Cancer.

Review 6. Gene module level analysis: identification to networks and dynamics.

7. Discriminant analysis of Raman spectra for body fluid identification for forensic purposes.

8. caBIG VISDA: modeling, visualization, and discovery for cluster analysis of genomic data.

9. Dual Transcriptomic and Molecular Machine Learning Predicts all Major Clinical Forms of Drug Cardiotoxicity.

Review 10. Approaches to working in high-dimensional data spaces: gene expression microarrays.

5. m⁶A Regulator-Mediated Methylation Modification Patterns and Characterisation of Tumour Microenvironment Infiltration in Non-Small Cell Lung Cancer.