| Literature DB >> 26437641 |
H López-Fernández1,2, H M Santos3, J L Capelo4, F Fdez-Riverola5,6, D Glez-Peña7,8, M Reboiro-Jato9,10.
Abstract
BACKGROUND: Mass spectrometry is one of the most important techniques in the field of proteomics. MALDI-TOF mass spectrometry has become popular during the last decade due to its high speed and sensitivity for detecting proteins and peptides. MALDI-TOF-MS can be also used in combination with Machine Learning techniques and statistical methods for knowledge discovery. Although there are many software libraries and tools that can be combined for these kind of analysis, there is still a need for all-in-one solutions with graphical user-friendly interfaces and avoiding the need of programming skills.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26437641 PMCID: PMC4595311 DOI: 10.1186/s12859-015-0752-4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Mass-Up main workflow. Mass-Up main workflow operations and datatypes. Different colors have been used to identify input/output operations (green), preprocessing operations (orange), analysis operations (blue), and datatypes (red)
Fig. 2Quality control view. Details of the quality control analysis views for a replicates, and b samples. Box plot charts are used to summarize the more detailed information presented in the tables
Tests of independence applied depending on the number of samples and conditions
| <= 1000 samples | >1000 samples | |
|---|---|---|
| 2 conditions | Fisher’s exact test | Yates’ chi-square test |
| >2 conditions | Randomization test | Chi-square test |
Fig. 3Inter-label and intra-label biomarker discovery analysis views. a Inter-label biomarker discovery view. Depending on the number of samples and conditions, Mass-Up automatically selects the appropriate statistical test to apply. b Intra-label biomarker discovery view. Filters are configured to select only the m/z values present in the MA samples and absent in the other samples
Fig. 4PCA, clustering, and bi-clustering analysis views. a Principal component analysis view presenting three different clusters, one for each condition. b Detail of the hierarchical clustering visualization using JTreeView. The upper dendrogram automatically colors the tree branches that only include samples from the same condition, while the side dendrogram groups the more similar m/z values. c Class-biclusters of the Cancer dataset extracted with the Mass-Up Biclustering Viewer. Purple rectangles denote the existence of biclusters associated with one condition
Fig. 5Classification analysis view. Classification analysis view presenting the result of executing a Bayes Net classifier using a 10-fold cross validation scheme. The resulting confusion matrix is presented along with several statistical measurements. ROC curve corresponding to condition C of the Wine dataset is also showed