| Literature DB >> 34788796 |
Thomas Naake1, Wolfgang Huber1.
Abstract
MOTIVATION: First-line data quality assessment and exploratory data analysis are integral parts of any data analysis workflow. In high-throughput quantitative omics experiments (e.g. transcriptomics, proteomics, metabolomics), after initial processing, the data are typically presented as a matrix of numbers (feature IDs × samples). Efficient and standardized data-quality metrics calculation and visualization are key to track the within-experiment quality of these rectangular data types and to guarantee for high-quality data sets and subsequent biological question-driven inference.Entities:
Year: 2021 PMID: 34788796 PMCID: PMC8796383 DOI: 10.1093/bioinformatics/btab748
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Examples of MatrixQCvis functionality and user interface. (a) MA plot of human plasma proteomics samples identifying a dependence between M and A values for Sample 6 indicating problems with its data. More visualizations using the clinical datasets of Jiang and Brueffer are shown in the Supplementary Material. (b) MatrixQCvis builds upon the SummarizedExperiment S4 class, a container for assay data (e.g. proteomics intensity values) and associated metadata on the features and samples. The figure is adjusted from the vignette of the SummarizedExperiment package. (c) Sidebar panel. MatrixQCvis enables to interactively normalize, transform, perform batch correction and impute the dataset. Furthermore, samples can be excluded or selected. In the shown example, phosphate-buffered saline samples are excluded. (d) Main panel. Navigation within MatrixQCvis is realized by browsing through tabs. Each visualization is embedded within a dedicated tab