| Literature DB >> 31729955 |
Daniel J Giguere1, Jean M Macklaim2, Brandon Y Lieng2, Gregory B Gloor2.
Abstract
BACKGROUND: Differential abundance analysis is widely used with high-throughput sequencing data to compare gene abundance or expression between groups of samples. Many software packages exist for this purpose, but each uses a unique set of statistical assumptions to solve problems on a case-by-case basis. These software packages are typically difficult to use for researchers without command-line skills, and software that does offer a graphical user interface do not use a compositionally valid method.Entities:
Keywords: Compositional data; Data visualization; Differential abundance; Differential expression; Effect plots; Exploratory data analysis
Mesh:
Year: 2019 PMID: 31729955 PMCID: PMC6858670 DOI: 10.1186/s12859-019-3174-x
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Workflow of omicplotR. The data input requires samples as columns, features by rows, whereas the metadata input descriptors as columns and samples by row. Count tables can be filtered to remove samples or features with low counts. After filtering, zero-imputation and a log-ratio transform is applied to the counts. A principal component analysis (PCA) biplot is typically the first exploratory visualization used. Several other plots are available to visualize differences between samples, features, and experimental conditions. Visualizing which features and samples have been removed by filtering is also possible. Plots are stylized representations of plots that can be generated by omicplotR