| Literature DB >> 33515233 |
Zachary B Abrams1, Caitlin E Coombes1,2, Suli Li3, Kevin R Coombes1.
Abstract
SUMMARY: Unsupervised machine learning provides tools for researchers to uncover latent patterns in large-scale data, based on calculated distances between observations. Methods to visualize high-dimensional data based on these distances can elucidate subtypes and interactions within multi-dimensional and high-throughput data. However, researchers can select from a vast number of distance metrics and visualizations, each with their own strengths and weaknesses. The Mercator R package facilitates selection of a biologically meaningful distance from 10 metrics, together appropriate for binary, categorical, and continuous data, and visualization with 5 standard and high-dimensional graphics tools. Mercator provides a user-friendly pipeline for informaticians or biologists to perform unsupervised analyses, from exploratory pattern recognition to production of publication-quality graphics. AVAILABILITY: Mercator is freely available at the Comprehensive R Archive Network (https://cran.r-project.org/web/packages/Mercator/index.html).Entities:
Year: 2021 PMID: 33515233 DOI: 10.1093/bioinformatics/btab037
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937