Literature DB >> 33575624

Amalgams: data-driven amalgamation for the dimensionality reduction of compositional data.

Thomas P Quinn1, Ionas Erb2.   

Abstract

Many next-generation sequencing datasets contain only relative information because of biological and technical factors that limit the total number of transcripts observed for a given sample. It is not possible to interpret any one component in isolation. The field of compositional data analysis has emerged with alternative methods for relative data based on log-ratio transforms. However, these data often contain many more features than samples, and thus require creative new ways to reduce the dimensionality of the data. The summation of parts, called amalgamation, is a practical way of reducing dimensionality, but can introduce a non-linear distortion to the data. We exploit this non-linearity to propose a powerful yet interpretable dimension method called data-driven amalgamation. Our new method, implemented in the user-friendly R package amalgam, can reduce the dimensionality of compositional data by finding amalgamations that optimally (i) preserve the distance between samples, or (ii) classify samples as diseased or not. Our benchmark on 13 real datasets confirm that these amalgamations compete with state-of-the-art methods in terms of performance, but result in new features that are easily understood: they are groups of parts added together.
© The Author(s) 2019. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.

Entities:  

Year:  2020        PMID: 33575624      PMCID: PMC7671324          DOI: 10.1093/nargab/lqaa076

Source DB:  PubMed          Journal:  NAR Genom Bioinform        ISSN: 2631-9268


  4 in total

1.  Learning Sparse Log-Ratios for High-Throughput Sequencing Data.

Authors:  Elliott Gordon-Rodriguez; Thomas P Quinn; John P Cunningham
Journal:  Bioinformatics       Date:  2021-09-08       Impact factor: 6.937

2.  A prospective investigation into the association between the gut microbiome composition and cognitive performance among healthy young adults.

Authors:  Kolade Oluwagbemigun; Maike E Schnermann; Matthias Schmid; John F Cryan; Ute Nöthlings
Journal:  Gut Pathog       Date:  2022-04-19       Impact factor: 5.324

3.  Approximation of a Microbiome Composition Shift by a Change in a Single Balance Between Two Groups of Taxa.

Authors:  Vera E Odintsova; Natalia S Klimenko; Alexander V Tyakht
Journal:  mSystems       Date:  2022-05-09       Impact factor: 7.324

4.  Principal Amalgamation Analysis for Microbiome Data.

Authors:  Yan Li; Gen Li; Kun Chen
Journal:  Genes (Basel)       Date:  2022-06-24       Impact factor: 4.141

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.