
Smell compounds classification using UMAP to increase knowledge of odors and molecular structures linkages.

Marylène Rugard1, Thomas Jaylet1, Olivier Taboureau2, Anne Tromelin3, Karine Audouze1.   

Abstract

This study aims to highlight the relationships between the structure of smell compounds and their odors. For this purpose, heterogeneous data sources were screened, and 6038 odorant compounds and their known associated odors (162 odor notes) were compiled, with each molecule represented by a set of 1024 structural fingerprints. Several dimensionality reduction techniques (PCA, MDS, t-SNE and UMAP), combined with two clustering methods (k-means and agglomerative hierarchical clustering, AHC), were assessed on the calculated fingerprints. The combination of UMAP with the k-means and AHC methods gave clusters that represented odors well, as well as the best visualization of the proximity of odorants on the basis of their molecular structures. The presence or absence of molecular substructures was then calculated for each odorant in order to link chemical groups to odors. The results of this analysis bring out associations between odor notes and the chemical structures of the molecules: "woody" and "spicy" notes with allylic and bicyclic structures; "balsamic" notes with unsaturated rings; both "sulfurous" and "citrus" notes with aldehydes, alcohols, carboxylic acids, amines and sulfur compounds; and "oily", "fatty" and "fruity" notes with esters and long carbon chains. Overall, UMAP combined with clustering is a promising method for suggesting hypotheses about odorant structure-odor relationships.
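The last analysis step described above, linking substructures to odors within clusters, can be illustrated with a minimal sketch. It is not the authors' implementation: the toy records, the substructure names and the function `cluster_profiles` are hypothetical. The cluster labels are assumed to come from an earlier step, such as k-means or AHC applied to a UMAP embedding. The sketch only tallies, per cluster, the fraction of members that carry each odor note and each substructure.

```python
from collections import Counter, defaultdict

# Hypothetical toy records: (cluster label, odor notes, substructures present).
# The labels are assumed to be precomputed by a clustering step.
records = [
    (0, {"fruity", "fatty"}, {"ester", "long_chain"}),
    (0, {"fruity"}, {"ester"}),
    (0, {"oily", "fatty"}, {"ester", "long_chain"}),
    (1, {"woody", "spicy"}, {"bicyclic"}),
    (1, {"woody"}, {"bicyclic", "allylic"}),
]

def cluster_profiles(records):
    """Per cluster, return the fraction of members with each odor note and substructure."""
    sizes = Counter()
    note_counts = defaultdict(Counter)
    sub_counts = defaultdict(Counter)
    for cluster, notes, subs in records:
        sizes[cluster] += 1
        note_counts[cluster].update(notes)
        sub_counts[cluster].update(subs)
    profiles = {}
    for cluster, n in sizes.items():
        profiles[cluster] = {
            "size": n,
            "notes": {k: c / n for k, c in note_counts[cluster].items()},
            "substructures": {k: c / n for k, c in sub_counts[cluster].items()},
        }
    return profiles

profiles = cluster_profiles(records)
for cluster in sorted(profiles):
    p = profiles[cluster]
    print(cluster, p["size"], sorted(p["notes"].items()), sorted(p["substructures"].items()))
```

A note or substructure whose fraction is high in one cluster and low elsewhere is a candidate structure-odor association, which would still need checking against the full data set.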


Year: 2021    PMID: 34048487    DOI: 10.1371/journal.pone.0252486

Source DB: PubMed    Journal: PLoS One    ISSN: 1932-6203    Impact factor: 3.240


  3 in total

Review 1.  Synthesis of Cyclic Fragrances via Transformations of Alkenes, Alkynes and Enynes: Strategies and Recent Progress.

Authors:  Zhigeng Lin; Baoying Huang; Lufeng Ouyang; Liyao Zheng
Journal:  Molecules       Date:  2022-06-02       Impact factor: 4.927

2.  A topological data analysis-based method for gait signals with an application to the study of multiple sclerosis.

Authors:  Alexandre Bois; Brian Tervil; Albane Moreau; Aliénor Vienne-Jumeau; Damien Ricard; Laurent Oudre
Journal:  PLoS One       Date:  2022-05-13       Impact factor: 3.752

3.  Biocatalytic Production of Aldehydes: Exploring the Potential of Lathyrus cicera Amine Oxidase.

Authors:  Elisa Di Fabio; Alessio Incocciati; Alberto Boffi; Alessandra Bonamore; Alberto Macone
Journal:  Biomolecules       Date:  2021-10-18
