| Literature DB >> 30873534 |
Kobi Snitz1, Ofer Perl1, Danielle Honigstein1, Lavi Secundo1, Aharon Ravia1, Adi Yablonka1, Yaara Endevelt-Shapira1, Noam Sobel1.
Abstract
A common goal in olfaction research is modeling the link between odorant structure and odor perception. Such modeling efforts require large data sets on olfactory perception, yet only a few of these are publicly and freely available. Given that individual odor perception may be informative on personal makeup and interpersonal relationships, we hypothesized that people would gladly provide olfactory perceptual estimates in the context of an odor-based social network. We developed a web-based infrastructure for such a network we called SmellSpace and distributed 10 000 scratch-and-sniff registration booklets each containing a subset of 12 out of 35 microencapsulated odorants. Within ~100 days, we obtained data from ~1000 participants who rated the odorants along 13 verbal descriptors. To verify that these estimates are comparable to lab-collected estimates we tested 26 participants in a controlled lab setting using the same odorants and descriptors. We observed remarkably high overall group correlations between lab and SmellSpace data, implying that this method provides for credible group-representations of odorants. We further estimated the usability of the data by applying to it two previously published models that used odorant structure alone to predict either odorant pleasantness or pairwise odorant perceptual similarity. We observed statistically significant predictions in both cases, thus further implying that the current data may be helpful toward future efforts of modeling olfactory perception from structure. We conclude that an odor-based social network is a potentially useful instrument for collecting extensive data on olfactory perception and here post the complete raw data set from the first ~1000 participants.Entities:
Keywords: odor perception; odorant descriptors; odorant pleasantness; odorant similarity
Year: 2019 PMID: 30873534 PMCID: PMC6462760 DOI: 10.1093/chemse/bjz014
Source DB: PubMed Journal: Chem Senses ISSN: 0379-864X Impact factor: 3.160
Figure 4.Descriptors were variably applied across odorants. Histograms for each of the 11 fixed descriptors as they were applied to 10 fixed odorants. Columns arranged according to increasing average column variance (from left to right) and rows according to increasing average row variance in increasing order from top to bottom (the standard deviation is color coded). In other words, “Spicy” and “Fresh-Rotten” were the most and least variably applied descriptors, respectively, and JQ and QB were the most and least variably perceived odorants, respectively.
Figure 1.Odorants were distributed by Scratch and Sniff booklets. (A) The monomolecules used in the study: 10 fixed odorants and 13 rotators (the remaining 12 odorants were mixtures) depicted within the first and second principal components of a representative physicochemical space containing ~1500 odorant molecules. (B) An odorant page from within the booklet. The microencapsulated odorant (in this case; JX) was printed on the entire page. (C) A participant sniffing the booklet. (D) Example of the VAS scale in the webpage.
The odorants used
| Concentration | |||||||
|---|---|---|---|---|---|---|---|
| Odor code | Name | CAS number | (V/V in IPM) | CID | Number of subjects | Intensity ratings | |
| 1 | JX | Laevo-fenchone | 7787-20-4 | 7.60% | 14525 | 985 | 62.6305 |
| 2 | QB | Isoamyl acetate | 123-92-2 | 25.00% | 31276 | 985 | 48.0234 |
| 3 | SD | 3-Propylidene phthalide | 17369-59-4 | 7.90% | 5373603 | 985 | 71.9421 |
| 4 | EQ | Cuminaldehyde | 122-03-2 | 13.60% | 326 | 985 | 74.0538 |
| 5 | AY | Strawberry glycidate 1 (aldehyde C-16) | 77-83-8 | 100.00% | 6501 | 985 | 56.8213 |
| 6 | XI | Nonanal (aldehyde C-9) | 124-19-6 | 100.00% | 31289 | 985 | 48.5157 |
| 7 | FL | Citral | 5392-40-5 | 100.00% | 638011 | 985 | 56.7553 |
| 8 | JQ | Skatole | 83-34-1 | 0.75% | 6736 | 985 | 53.0751 |
| 9 | ZB | Hexanol | 111-27-3 | 30.00% | 8103 | 985 | 33.9107 |
| 10 | LV | 6-Methylquinoline | 91-62-3 | 10.70% | 7059 | 985 | 51.3553 |
| 11 | KO | Musk | 1222-05-5 | 100.00% | 91497 | 65 | 46.3846 |
| 12 | JT | (Mixture) [XI, LV, SD, EQ] | 65 | 78.6154 | |||
| 13 | UV | Damascenone | 23696-85-7 | 10.00% | 5366074 | 53 | 56.8491 |
| 14 | OS | (Mixture) [EQ, ZB, QB, JX] | 53 | 71.6981 | |||
| 15 | SS | ISO E Super | 54464-57-2 | 100.00% | 108242 | 34 | 54.4412 |
| 16 | WZ | (Mixture) [JX, JQ, LV, EQ] | 34 | 75.0294 | |||
| 17 | LB | Iralia pure | 1335-46-2 | 100.00% | 5371084 | 54 | 70.2778 |
| 18 | RZ | (Mixture) [JQ, QB, LV, XI] | 54 | 61.0741 | |||
| 19 | VY | (Mixture) [JX, SD, FL, LV] | 66 | 74.303 | |||
| 20 | MH | Oxane | 59323-76-1 | 10.00% | 101010 | 78 | 82.5641 |
| 21 | SF | (Mixture) [XI, ZB, FL, JX] | 78 | 62.359 | |||
| 22 | ZI | Damscone alpha | 24720-09-0 | 10.00% | 5366077 | 94 | 74.7766 |
| 23 | WV | (Mixture) [JQ, SD, FL, LV, QB, XI, EQ, ZB, JX, AY] | 128 | 78.4922 | |||
| 24 | FE | 4-(2,6,6-trimethylcyclohexen-1-yl)but-3-en-2-one | 8013-90-9 | 26955 | 102 | 64.0294 | |
| 25 | UB | Isoamyl phenyl acetate | 102-19-2 | 100.00% | 7600 | 102 | 68.4804 |
| 26 | NB | Norlimbanol | 70788-30-6 | 25.00% | 116699 | 119 | 76.3697 |
| 27 | TJ | Thiogeraniol | 38237-00-2 | 25.00% | 6365572 | 98 | 79.6224 |
| 28 | FM | Clementine aldehide | 20407-84-5 | 5283361 | 66 | 74.7424 | |
| 29 | IF | Aldehide C7 | 111-71-7 | 10.00% | 8130 | 155 | 48.7806 |
| 30 | HG | Galaxolide | 1222-05-5 | 70.00% | 91497 | 33 | 48.9394 |
| 31 | OA | (Mixture) [ZB, JQ, QB, JX] | 119 | 67.8151 | |||
| 32 | EK | (Mixture) [EQ, SD, QB, ZB] | 98 | 77.2551 | |||
| 33 | AS | (Mixture) [XI, LV, SD, ZB] | 155 | 76.1677 | |||
| 34 | BA | (Mixture) [XI, QB, JQ, EQ] | 33 | 75.9091 | |||
| 35 | NE | IPM pure (the diluent) | 110-27-0 | 34 | 37.9412 |
All odorants used, the first 10 are the fixed odorants and the remaining are rotators, columns as follows: odorant number; 2-letter odorant code; odorant common name; odorant CAS number; odorant dilution volume-by-volume in IPM before encapsulation; compound identification number (CID) identification number; the number of subjects that rated the odorant; the average odorant intensity rating. Note that odorants KO and HG are the same molecular species, KO supplied by Sigma-Aldrich and HG by DreamAir. The mixtures are equal combinations of their components without added dilution. We printed 13 booklet types, each containing the 10 fixed odorants and 2 of the rotators. The 13 rotator pairs were: “KO+JT”; “UV+OS”; NB+OA”; “TJ+EK”; “IF+AS”; “HG+BA”; “SS+WZ”; “LB+RZ”; “FM+VY”; “MH+SF”; “ZI+WV”; “FE+UB”; “NE+WV”. We printed 800 copies of each type but the last, (“NE+WV”) of which we printed 400.
Figure 2.One thousand participants within 100 days. A histogram depicting age and gender of the participants. The insert pie chart denotes country of origin: IL = Israel; US = USA; JP = Japan; DE = Germany; CH = Switzerland; GB = Great Britain.
Figure 3.The odorants varied in perceived intensity. (A) Mean and standard error of perceived intensity for all 35 odorants used, rank-ordered by intensity. (B) Perceived intensity of the 10 fixed set odorants across 100 days of distribution (day 0 is the first day of distribution). Each point is the average for that day. The t and p values reflect a two-tailed paired t-test between the first 10 days and last 10 days.
Figure 5.Individual lab participants provided consistent ratings. (A) A histogram reflecting the frequencies of day-after-day correlations on the entire fixed set (a 110 unit vector of 11 odorants along 10 descriptors). (B) Histograms reflecting the frequencies of day-after-day correlations on the fixed set of 10 odorants across each of the 13 descriptors. (C) Histograms reflecting the frequencies of day-after-day correlations on the fixed set of 11 descriptors across each of the 10 odorants.
Figure 6.SmellSpace group data reflected lab group data. We sampled SmellSpace 1000 times, each time selecting 26 participants age and gender matched to the lab cohort. (A) A histogram reflecting the frequencies of SmellSpace-to-lab correlations on the entire fixed set (a 110 unit vector of 11 odorants along 10 descriptors). (B) Histograms reflecting the frequencies of SmellSpace-to-lab correlations on the fixed set of 10 odorants across each of the 13 descriptors. (C) Histograms reflecting the frequencies of SmellSpace-to-lab correlations on the fixed set of 11 descriptors across each of the 10 odorants.
Figure 7.Correlations with lab data as a function of SmellSpace sample size. We randomly sampled SmellSpace 1000 times for each sample size. For each sample we compared the fixed odorant set vector (without odorant ZB). The dashed line reflects a significant correlation between lab and SmellSpace. At 198 SmellSpace participants, all descriptors but “Intensity” are significantly correlated across lab and SmellSpace.
The physicochemical features for predicting pleasantness
| Desc. code | Weight | Desc. code | Weight | Desc. code | Weight |
|---|---|---|---|---|---|
| nSK | 1.262E−04 | SM05_EA(ri) | 6.255E−05 | L3p | −3.57E−06 |
| nCL | −2.980E−06 | SM06_EA(ri) | 4.799E−05 | P2p | −8.92E−06 |
| ZM1 | −9.124E−04 | SM13_EA(ri) | 6.593E−04 | E3p | 1.08E−05 |
| MAXDP | 1.732E−04 | SM11_AEA(bo) | 4.713E−06 | Tu | 2.88E−04 |
| Psi_i_s | 2.826E+00 | SM15_AEA(bo) | −2.647E−04 | Ts | 1.17E−04 |
| SRW08 | 2.014E−04 | SM03_AEA(dm) | −2.160E−04 | Ks | 5.38E−05 |
| SIC0 | 1.247E−05 | Eig15_EA(dm) | 2.592E−05 | Vs | −1.14E−01 |
| CIC5 | −2.021E−05 | Eig06_EA(ri) | −2.532E−04 | HATS0u | 5.42E−05 |
| SpDiam_D | 9.163E−05 | Eig02_AEA(bo) | 6.112E−05 | H7m | 2.78E−07 |
| H_X | −1.603E−04 | Eig06_AEA(bo) | −8.063E−05 | H0v | 5.30E−05 |
| Chi_H2 | 8.301E−05 | Eig11_AEA(bo) | 1.771E−05 | H3v | −9.47E−06 |
| SpDiam_Dt | 7.432E−05 | Mecc | 4.663E−05 | H7v | 1.14E−06 |
| H_D/Dt | −7.089E−02 | SM5_RG | 1.522E−04 | HATS0e | 4.78E−05 |
| EE_D/Dt | 1.570E−04 | SM6_RG | 1.874E−04 | H3p | −1.05E−05 |
| SpAbs_Dz(Z) | −4.421E−04 | VE3_G/D | −6.906E−06 | HATS7p | 8.95E−06 |
| WiA_Dz(e) | 3.562E−05 | VR1_G/D | −2.042E−05 | HATS8p | 7.07E−06 |
| SpAD_Dz(e) | −7.315E−02 | VR3_G/D | −3.026E−05 | HATS2i | 7.90E−05 |
| SpPos_B(m) | 2.513E−04 | TDB08m | −1.927E−06 | R7m+ | −1.08E−07 |
| SpAD_B(m) | 2.149E−04 | TDB07e | 3.715E−05 | R1e | 5.17E−05 |
| SpPosA_B(v) | 5.311E−05 | TDB07p | 5.632E−06 | R8e | 3.85E−05 |
| SM3_B(v) | 1.782E−04 | TDB09s | 6.361E−06 | R8p | 5.84E−06 |
| HyWi_B(p) | 1.113E−04 | TDB02r | 2.296E−05 | R3p+ | 4.05E−07 |
| SpPos_B(p) | 1.357E−04 | RDF155u | −1.171E−05 | R6i | 2.95E−06 |
| SM6_B(p) | 4.491E−04 | RDF035v | 3.212E−05 | DP10 | 6.86E−05 |
| VE1_B(p) | 5.796E−05 | RDF045v | −1.423E−05 | SP15 | −1.90E−04 |
| Chi_B(i) | 2.659E−05 | RDF090v | −1.362E−04 | nCt | −2.12E−04 |
| VE3_B(i) | 7.702E−07 | RDF095e | −6.829E−05 | nR#CH/X | −6.68E−07 |
| ATS5m | 9.451E−05 | RDF085p | −1.584E−04 | nRCHO | 3.14E−05 |
| ATS6v | 9.437E−05 | RDF110p | −2.197E−05 | nArCHO | 4.04E−05 |
| ATS7v | 5.203E−05 | RDF140p | −4.061E−06 | nRC=N | 8.87E−06 |
| ATS6i | 2.640E−04 | RDF015s | 8.325E−01 | nRNR2 | −2.25E−05 |
| ATS8s | 7.139E−05 | RDF080s | 8.029E−05 | nOHp | 8.62E−05 |
| ATSC7i | −9.476E−06 | RDF085s | −3.650E−03 | nHDon | −8.10E−05 |
| MATS3v | 2.435E−05 | Mor13u | 6.520E−06 | C-017 | 1.51E−04 |
| MATS3e | −2.214E−05 | Mor18u | 7.619E−06 | C-026 | −1.85E−05 |
| MATS7s | −1.483E−04 | Mor10m | −6.187E−05 | O-059 | −1.16E−04 |
| GATS4e | 1.629E−05 | Mor17m | 1.195E−05 | SdsCH | 5.53E−04 |
| GATS8i | −3.500E−05 | Mor32m | 9.089E−06 | StsC | −1.69E−06 |
| JGI4 | −1.717E−06 | Mor12v | 6.155E−07 | SdssC | 2.95E−04 |
| JGI5 | 8.027E−06 | Mor27v | −2.076E−06 | NaasC | 6.14E−05 |
| SpMax2_Bh(p) | 1.544E−04 | Mor13e | 1.362E−05 | NsOH | −5.69E−05 |
| SpMax8_Bh(p) | −9.740E−06 | Mor17p | 1.838E−-05 | NaaS | 0.00E+00 |
| SpMin4_Bh(v) | 4.138E−05 | Mor23i | 7.078E−05 | B04[C-S] | −2.11E−05 |
| P_VSA_MR_3 | 3.568E−01 | Mor26i | 2.012E−05 | B10[C-N] | 0.00E+00 |
| P_VSA_m_4 | −4.369E−01 | Mor26s | 5.735E−05 | F03[O-S] | −8.54E−05 |
| P_VSA_v_4 | 5.865E−02 | P1u | 5.349E−05 | F04[C-S] | −2.76E−05 |
| P_VSA_e_3 | −2.517E−01 | L2m | −1.499E−05 | G(O..O) | 2.79E−01 |
| P_VSA_p_2 | −5.653E−01 | G1v | 3.753E−06 | TPSA(NO) | 1.93E−04 |
| P_VSA_p_4 | 5.865E−02 | G2v | 5.278E−05 | Inflammat-80 | −1.96E−06 |
| Eta_betaS | 2.386E−05 | L1e | 2.689E−04 | Infective-80 | −1.16E−04 |
The 150 Dragon features and their associated weighting values that together enable the prediction of odorant pleasantness.
Figure 8.Predicting odorant perception from odorant structure in SmellSpace. (A) Predicting pairwise differences in monomolecular odorant pleasantness from odorant structure. Each dot is a comparison of 2 odorants. (B) Predicting odorant pleasantness from odorant structure. Each dot is a monomolecule. (C) Predicting pairwise monomolecular odorant similarity from odorant structure. Each dot is a comparison of 2 odorants.
The physicochemical features for predicting pairwise perceptual similarity
| No. | Abbreviation | Description |
|---|---|---|
| 1 | nCIR | Number of circuits (constitutional descriptors) |
| 2 | ZM1 | First Zagreb index M1 (topological descriptors) |
| 3 | GNar | Narumi geometric topological index (topological descriptors) |
| 4 | S1K | 1-path Kier alpha-modified shape index (topological descriptors) |
| 5 | piPC08 | Molecular multiple path count of order 08 (walk and path counts) |
| 6 | MATS1v | Moran autocorrelation—lag 1/weighted by atomic van der Waals volumes (2D autocorrelations) |
| 7 | MATS7v | Moran autocorrelation—lag 7/weighted by atomic van der Waals volumes (2D autocorrelations) |
| 8 | GATS1v | Geary autocorrelation—lag 1/weighted by atomic van der Waals volumes (2D autocorrelations) |
| 9 | EEig05x | Eigenvalue 05 from edge adj. matrix weighted by edge degrees (edge adjacency indices) |
| 10 | ESpm02x | Spectral moment 02 from edge adj. matrix weighted by edge degrees (edge adjacency indices) |
| 11 | ESpm03d | Spectral moment 03 from edge adj. matrix weighted by dipole moments (edge adjacency indices) |
| 12 | ESpm10d | Spectral moment 10 from edge adj. matrix weighted by dipole moments (edge adjacency indices) |
| 13 | ESpm13d | Spectral moment 13 from edge adj. matrix weighted by dipole moments (edge adjacency indices) |
| 14 | BELv3 | Lowest eigenvalue n. 3 of Burden matrix/weighted by atomic van der Waals volumes (Burden eigenvalues) |
| 15 | RDF035v | Radial distribution function—3.5/weighted by atomic van der Waals volumes (RDF descriptors) |
| 16 | G1m | First component symmetry directional WHIM index/weighted by atomic masses (WHIM descriptors) |
| 17 | G1v | First component symmetry directional WHIM index/weighted by atomic van der Waals volumes (WHIM descriptors) |
| 18 | G1e | First component symmetry directional WHIM index/weighted by Sanderson electronegativities (WHIM descriptors) |
| 19 | G3s | Third component symmetry directional WHIM index/weighted by atomic electropological states (WHIM descriptors) |
| 20 | R8u+ |
|
| 21 | nRCOSR | Number of thioesters (aliphatic) (functional group counts) |
WHIM=weighted holistic invariant molecular descriptors.
The 21 Dragon features that enable predicting perceptual similarity. This table is reproduced from (Snitz et al. 2013).