| Literature DB >> 28959395 |
Melaine A Kuenemann1, Malgorzata Szymczyk2, Yufei Chen2, Nadia Sultana2, David Hinks2, Harold S Freeman2, Antony J Williams3, Denis Fourches1, Nelson R Vinueza2.
Abstract
We present the Max Weaver Dye Library, a collection of ∼98 000 vials of custom-made and largely sparingly water-soluble dyes. Two years ago, the Eastman Chemical Company donated the library to North Carolina State University. This unique collection of chemicals, housed in the College of Textiles, also includes tens of thousands of fabric samples dyed using some of the library's compounds. Although the collection lies at the core of hundreds of patented inventions, the overwhelming majority of this chemical treasure trove has never been published or shared outside of a small group of scientists. Thus, the goal of this donation was to make this chemical collection, and associated data, available to interested parties in the research community. To date, we have digitized a subset of 2700 dyes which allowed us to start the constitutional and structural analysis of the collection using cheminformatics approaches. Herein, we open the discussion regarding the research opportunities offered by this unique library.Entities:
Year: 2017 PMID: 28959395 PMCID: PMC5605791 DOI: 10.1039/c7sc00567a
Source DB: PubMed Journal: Chem Sci ISSN: 2041-6520 Impact factor: 9.825
Number of textile dyes produced at Eastman through 1986
| Color | Number of dyes |
| Yellow | 14 979 |
| Orange | 11 076 |
| Red | 27 112 |
| Blue | 30 009 |
| Brown | 10 048 |
| Black | 921 |
Fig. 1Distribution of the set of 2196 dyes according to their color as dyes for textile fibers.
Fig. 2Distribution of different substructures present in the 2196 digitalized dyes.
Fig. 4Distribution of dyes according to their molecular weight (A) and S log P (C). Boxplot of the MW (B) and S log P (D) are shown for all dyes as well as for each individual dye color. Stars on each boxplot represent the level of significance resulting from a pairwise comparison of the particular DYE color dataset versus all others colors from *moderately significant (0.01 < P-value < 0.05), **significant (0.001 < P-value < 0.01), to ***very significant (P-value < 0.001).
Fig. 3Top-3 dyes retrieved by similarity search using two probes, Disperse Orange 3 and Disperse Red 11, known to cause contact dermatitis.
Fig. 5Circular dendrograms obtained from the hierarchical clustering of the set of 2196 dyes represented in RDKIT descriptor space. Compound nodes & names are colored according to their color (A) and their molecular weight (B).
Fig. 6Three different pairs of dyes identified as potential “color cliffs”, i.e., compounds with highly similar structures (Tanimoto coefficient ≥ 0.95 with MACCS fingerprints) but different colors.
MS validation of dyes identified as “color cliffs”
| Dye Library compound | Chemical formula | Theoretical | Experimental | Error (ppm) |
|
| C11H12N6O | 245.1145 | 245.1146 | –0.41 |
|
| C11H12N6O | 245.1145 | 245.1144 | 0.44 |
|
| C18H17N8O3Cl | 429.1185 | 429.1188 | –2.08 |
|
| C18H17N8O3Cl | 429.1185 | 429.1189 | –2.70 |
|
| C28H16S2O6 | 513.0461 | 513.0457 | 1.41 |
|
| C28H16S2O6 | 513.0461 | 513.0454 | 1.33 |
Protonated molecules were selected for this analysis.