| Literature DB >> 34698546 |
Victoria M Anderson1,2,3, Karen L Wendt1,2,3, Fares Z Najar3,4, Laura-Isobel McCall3,5,6, Robert H Cichewicz1,2,3.
Abstract
The success of natural product-based drug discovery is predicated on having chemical collections that offer broad coverage of metabolite diversity. We propose a simple set of tools combining genetic barcoding and metabolomics to help investigators build natural product libraries aimed at achieving predetermined levels of chemical coverage. It was found that such tools aided in identifying overlooked pockets of chemical diversity within taxa, which could be useful for refocusing collection strategies. We have used fungal isolates identified as Alternaria from a citizen-science-based soil collection to demonstrate the application of these tools for assessing and carrying out predictive measurements of chemical diversity in a natural product collection. Within Alternaria, different subclades were found to contain nonequivalent levels of chemical diversity. It was also determined that a surprisingly modest number of isolates (195 isolates) was sufficient to afford nearly 99% of Alternaria chemical features in the data set. However, this result must be considered in the context that 17.9% of chemical features appeared in single isolates, suggesting that fungi like Alternaria might be engaged in an ongoing process of actively exploring nature's metabolic landscape. Our results demonstrate that combining modest investments in securing internal transcribed spacer (ITS)-based sequence information (i.e., establishing gene-based clades) with data from liquid chromatography-mass spectrometry (i.e., generating feature accumulation curves) offers a useful route to obtaining actionable insights into chemical diversity coverage trends in a natural product library. It is anticipated that these outcomes could be used to improve opportunities for accessing bioactive molecules that serve as the cornerstone of natural product-based drug discovery. IMPORTANCE Natural product drug discovery efforts rely on libraries of organisms to provide access to diverse pools of compounds. Actionable strategies to rationally maximize chemical diversity, rather than relying on serendipity, can add value to such efforts. Readily implementable biological (i.e., ITS sequence analysis) and chemical (i.e., mass spectrometry-based feature and scaffold measurements) diversity assessment tools can be employed to monitor and adjust library development tactics in real time. In summary, metabolomics-driven technologies and simple gene-based specimen barcoding approaches have broad applicability to building chemically diverse natural product libraries.Entities:
Keywords: LC-MS metabolomics; chemical diversity; drug discovery; fungi; library design; metabolomics; natural products
Year: 2021 PMID: 34698546 PMCID: PMC8547436 DOI: 10.1128/mSystems.00644-21
Source DB: PubMed Journal: mSystems ISSN: 2379-5077 Impact factor: 6.496
FIG 1Genetic and chemical clustering of Alternaria. ITS phylogeny of Alternaria isolates is shown. Inner ring indicates the clade, while color-coded stars represent the chemical cluster. The clades and clusters show remarkable overlap but also reveal a hidden chemical cluster within clade V. Numbers indicate type strain data from GenBank (Table S2).
FIG 2Chemical and geographical distribution of Alternaria. Shown is the geographical distribution of isolates by chemical cluster. Whereas clusters 1, 2, and 6 are well distributed throughout the study area, clusters 3, 4, and 5 occupy more limited ranges.
FIG 3Summary of feature diversity in Alternaria. (A) Alpha diversity of genetic clades. The median numbers of chemical features differed significantly by clade. The asterisk indicates a statistically significant difference from clade U. The double dagger indicates a statistically significant difference from clade V. The diamond indicates a statistically significant difference from clade Y. (B) Venn diagram of features by clade.
FIG 4Chemical diversity curves and data extrapolation. (A) Rarefaction curve of chemical features within Alternaria. (B) Rarefaction curves for each ITS-based clade within Alternaria.
FIG 5Scaffold diversity in Alternaria. (A) Results from molecular networking analysis constructed from LC-MS data reveal 5,754 subnetworks/scaffolds. Nodes are colored by clade. (B) Venn diagram illustrating chemical scaffolds by clade.
FIG 6Visualization of scaffold accumulation models. (A) Scaffold accumulation curve generated starting with the most abundant clade (clade U) before adding isolates from the less abundant clades. (B) Scaffold accumulation curve generated by starting with less abundant clades (clades Y, X, W, and V) before introducing isolates from the most abundant clade.