Literature DB >> 27595045

Identifying Plant-Human Disease Associations in Biomedical Literature: A Case Study.

Vivekanand Sharma1, Wayne Law2, Michael J Balick2, Indra Neil Sarkar1.   

Abstract

The impact of ethnobotanical data from surveys of traditional medicinal uses ofplants can be enhanced through the validation of biomedical knowledge that may be embedded in literature. This study aimed to explore the use of informatics approaches, including natural language processing and terminology resources, for extracting and comparing ethnobotanical leads from biomedical literature indexed in MEDLINE. Using ethnobotanical data for plant species described in Primary Health Care Manuals of the Micronesian islands of Palau and Pohnpei, the results of this study were done relative to disease concepts from the "Mental, Behavioral And Neurodevelopmental Disorders " ICD-9-CM category. The results from this feasibility study suggest that informatics methods can be used to extract and prioritize relevant ethnobotanical information from biomedical knowledge literature.

Entities:  

Year:  2016        PMID: 27595045      PMCID: PMC5009952     

Source DB:  PubMed          Journal:  AMIA Jt Summits Transl Sci Proc


Introduction

Understanding the use of plants by cultures around the world, the science of ethnobotany, has been shown as a potential source for the identification of new therapies (1, 2). Depending on the disease state being studied, searches for bioactive components that are driven by ethnobotanical knowledge are more effective than random selection of plant species (3); however, the process of cataloguing putative medicinal uses of plants, such as those that are compiled in field surveys, often depends on a labor intensive analysis of previous related ethnobotanical explorations and biomedical literature (4). Extensive searches of biomedical literature are also an essential aspect of the ethnobotanical pipeline for documenting and evaluating the pharmacological relevance of collected indigenous information. The extraction of meaningful information from ethnobotanical and biomedical texts thus remains an essential task in the cataloguing of plants with potential medicinal properties (5). Ethnobotanical research focuses on organizing information, including therapeutic applications, about the historical and contemporary interactions between plants and traditional societies (6). Ethnobotanists aim to capture traditional botanical knowledge by interacting with people who are knowledgeable about plant uses, often but not limited to indigenous populations. Various qualitative and quantitative methods are employed to gather useful evidence pertaining to plant use patterns. Plant voucher specimens are also collected and taxonomic studies conducted for correct identification and nomenclature of plant species (7). Subsequent validation of potential plants with therapeutic properties is accomplished through a combination of systematic searches of literature, existing collections of herbaria (collections of preserved plant specimens), and ethnobotanical surveys (8). The study of the diversity of plants and their uses across various cultures may reveal essential patterns that provide insights to potential therapies. For example, a cross-cultural ethnomedicinal evaluation revealed similar uses of phylogenetically related plant species (9). However, to date, the adoption of informatics approaches have been slow in their application in the field of ethnobotany (10). There have been significant advances in Natural Language Processing (NLP) tools and techniques for addressing the issues of variability, ambiguity, and context-dependent interpretation (11). Tools like MetaMap (12) and the National Center for Biomedical Ontology (NCBO) Annotator (13) have been shown to be effective in identifying biomedical concepts from free text (14, 15), including the extraction and ranking of key associations between biomedical entities from biomedical corpora (16 –19). Several approaches have been developed for automating the extraction of entity relations and inferring new or hidden relations from biomedical text. Co-occurrence statistics based methods are commonly used to extract relations among biomedical entities (20). This approach relies on the assumption that two biomedical entities co-occur within the scope of a given text are likely to be related (21). In addition to the co-occurrence based methods, rule-based, statistical, machine learning and NLP based methods have also been used to extract relations from biomedical literature (22 –26). For example, systems like BioMedLEE (27) and SemRep (28) have been developed for extracting relations between entities using syntactic and linguistic analysis. Use of such systems for integrating semantic relations with co-occurrence have also been explored (29). Methods commonly used for extracting entity relations from text has been reviewed by Zweigenbaum, et al. (30). This preliminary study aimed to demonstrate the potential to leverage existing NLP approaches for extracting and ranking disease associations for plants that have been the focus of ethnobotanical surveys of the Micronesian islands of Palau and Pohnpei and compiled in Primary Health Care Manuals (32, 33). The plants from these Micronesian islands are known for botanical endemism, and are the focus of an NIH-funded project to develop computational methods for identifying and validating potential therapeutic knowledge about plants. A co-occurrence based metric was use to rank the plant-associated disease concepts from both MEDLINE and the Primary Health Care Manuals for Palau and Pohnpei. The identified associations from these two sources were compared to highlight known ethnobotanical uses that have been evaluated in indexed biomedical literature as well as uses that may be unique. The evaluation for this study focused on potential therapeutic uses of plants for the ICD-9-CM Mental, Behavioral and Neurodevelopmental Disorders category. This specific focus was chosen as such conditions may be related to stress-related suicide, for which Micronesia has amongst the highest rates in the world (Department of Public Health and Social Security) (32). The results, which included the identification of several putative therapeutic uses of plant species, reveal the applicability of informatics approaches for supporting large-scale comparative analysis of ethnobotanical knowledge within biomedical literature. The results also suggest that indigenous knowledge may be used to guide the identification of bioactive plant species within a western medicinal context.

Methods

The goal of this study was to identify disease associations of plants identified within the Primary Health Care Manuals (PHCMs) of Palau and Pohnpei (32, 33). Processing articles related to these plants as identified from MEDLINE using PubMed provided the basis for comparison of identified associations from the Primary Health Care Manuals. A general overview of the process developed for this study is depicted in Figure 1.
Figure 1:

Study Overview. MEDLINE citations accessed from PubMed containing plants from the Primary Health Care Manuals (PHCMs) for Palau and Pohnpei and corresponding descriptions in the PHCMs were processed using MetaMap. Plant associations with UMLS identified concepts were scored and filtered by the “Disorders” UMLS Semantic Group. Manual evaluation focused on disease concepts that could be mapped to the ICD-9-CM category “Mental, Behavioral And Neurodevelopmental Disorders (290-319).”

Biomedical concept association with plants from Micronesia. A list of plants was attained from the Primary Health Care Manuals (PHCMs) of Palau (32) and Pohnpei (33), which were the result of ethnobotanical explorations carried out in the Micronesian Islands of Palau and Pohnpei. A Ruby script that leveraged Entrez e- utils was used to query MEDLINE from PubMed with each plant’s scientific name. The titles and abstracts of the resulting set of identified MEDLINE citations were extracted using e-utils, and processed using the MetaMap Java API (34). The Unified Medical Language System (UMLS) concepts identified by MetaMap were parsed from the machine output and the association scores between a given plant and UMLS concept calculated using the following equation:where f, is the frequency of concept ‘c’ co-occurring with plant ‘p’, N is the total number of plants and n is the number of plants co-occurring with the given concept ‘c’. These scores were then normalized using the following formula: where S(p,c), is the score of a given plant-concept association and max(Score(p,c)) is the maximum score among all the concepts associated with a given plant p. The descriptions from the PHCMs, which consisted of one plant per document, were processed using MetaMap and the resulting plant-concept associations were scored as with MEDLINE articles using Equations 1 and 2. Mapping to ICD-9-CM categories. Concepts were filtered based on those that belonged to the “Disorders” UMLS Semantic Group and mapped to corresponding ICD-9-CM codes by querying the UMLS MRCONSO table (these were referred to as “direct mappings”). If no direct mapping was possible for a given concept, related concepts were retrieved from UMLS MRREL table (including all relationship types) and then mapped to ICD-9-CM (these were referred to as “ inferred mappings”). Evaluation. The validity of the results focused on those plant associations with concepts that belonged the ICD-9- CM category “Mental, Behavioral And Neurodevelopmental Disorders (290-319).” The evaluation involved manually comparing the MetaMap -identified top scoring plant-concept associations from MEDLINE with those identified from the PHCMs.

Results

Plant-concept associations. From the PHCMs a total of 180 unique plants were identified, for which 129 could be identified in MEDLINE. In total there were 19,798 citations identified from MEDLINE that had at least one mention of a plant from the PHCMs, out of which 18,322 contained associated UMLS concepts. A total of 22,425 and 310,155 plant-concept co-occurrences were identified from the PHCMs and MEDLINE datasets, respectively, with 7,521 associations in common. The organizing and filtering of associations based on the semantic group ‘Disorders’ resulted in 2,106 and 24,517 associations for PHCMs and MEDLINE datasets, respectively, with 537 associations in common (Figure 2). Comparative statistics of plant-wise associations across different semantic types belonging to the semantic group ‘Disorders’ is presented in Supplemental Table 1. The score comparison of plant-concept associations across different semantic types of semantic group ‘Disorders’ is presented in Supplemental Table 2. Similar statistics for the ICD-9-CM categories are provided in Supplemental Table 3 and 4.
Figure 2:

Identified plant and “Disorder” UMLS Semantic Group concept associations.

Table 1:

Distribution of Plant-Human Disease Concept Associations Across ICD-9-CM Categories. The disease concept counts include all associations that could be mapped to ICD-9-CM codes (either through direct or inferred mapping).

CodeCategoryMEDLINEBothPHCM
001-139Infectious and parasitic diseases132122109
140-239Neoplasms167626110
240-279Endocrine, nutritional and metabolic diseases, and immunity disorders1126219
280-289Diseases of the blood and blood-forming organs353434
290-319Mental, behavioral and neurodevelopmental disorders635327
320-389Diseases of the nervous system and sense organs119323109
390-459Diseases of the circulatory system651832
460-519Diseases of the respiratory system259221
520-579Diseases of the digestive system9932581
580-629Diseases of the genitourinary system506630
630-679Complications of pregnancy, childbirth, and the puerperium12404
680-709Diseases of the skin and subcutaneous tissue4631080
710-739Diseases of the musculoskeletal system and connective tissue226127
740-759Congenital anomalies50718
760-779Certain conditions originating in the perinatal period9200
780-799Symptoms, signs, and ill-defined conditions181169294
800-999Injury and poisoning80965200
Table 3:

.” Shown for each top hit are the Disease Concept, UMLS CUI, ICD Category (only for associations across All ICD-9-CM Categories), Rank Score, and Manual Evaluation (True/False) along with citation used for evaluation.

MEDLINE[1] PHCM[2]
All ICD-9-CM Categories290-319 ICD-9-CM CategoryAll ICD-9-CM Categories290-319 ICD-9-CM Category
Ageratum conyzoides L.Vein Disorder (C0235522) (390-459) Score: 0.0833 False [PMID: 10544139]Impotence (C0242350) Score: 0.0130 False [PMID: 17362507]Pallor (C0030232) (780-799) Score: 0.5531 False [Pl:pp 37]Stress (C0038435) Score: 0.1225 False [Pl:pp 37]
Allophylus timoriensis (DC.) BlumeNo association identifiedNo association identifiedPrecursor T-Cell Lymphoblastic Leukemia-Lymphoma (C1961099) (280-289) Score: 0.3559 False [Ph:pp 32]Stress (C0038435) Score: 0.0063 True [Ph:pp 106]
Areca catechu L.Fibroses, Oral Submucous (C0029172) (520-579) Score: 0.0909 False [PMID: 26336810]Schizophrenias (C0036341) Score: 0.0682 True [PMID 19748131]Rubor (C0332575) (780-799) Score: 0.6111 False [Ph:pp 121]Acute Psychosis (C0281774) Score: 0.0208 False [Ph:pp 121]
Calophyllum innophyllum L.No association identifiedNo association identifiedCarcinogenesis (C0596263) (140-239) Score: 0.8182 True [Pl:pp 103]Stress (C0038435) Score: 0.0221 True [Pl:pp 103]
Centella asiatica (L.) Urb.Oxidative Stress (C0242606) (760-779) Score: 0.0544 True [PMID: 25633675]Anxiety Disorder (C0003469) Score: 0.0259 True [PMID: 22841896]Dysentery (C0013369) (001-139) Score: 0.8811 True [Ph:pp 30]Hand Rubbing (C0239846) Score: 0.1006 False [Ph:pp 30]
Citrus limon (L.) Burm. f.Facial dysmorphism, immunodeficiency, livedo, and short stature (C3554576) (240-279) Score: 0.0521 False [PMID: 12231681]Stress (C0038435) Score: 0.0159 True [PMID 26050208]Tachycardia (C0039231) (780-799) Score: 0.1539 True [Ph:pp 117]Stupor (C0085628) Score: 0.0769 True [Ph:pp 117]
Clerodendrum inerme (L.) GaertnTic Dis Motor (C0751554) (290-319) Score: 0.1231 True [PMID: 19617461]Tic Dis Motor (C0751554) Score: 0.1231 True [PMID: 19617461]Asthma (C0004096) (460-519) Score: 1.0000 True [Ph:pp 99]Stress (C0038435) Score: 0.0210 True [Ph:pp 99]
Cyathula prostrata (L.) BlumeAscites (C0003962) (780-799) Score: 0.27017 True [PMID: 23870465]No association identifiedThin hair (C0423867) (680-709) Score: 0.3428 False [Ph:pp 10]Hand Rubbing (C0239846) Score: 0.0571 False [Ph:pp 70]
Glochidion ramiflorum J. R. Forst. & G. Forst.No association identifiedNo association identifiedJoint Pain Adverse Event (C1963066) (710-739) Score: 0.2338 Arthralgia (C0003862) (710-739) Score: 0.2338 True [Ph:pp 150]Stress (C0038435) Score: 0.0284 True [Ph:pp 102]
Hibiscus tiliaceus L.Disorder, Puerperal (C0034040) (580-629) Score: 0.0608 True [PMID: 22494845]Disease, Seitelberger’s (C0270724) Score: 0.0308 False [PMID: 16701930]Diarrhea (C0011991) (780-799) Score: 0.1961 True Ph:pp 27Stress (C0038435) Score: 0.1218 True [Ph:pp 99]
Horsfieldia irya (Gaertn.) WarbNo association identifiedNo association identifiedFainting (C0039070) (780-799) Score: 0.2802 True [Pl:pp 104]Stress (C0038435) Score: 0.0682 True [Pl:pp 104]
Ipomoea littoralis BlumeNo association identifiedNo association identifiedStaphylococcus aureus infections (C1318973) (001-139) Score: 0.2498 True [Ph:pp 45]Hand Rubbing (C0239846) Score: 0.0555 False [Ph:pp 142]
Ipomoea mauritiana Jacq.Diastasis (C0036679) (320-389) Score: 0.1155 False [PMID: 25050305]No association identifiedMuscle Cramps (C0026821) (320-389) Score: 0.5835 True [Ph:pp 133]Hand Rubbing (C0239846) Score: 0.1110 False [Ph:pp 133]
Ixora casei HanceDrug Tolerance (C0013220) (290-319) Score: 0.3393 False [PMID: 19283052]Drug Tolerance (C0013220) Score: 0.3393 False [PMID: 19283052]Dysmenorrhea (C0013390) (580-629) Score: 0.1805 True [Pl:pp 135]Hand Rubbing (C0239846) Score: 0.0333 False [Ph:pp 39]
Kyllinga brevifolia Rottb Catatonia (C0007398) (780-799) Score: 0.2500 True [PMID: 10473172]Stress (C0038435) Score: 0.0509 True [PMID: 10473172]Viral infection (C0042769) (780-799) Score: 1.0000 True [Ph:pp 49]Hand Rubbing (C0239846) Score: 0.0238 False [Ph:pp 69]
Nephrolepis obliterata (R. Br.) J. Sm.NANAC0013369: Dysentery (001-139) Score: 0.3231 True Ph:pp 31C0239846: Hand Rubbing Score: 0.0369 False [Ph:pp 30]
Paraderris elliptica (Wall.) AdemaIGS (C1306856) (520-579) Score: 0.6688 False [PMID: 23144360]No association identifiedConjunctivitides (C0009763) (320-389) Score: 0.8182 True [Pl:pp 62]Depressions (C0011570) Score: 0.0584 False [Pl:pp 62]
Phaleria nisidai KanehiraFallot Tetralogy (C0039685) (740-759) Score: 0.19448 False [PMID: 23144360]No association identifiedTumor Mass (C3273930) (140-239) Score: 0.7007 True [Pl:pp 105]Stress (C0038435) Score: 0.0284 True [Pl:pp 105]
Phyllanthus palauensis Hosok.No association identifiedNo association identifiedFlushing (C0016382) (780-799) Score: 0.5000 False [Pl:pp 105]Stress (C0038435) Score: 0.0406 True [Pl:pp 105]
Piper methysticum G. Forst.Anxiety Disorder (C0003469) (290-319) Score: 0.1796 True [PMID: 23635869]Anxiety Disorder (C0003469) Score: 0.1796 True [PMID: 23635869]Skin Discoloration (C0151907) (680-709) Score: 0.1250 True Ph:pp 105 [toxicity]Anxiety Disorder (C0003469) Score: 0.0417 True [Ph:pp 105]
Premna serratifolia L.Myopathy (C0026848) (320-389) Score: 0.1114 True [PMID: 23407688]Stress (C0038435) Score: 0.0339 True [PMID: 23244417]Nodule (C0028259) (140-239) Score: 0.4927 False [Ph:pp 51]Hand Rubbing (C0239846) Score: 0.0035 False [Ph:pp 51]
Solenostemon scutellarioides (L.)Prostration (C0277794) (320-389) Score: 0.1175 False [PMID: 18603655]Stress (C0038435) (290-319) Score: 0.0239 False [PMID: 2586264]Rash (C0015230) (780-799) Score: 1.0000 True [Ph:pp 55]Depressions (C0011570) Score: 0.1922 False [Ph:pp 55]

Evaluation based on PHCMs, page numbers in brackets (Pl = PHCM of Palau; Ph = PHCM of Pohnpei);

Evalation based on biomedical literature, PubMed identifier in brackets

Distribution across ICD-9-CM disease categories. The plant-associated disorder concepts were mapped into 17 ICD-9-CM categories (Table 1). Out of total 526 concepts from the PHCMs dataset, 337 (64%) were mapped to ICD-9-CM codes (103 direct and 234 inferred). A total of 3,189 out of 5,393 concepts (59%) from the MEDLINE dataset were mapped to ICD-9-CM codes (786 direct and 2403 inferred). Associations from the MEDLINE dataset were represented in all 17 categories, while those from PHCMs dataset were represented in 16 categories (all except “Certain Conditions Originating In The Perinatal Period’ [760-779]). Evaluation. There were 27 disorder concept associations for 22 plants identified from the PHCMs dataset belonging to ICD-9-CM category “Mental, Behavioral And Neurodevelopmental Disorders (290-319),” comprised of seven unique UMLS concepts: Stress (C0038435), Acute Psychosis (C0281774), Hand Rubbing (C0239846), Stupor (C0085628), Depression (C0011570), Depressive disorder (C0011581), and Anxiety disorder (C0003469). By contrast, there were 635 disease concept associations from MEDLINE belonging to the same ICD-9-CM category comprised of 145 UMLS concepts. Table 2 summarizes the number of associations manually identified as correct. For the selected set of plants, there were eight true associations out of 14 top scoring associations from MEDLINE dataset across all disease categories. Sixteen out of the 22 top scoring associations from PHCM dataset were also deemed to be valid across all disease categories. Seven out of 11 and 10 out of 22 top scoring associations from the “290-319” category were true for MEDLINE and PHCM dataset, respectively. Table 3 summarizes the manual evaluation of the top scoring predicted therapeutic use for MetaMap identified disease concepts from the PHCMs and MEDLINE. Within the chosen ICD-9-CM category, six plants were only identified in the PHCMs with therapeutic use but not in MEDLINE: Glochidion ramiflorum, Horsfieldia irya, Phaleria nisidai, Calophyllum innophyllum, and Phyllanthus palauensis. There were 111 plants identified in MEDLINE in categories that did not have corresponding PHCM association for the chosen ICD-9-CM category.
Table 2:

Counts of True Associations Identified from MEDLINE and PHCMs

All ICD-9-CM Disease CategoriesICD-9-CM 290-319 Category
MEDLINE8/147/11
PHCMs16/2210/22

Discussion

Along with continued progress of ethnobotanical surveys of indigenous and other populations around the globe, there are increasing efforts to digitize historical texts (35) and other documentation of traditional knowledge that may include descriptions of therapeutic applications of plant species (36). Informatics methodologies may be used to connect such cultural knowledge that has remained historically isolated from contemporary biomedical knowledge sources, such as biomedical literature. Information regarding the historical use of plants may potentially reflect the efficacy and safety of their use. The current regulatory guidelines established by the Center for Drug Evaluation and Research (CDER) for botanicals encourages submission of documentation of prior human experience for preliminary safety assessments (37). Such information may also provide relevant background for conducting search for chemical drugs as well as in designing appropriate clinical studies for evaluation (38). This study aimed to develop an informatics approach for enabling comparison and evaluation of potential therapeutic information documented in PHCMs of Palau and Pohnpei in light of potentially supporting evidence within MEDLINE. From this study, several potentially interesting therapeutic applications were highlighted based on the scoring metric (Supplemental Tables 1–4). This scoring metric was inspired by the tf-idf (term frequency-inverse document frequency) metric. An advantage of using the tf-idf weighing strategy is that it is able to highlight rare interactions and filter noisy datasets by deprioritizing trivial relationships. Future work may involve additional statistical comparisons of additional scoring metrics (e.g., Mutual Information Measure, Association rules). The identified concepts were organized according to the UMLS Semantic Network, which reduces the more than 1 million concepts in the UMLS Metathesaurus into 133 semantic types (39) that are further grouped into 15 semantic groups (40). Using the UMLS Semantic Network helped focus the study by grossly filtering identified associations into a single semantic group, “Disorders.” It is important to note, however, that it is often challenging to correlate the diagnosis descriptions from indigenous or historical sources into canonical medical concepts (37). For example, within the Herbarium Ambionense (41), a compilation published in 1741 about plants from Amboina (a geographic region within modern day Indonesia) gonorrhea is described as “fire-piss.” The International Classification of Diseases (ICD) provides a set of standardized codes for describing diagnosis maintained by World Health Organization (42). ICD-9 Clinical Modification (ICD-9-CM) is the adaptation of the ninth revision of ICD created by the U.S. National Center for Health Statistics (NCHS), Center for Disease Control (CDC) (43). By mapping disease concepts to the ICD classification, a broader picture of potential therapeutic applications of plants was possible in this study. For the purposes of this feasibility study, the evaluation of identified potential therapies was limited to the ICD-9-CM category “Mental, Behavioral And Neurodevelopmental Disorders (290-319).” Six out of 22 plant species from Palau and Pohnpei showed promising medicinal applications for the ICD-9-CM evaluation category based on MEDLINE indexed literature beyond the original applications described in the Primary Health Care Manuals. Association between Areca catechu (betel nut) and schizophrenia or schizoaffective disorder was highlighted by the approach implemented in this study. Published evidence has shown that high-consumption of betel nuts had significantly milder positive symptoms of schizophrenia in males (44, 45). Such a therapeutic role of betel nut has been attributed to arecoline, a partial muscarinic agonist (46). Its potential use in medication-induced movement disorders (46) and role in enhancing cognitive ability and social activity in schizophrenia patients have also been explored (47). The use of Citrus limon essential oil has been evaluated and has shown positive effects for anxiety (48 –50). The possibility of action on benzodiazepine-type receptors is implicated for such anxiolytic activity (48). Clerodendrum inerme, traditionally used to reduce stress, has been successfully tested for use against motor tics (Tic disorder) (51). The therapeutic role of hispidulin, a flavonoid isolated from C. inerme has been proposed in hyper-dopaminergic disorders (52). The role of Kyllinga brevifolia in alleviating stress has been implicated by interaction with benzodiazepine receptors (53). Premna serratifolia has been shown to play a role in stress resistance. An iridoid, 10-0-trans-p-Coumaroylcatalpol, from P. serratifolia decreased the aggregation of Parkinson’s disease associated protein (alpha synuclein) in a transgenic Caenorhabditis elegans model, as well as promoting longevity (54). The anxiolytic effect of Centella asiatica extract has been demonstrated in mice, with activity attributed to madecassoside and asiaticoside (55). Additionally, has been shown to attenuate amyloid-beta induced oxidative stress and mitochondrial dysfunction contributing towards its neuroprotective action (56). In addition to the role of identified plants species discussed here for the ICD-9-CM category “Mental, Behavioral And Neurodevelopmental Disorders (290-319),” several other potential therapeutic roles were identified (Table 2). The ability to identify such potential uses of plants that have been identified from study of indigenous populations may provide the basis for designing appropriate subsequent clinical studies to understand the efficacy of these plant species. This study aimed to extract and prioritize the associations between plants and biomedical concepts based on cooccurrence statistics. A major weakness of the current co-occurrence approach is that cannot robustly distinguish between correlation and causation relationships. Thus, while the developed approach did highlight some potentially useful therapeutic associations, it did not distinguish between therapeutic and toxicity associations. For example, in addition to the above mentioned potential therapeutic roles, Areca catechu is also known to cause Oral Submucous Fibrosis (57) and oral cancer (58). Further confounding this issue is the fact that plant toxicity profiles may sometimes be used to identify additional therapeutic roles. For example, the toxic plant Atropa belladonna has been shown to be a potential analgesic (59). A major area of future emphasis of this work will thus be enhancement of the approaches demonstrated in this study for distinguishing between toxicity and therapeutic roles. An immediate area of emphasis will be to focus on mining associations within context such as within defined scopes of text (e.g., sentence, phrase, or utterance). It will be essential to implement such enhancements before considering the general utility of informatics to support the ethnobotany workflow towards identification of possible plant-based therapies. Nonetheless, the results of this feasibility study demonstrate the potential to enhance the current workflow in identifying previously described therapeutic uses for plants.

Conclusion

The adoption of informatics for identifying ethnobotanical knowledge within the increasing stores of electronic has been slow. This feasibility study demonstrates the potential to leverage existing natural language processing, along with terminological resources, to mine medicinal plant knowledge from biomedical literature as indexed in MEDLINE. The developed approach was able to identify potential new therapeutic applications for plants previously described in ethnobotanical surveys from the Micronesian islands of Palau and Pohnpei. Building on this feasibility study, future work is needed to increase overall specificity of identified correlations (e.g., distinguish between reported therapeutic versus toxic effects) before computational approaches be used for potentially impacting the ability to identify new plant based remedies inspired by ethnobotanical surveys. Supplementary information referenced in this study is available at: https://sites.google.com/a/brown.edu/phytokb/tbi2016
  43 in total

1.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.

Authors:  A R Aronson
Journal:  Proc AMIA Symp       Date:  2001

2.  Aggregating UMLS semantic types for reducing conceptual complexity.

Authors:  A T McCray; A Burgun; O Bodenreider
Journal:  Stud Health Technol Inform       Date:  2001

3.  Exploring text mining from MEDLINE.

Authors:  Padmini Srinivasan; Thomas Rindflesch
Journal:  Proc AMIA Symp       Date:  2002

4.  Effects of chewing betel nut (Areca catechu) on the symptoms of people with schizophrenia in Palau, Micronesia.

Authors:  R J Sullivan; J S Allen; C Otto; J Tiobech; K Nero
Journal:  Br J Psychiatry       Date:  2000-08       Impact factor: 9.319

Review 5.  Frontiers of biomedical text mining: current progress.

Authors:  Pierre Zweigenbaum; Dina Demner-Fushman; Hong Yu; Kevin B Cohen
Journal:  Brief Bioinform       Date:  2007-10-30       Impact factor: 11.622

Review 6.  Bioinformatics opportunities for identification and study of medicinal plants.

Authors:  Vivekanand Sharma; Indra Neil Sarkar
Journal:  Brief Bioinform       Date:  2012-05-15       Impact factor: 11.622

7.  Hispidulin, a constituent of Clerodendrum inerme that remitted motor tics, alleviated methamphetamine-induced hyperlocomotion without motor impairment in mice.

Authors:  Wei-Jan Huang; Hsin-Jung Lee; Hon-Lie Chen; Pi-Chuan Fan; Yuan-Ling Ku; Lih-Chu Chiou
Journal:  J Ethnopharmacol       Date:  2015-03-09       Impact factor: 4.360

8.  The effects of an indigenous muscarinic drug, Betel nut (Areca catechu), on the symptoms of schizophrenia: a longitudinal study in Palau, Micronesia.

Authors:  Roger J Sullivan; Sylvia Andres; Caleb Otto; Wayne Miles; Robert Kydd
Journal:  Am J Psychiatry       Date:  2007-04       Impact factor: 18.112

Review 9.  Essential oils and anxiolytic aromatherapy.

Authors:  William N Setzer
Journal:  Nat Prod Commun       Date:  2009-09       Impact factor: 0.986

10.  Comparison of concept recognizers for building the Open Biomedical Annotator.

Authors:  Nigam H Shah; Nipun Bhatia; Clement Jonquet; Daniel Rubin; Annie P Chiang; Mark A Musen
Journal:  BMC Bioinformatics       Date:  2009-09-17       Impact factor: 3.169

View more
  1 in total

1.  Identifying Supplement Use Within Clinical Notes: An Applicationof Natural Language Processing.

Authors:  Vivekanand Sharma; Indra Neil Sarkar
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2018-05-18
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.