| Literature DB >> 32214930 |
Nigel Collier1, Ai Kawazoe1, Lihua Jin1, Mika Shigematsu2, Dinh Dien3, Roberto A Barrero4, Koichi Takeuchi5, Asanee Kawtrakul6.
Abstract
A lack of surveillance system infrastructure in the Asia-Pacific region is seen as hindering the global control of rapidly spreading infectious diseases such as the recent avian H5N1 epidemic. As part of improving surveillance in the region, the BioCaster project aims to develop a system based on text mining for automatically monitoring Internet news and other online sources in several regional languages. At the heart of the system is an application ontology which serves the dual purpose of enabling advanced searches on the mined facts and of allowing the system to make intelligent inferences for assessing the priority of events. However, it became clear early on in the project that existing classification schemes did not have the necessary language coverage or semantic specificity for our needs. In this article we present an overview of our needs and explore in detail the rationale and methods for developing a new conceptual structure and multilingual terminological resource that focusses on priority pathogens and the diseases they cause. The ontology is made freely available as an online database and downloadable OWL file. © Springer Science+Business Media B.V. 2007.Entities:
Keywords: Infectious disease surveillance; Multilingual ontology; Text mining
Year: 2007 PMID: 32214930 PMCID: PMC7087677 DOI: 10.1007/s10579-007-9019-7
Source DB: PubMed Journal: Lang Resour Eval ISSN: 1574-020X Impact factor: 1.358
Fig. 1Partial BCO class hierarchy showing the top level and target entity classes (capitalized)
Fig. 2Example of multilingual term correspondence for weakness and fatigue