Michael Halper1, Huanying Gu2, Yehoshua Perl3, Christopher Ochs4. 1. Information Technology Department, New Jersey Institute of Technology, Newark, NJ 07102, USA. Electronic address: michael.halper@njit.edu. 2. Computer Science Department, New York Institute of Technology, New York, NY 10023, USA. Electronic address: hgu03@nyit.edu. 3. Computer Science Department, New Jersey Institute of Technology, Newark, NJ 07102, USA. Electronic address: yehoshua.perl@gmail.com. 4. Computer Science Department, New Jersey Institute of Technology, Newark, NJ 07102, USA. Electronic address: cro3@njit.edu.
Abstract
OBJECTIVE: Terminologies and terminological systems have assumed important roles in many medical information processing environments, giving rise to the "big knowledge" challenge when terminological content comprises tens of thousands to millions of concepts arranged in a tangled web of relationships. Use and maintenance of knowledge structures on that scale can be daunting. The notion of abstraction network is presented as a means of facilitating the usability, comprehensibility, visualization, and quality assurance of terminologies. METHODS AND MATERIALS: An abstraction network overlays a terminology's underlying network structure at a higher level of abstraction. In particular, it provides a more compact view of the terminology's content, avoiding the display of minutiae. General abstraction network characteristics are discussed. Moreover, the notion of meta-abstraction network, existing at an even higher level of abstraction than a typical abstraction network, is described for cases where even the abstraction network itself represents a case of "big knowledge." Various features in the design of abstraction networks are demonstrated in a methodological survey of some existing abstraction networks previously developed and deployed for a variety of terminologies. RESULTS: The applicability of the general abstraction-network framework is shown through use-cases of various terminologies, including the Systematized Nomenclature of Medicine - Clinical Terms (SNOMED CT), the Medical Entities Dictionary (MED), and the Unified Medical Language System (UMLS). Important characteristics of the surveyed abstraction networks are provided, e.g., the magnitude of the respective size reduction referred to as the abstraction ratio. Specific benefits of these alternative terminology-network views, particularly their use in terminology quality assurance, are discussed. Examples of meta-abstraction networks are presented. CONCLUSIONS: The "big knowledge" challenge constitutes the use and maintenance of terminological structures that comprise tens of thousands to millions of concepts and their attendant complexity. The notion of abstraction network has been introduced as a tool in helping to overcome this challenge, thus enhancing the usefulness of terminologies. Abstraction networks have been shown to be applicable to a variety of existing biomedical terminologies, and these alternative structural views hold promise for future expanded use with additional terminologies.
OBJECTIVE: Terminologies and terminological systems have assumed important roles in many medical information processing environments, giving rise to the "big knowledge" challenge when terminological content comprises tens of thousands to millions of concepts arranged in a tangled web of relationships. Use and maintenance of knowledge structures on that scale can be daunting. The notion of abstraction network is presented as a means of facilitating the usability, comprehensibility, visualization, and quality assurance of terminologies. METHODS AND MATERIALS: An abstraction network overlays a terminology's underlying network structure at a higher level of abstraction. In particular, it provides a more compact view of the terminology's content, avoiding the display of minutiae. General abstraction network characteristics are discussed. Moreover, the notion of meta-abstraction network, existing at an even higher level of abstraction than a typical abstraction network, is described for cases where even the abstraction network itself represents a case of "big knowledge." Various features in the design of abstraction networks are demonstrated in a methodological survey of some existing abstraction networks previously developed and deployed for a variety of terminologies. RESULTS: The applicability of the general abstraction-network framework is shown through use-cases of various terminologies, including the Systematized Nomenclature of Medicine - Clinical Terms (SNOMED CT), the Medical Entities Dictionary (MED), and the Unified Medical Language System (UMLS). Important characteristics of the surveyed abstraction networks are provided, e.g., the magnitude of the respective size reduction referred to as the abstraction ratio. Specific benefits of these alternative terminology-network views, particularly their use in terminology quality assurance, are discussed. Examples of meta-abstraction networks are presented. CONCLUSIONS: The "big knowledge" challenge constitutes the use and maintenance of terminological structures that comprise tens of thousands to millions of concepts and their attendant complexity. The notion of abstraction network has been introduced as a tool in helping to overcome this challenge, thus enhancing the usefulness of terminologies. Abstraction networks have been shown to be applicable to a variety of existing biomedical terminologies, and these alternative structural views hold promise for future expanded use with additional terminologies.
Authors: Yue Wang; Michael Halper; Duo Wei; Huanying Gu; Yehoshua Perl; Junchuan Xu; Gai Elhanan; Yan Chen; Kent A Spackman; James T Case; George Hripcsak Journal: J Biomed Inform Date: 2011-09-01 Impact factor: 6.317
Authors: Christopher Ochs; Zhe He; Ling Zheng; James Geller; Yehoshua Perl; George Hripcsak; Mark A Musen Journal: J Biomed Inform Date: 2016-03-14 Impact factor: 6.317
Authors: Tiffany J Callahan; William A Baumgartner; Michael Bada; Adrianne L Stefanski; Ignacio Tripodi; Elizabeth K White; Lawrence E Hunter Journal: Pac Symp Biocomput Date: 2018
Authors: Yehoshua Perl; James Geller; Michael Halper; Christopher Ochs; Ling Zheng; Joan Kapusnik-Uner Journal: Ann N Y Acad Sci Date: 2016-10-17 Impact factor: 5.691