| Literature DB >> 35186965 |
Hyojin Bae1, Sanghun Lee2,3, Choong-Yeol Lee1, Chang-Eop Kim1.
Abstract
Pattern identification (PI), a unique diagnostic system of traditional Asian medicine, is the process of inferring the pathological nature or location of lesions based on observed symptoms. Despite its critical role in theory and practice, the information processing principles underlying PI systems are generally unclear. We present a novel framework for comprehending the PI system from a machine learning perspective. After a brief introduction to the dimensionality of the data, we propose that the PI system can be modeled as a dimensionality reduction process and discuss analytical issues that can be addressed using our framework. Our framework promotes a new approach in understanding the underlying mechanisms of the PI process with strong mathematical tools, thereby enriching the explanatory theories of traditional Asian medicine.Entities:
Keywords: diagnostic system; dimensionality reduction; machine learning; pattern identification; syndrome differentiation; traditional Asian medicine; traditional Chinese medicine
Year: 2022 PMID: 35186965 PMCID: PMC8853725 DOI: 10.3389/fmed.2021.763533
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Figure 1Schematic figures explaining the features of high-dimensional data. (A): Intuitive understanding of multidimensional data. Each observation (row) in each table is described by one or more features (columns) and visualized as a point in one-, two- or three-dimensional space. This representation is easily extended to four-dimensional space or higher, but it cannot be visualized. (B): By transforming the dataset into high-dimensional space, the data can be separated using a linear decision boundary. (C): Curse of dimensionality. As the dimension of the space increases, the volume of the space expands exponentially, and the density of the space becomes increasingly sparse. (D): Projection into an intrinsic-dimensional space. Data laid out in three-dimensional space can be approximated by a two-dimensional plane composed of newly discovered axes that account for majority of the data variability.
Figure 2Framework modeling the PI with dimensionality reduction. (A): Rather than mapping high-dimensional spaces directly, the number of cases can be reduced exponentially by first projecting the input space to low-dimensional space composed of multiple latent variables and then mapping it to the output space. (B): Representing the data using a few underlying patterns reveals the intrinsic structure of the data, which is difficult to capture in a high-dimensional space where distinct factors of variations are highly entangled. Each point represents sample data, and the points denoted by a black circle represent the ith sample, which is represented in both the symptom space () and the pattern space (). The points are color-coded according to the identified patterns. (C): TAM's low-dimensional pattern space is constructed from metaphorical concepts that are embodied in everyday life. The pattern space in eight-principle PI, the most comprehensive type of PI, is visualized as an example. Six of eight principle patterns are composed of the exterior, interior, cold, heat, deficiency, and excess, while the other two are Yin and Yang, which are higher concepts that encompass the other six patterns. Six principle patterns are grouped in pairs of mutually opposing properties: exterior-interior, cold-heat, and deficiency and excess. These three pairs represent the extent to which external pathogens penetrate the body, the nature of the disease, and the relative superiority of the body's resistance to pathogenic factors and the pathogenic qi, respectively. These concepts' familiar and abstract characteristics enable robust inference of the pathological pattern from a myriad of symptom phenotypes.