| Literature DB >> 28815152 |
Feichen Shen1, Sunghwan Sohn1, Majid Rastegar-Mojarad1, Sijia Liu1, Joshua J Pankratz2, Michael A Hatton3, Nancy Sowada3, Om K Shrestha4, Shawna L Shurson4, Hongfang Liu1.
Abstract
The physicians' biographical pages are essential in providing information about physicians' specialties. However, physicians may not have biographical pages or the current pages are not comprehensive. We hypothesize that physicians' specialty information can be mined from Electronic Medical Records (EMRs) of their patients. We proposed an automated physician specialty populating (PSP) system that analyzes physician-ascertained diagnoses in EMRs, aggregates them to an appropriate granularity based on the current biographical pages, and populates the biographical pages accordingly. In this study, we applied the system using EMR data from Mayo Clinic and evaluated the system using the current biographical pages regarding various ranking strategies. Preliminary results demonstrated that using EMR data is a scalable and systematic way to populate physicians' biographical pages.Entities:
Year: 2017 PMID: 28815152 PMCID: PMC5543344
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
Figure 1.Workflow of the proposed physician specialty populating (PSP) system
Qualified semantic types
| Pathologic Function | T046 | patf |
| Disease or Syndrome | T047 | dsyn |
| Mental or Behavioral Dysfunction | T048 | mobd |
| Neoplastic Process | T191 | neop |
| Cell or Molecular Dysfunction | T049 | comd |
| Experimental Model of Disease | T050 | emod |
| Health Care Activity | T058 | hlca |
| Laboratory Procedure | T059 | lbpr |
| Diagnostic Procedure | T060 | diap |
| Therapeutic or Preventive Procedure | T061 | topp |
Figure 2.An example of information content measurement in SNOMED-CT
Statistics of Mayo Clinic 2010-2015 EMRs
| EMRs | 789,966 | 8,249 | 23,979,937 |
| Subset of EMRs | 789,966 | 8,078 | 16,094,797 |
Statistics of current physicians’ clinical biographies from Mayo Clinic three campuses
| Current Bio Pages | 2,967 | 2,431 | 718 | 658 |
Figure 3.Count of SNOMED-CT codes for the top 10 practice settings with 9 experimental configurations
Figure 4.Precision for the top 10 practice settings with 9 experimental configurations
Overall precision/recall/F-measure for 658 physicians with nine groups (highest value in bold)
| TF-IDF | 0.21/0.08/0.12 | 0.4/ | |
| TF | 0.45/0.5/0.47 | 0.21/0.07/0.11 | 0.42/0.54/0.47 |
| Hybrid | 0.45/0.6/ | 0.22/0.07/0.11 | 0.42/0.63/0.5 |