| Literature DB >> 17584920 |
Wei Yu1, Ajay Yesupriya, Anja Wulf, Junfeng Qu, Marta Gwinn, Muin J Khoury.
Abstract
BACKGROUND: Collaboration among investigators has become critical to scientific research. This includes ad hoc collaboration established through personal contacts as well as formal consortia established by funding agencies. Continued growth in online resources for scientific research and communication has promoted the development of highly networked research communities. Extending these networks globally requires identifying additional investigators in a given domain, profiling their research interests, and collecting current contact information. We present a novel strategy for building investigator networks dynamically and producing detailed investigator profiles using data available in PubMed abstracts.Entities:
Mesh:
Year: 2007 PMID: 17584920 PMCID: PMC1931433 DOI: 10.1186/1472-6947-7-17
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Keyword list for parsing institution information
| univ | University | English | University of Michigan |
| institu | Institute | English | National Institutes of Health |
| hospital | Hospital | English | Queen's University of Belfast |
| college | College | English | Medical College of Georgia |
| cent | Center | English | Memorial Sloan-Kettering Cancer Center |
| foundat | Foundation | English | Janssen Research Foundation |
| school | School | English | Menzies School of Health Research |
| system | System | English | North Shore-Long Island Jewish Health System |
| acad | Academy | English | Chinese Academy of Sciences |
| facul | Facility | English | Istanbul Faculty of Medicine |
| labora | Laboratory | English | Abbott Laboratories |
| clin | Clinic | English | Mayo Clinic |
| infirm | Infirmary | English | Royal Infirmary of Edinburgh |
| agenc | Agency | English | International Agency for Research on Cancer |
Figure 1Relational database schema. Note: UMLS – Unified Medical Language System. CUI – Concept Unique Identifier. MeSH – Medical Subject Heading. PK – Primary Key. FK – Foreign Key
Affiliation information available from records in PubMed and HuGE Pub Lit
| 98.6% | 43.0% | 19.8% | 98.8% | |
| 87.3% | 40.3% | 22.3% | 90.7% |
*Affiliation availability: number of documents that have affiliation string/total number of documents.
†Email availability: number of documents that have valid email addresses/total number of documents.
‡Authors with affiliation: number of first authors with affiliation/number of all authors.
§First authors with affiliation: number of first authors with affiliation/number of first authors.
Affiliation parsing performance
| Parsable* | Accuracy† | Parsable | Accuracy | |
| 92.1% | 94.0% | 91.3% | 86.8% | |
| 97.0% | 91% | 94.2% | 87.0% | |
*Parsable: number of abstracts that have country or institution information/number of abstracts that have affiliation information.
†Accuracy: number of abstracts that have correct country or institution information/number of abstracts that have affiliation information.
Comparison of investigators identified by experts and the methodology
| First/Last Author only | All Author | ||||
| Preterm Birth | preterm birth or premature | 40 (83.33%) | 46 (95.83%) | 48 | 502(F/L)§ 1694(All) |
| HIV | hiv | 97 (83.62%) | 111 (95.69%) | 116 | 518(F/L)§ 1997(All) |
| Chlamydia trachomatis | Chlamydia trachomatis | 17 (70.83%) | 24 (100%) | 24 | 19 (F/L)§ 68(All) |
* %: the number of the investigators in the methodology-generated list/the number of investigators experts identified.
§F/L:First/Last Authors option in Investigator Browser; All: All Authors option in Investigator Browser.
Figure 2Results of Investigator Browser search for HIV investigator network in human genome epidemiology.
Figure 3Investigator Browser showing an investigator detail profile in HIV investigator network in human genome epidemiology.
Figure 4Investigator Browser presentation of country distribution in HIV investigator network in human genome epidemiology.