Literature DB >> 21170415

Finding related sentence pairs in MEDLINE.

Larry H Smith1, W John Wilbur.   

Abstract

We explore the feasibility of automatically identifying sentences in different MEDLINE abstracts that are related in meaning. We compared traditional vector space models with machine learning methods for detecting relatedness, and found that machine learning was superior. The Huber method, a variant of Support Vector Machines which minimizes the modified Huber loss function, achieves 73% precision when the score cutoff is set high enough to identify about one related sentence per abstract on average. We illustrate how an abstract viewed in PubMed might be modified to present the related sentences found in other abstracts by this automatic procedure.

Entities:  

Year:  2010        PMID: 21170415      PMCID: PMC2992462          DOI: 10.1007/s10791-010-9126-8

Source DB:  PubMed          Journal:  Inf Retr Boston        ISSN: 1386-4564            Impact factor:   2.293


  15 in total

1.  Boosting naïve Bayesian learning on a large subset of MEDLINE.

Authors:  W J Wilbur
Journal:  Proc AMIA Symp       Date:  2000

2.  Evaluation of the UMLS as a terminology and knowledge resource for biomedical informatics.

Authors:  Olivier Bodenreider; Joyce A Mitchell; Alexa T McCray
Journal:  Proc AMIA Symp       Date:  2002

Review 3.  Two biomedical sublanguages: a description based on the theories of Zellig Harris.

Authors:  Carol Friedman; Pauline Kra; Andrey Rzhetsky
Journal:  J Biomed Inform       Date:  2002-08       Impact factor: 6.317

4.  MedPost: a part-of-speech tagger for bioMedical text.

Authors:  L Smith; T Rindflesch; W J Wilbur
Journal:  Bioinformatics       Date:  2004-04-08       Impact factor: 6.937

5.  Evaluating relevance ranking strategies for MEDLINE retrieval.

Authors:  Zhiyong Lu; Won Kim; W John Wilbur
Journal:  J Am Med Inform Assoc       Date:  2008-10-24       Impact factor: 4.497

6.  Concentrations of diazinon, chlorpyrifos, and bendiocarb after application in offices.

Authors:  K L Currie; E C McDonald; L T Chung; A R Higgs
Journal:  Am Ind Hyg Assoc J       Date:  1990-01

7.  Risk of sick leave associated with outdoor air supply rate, humidification, and occupant complaints.

Authors:  D K Milton; P M Glencross; M D Walters
Journal:  Indoor Air       Date:  2000-12       Impact factor: 5.770

8.  A large-scale experiment on mass transfer of trichloroethylene from the unsaturated zone of a sandy aquifer to its interfaces.

Authors:  Salah Jellali; Hocine Benremita; Paul Muntzer; Olivier Razakarisoa; Gerhard Schäfer
Journal:  J Contam Hydrol       Date:  2003-01       Impact factor: 3.188

9.  Database resources of the National Center for Biotechnology Information.

Authors:  Eric W Sayers; Tanya Barrett; Dennis A Benson; Stephen H Bryant; Kathi Canese; Vyacheslav Chetvernin; Deanna M Church; Michael DiCuccio; Ron Edgar; Scott Federhen; Michael Feolo; Lewis Y Geer; Wolfgang Helmberg; Yuri Kapustin; David Landsman; David J Lipman; Thomas L Madden; Donna R Maglott; Vadim Miller; Ilene Mizrachi; James Ostell; Kim D Pruitt; Gregory D Schuler; Edwin Sequeira; Stephen T Sherry; Martin Shumway; Karl Sirotkin; Alexandre Souvorov; Grigory Starchenko; Tatiana A Tatusova; Lukas Wagner; Eugene Yaschenko; Jian Ye
Journal:  Nucleic Acids Res       Date:  2008-10-21       Impact factor: 16.971

10.  PubMed related articles: a probabilistic topic-based model for content similarity.

Authors:  Jimmy Lin; W John Wilbur
Journal:  BMC Bioinformatics       Date:  2007-10-30       Impact factor: 3.169

View more
  6 in total

1.  Extracting drug-drug interactions from literature using a rich feature-based linear kernel approach.

Authors:  Sun Kim; Haibin Liu; Lana Yeganova; W John Wilbur
Journal:  J Biomed Inform       Date:  2015-03-19       Impact factor: 6.317

2.  Efficient large-scale protein sequence comparison and gene matching to identify orthologs and co-orthologs.

Authors:  Khalid Mahmood; Geoffrey I Webb; Jiangning Song; James C Whisstock; Arun S Konagurthu
Journal:  Nucleic Acids Res       Date:  2011-12-30       Impact factor: 16.971

3.  Prioritizing PubMed articles for the Comparative Toxicogenomic Database utilizing semantic information.

Authors:  Sun Kim; Won Kim; Chih-Hsuan Wei; Zhiyong Lu; W John Wilbur
Journal:  Database (Oxford)       Date:  2012-11-17       Impact factor: 3.451

4.  Classifying protein-protein interaction articles using word and syntactic features.

Authors:  Sun Kim; W John Wilbur
Journal:  BMC Bioinformatics       Date:  2011-10-03       Impact factor: 3.169

5.  Identifying named entities from PubMed for enriching semantic categories.

Authors:  Sun Kim; Zhiyong Lu; W John Wilbur
Journal:  BMC Bioinformatics       Date:  2015-02-21       Impact factor: 3.169

6.  BioCreative V BioC track overview: collaborative biocurator assistant task for BioGRID.

Authors:  Sun Kim; Rezarta Islamaj Doğan; Andrew Chatr-Aryamontri; Christie S Chang; Rose Oughtred; Jennifer Rust; Riza Batista-Navarro; Jacob Carter; Sophia Ananiadou; Sérgio Matos; André Santos; David Campos; José Luís Oliveira; Onkar Singh; Jitendra Jonnagaddala; Hong-Jie Dai; Emily Chia-Yu Su; Yung-Chun Chang; Yu-Chen Su; Chun-Han Chu; Chien Chin Chen; Wen-Lian Hsu; Yifan Peng; Cecilia Arighi; Cathy H Wu; K Vijay-Shanker; Ferhat Aydın; Zehra Melce Hüsünbeyi; Arzucan Özgür; Soo-Yong Shin; Dongseop Kwon; Kara Dolinski; Mike Tyers; W John Wilbur; Donald C Comeau
Journal:  Database (Oxford)       Date:  2016-09-01       Impact factor: 3.451

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.