Literature DB >> 33817035

FNG-IE: an improved graph-based method for keyword extraction from scholarly big-data.

Noman Tahir1, Muhammad Asif1, Shahbaz Ahmad1, Muhammad Sheraz Arshad Malik2, Hanan Aljuaid3, Muhammad Arif Butt4, Mobashar Rehman5.   

Abstract

Keyword extraction is essential in determining influenced keywords from huge documents as the research repositories are becoming massive in volume day by day. The research community is drowning in data and starving for information. The keywords are the words that describe the theme of the whole document in a precise way by consisting of just a few words. Furthermore, many state-of-the-art approaches are available for keyword extraction from a huge collection of documents and are classified into three types, the statistical approaches, machine learning, and graph-based methods. The machine learning approaches require a large training dataset that needs to be developed manually by domain experts, which sometimes is difficult to produce while determining influenced keywords. However, this research focused on enhancing state-of-the-art graph-based methods to extract keywords when the training dataset is unavailable. This research first converted the handcrafted dataset, collected from impact factor journals into n-grams combinations, ranging from unigram to pentagram and also enhanced traditional graph-based approaches. The experiment was conducted on a handcrafted dataset, and all methods were applied on it. Domain experts performed the user study to evaluate the results. The results were observed from every method and were evaluated with the user study using precision, recall and f-measure as evaluation matrices. The results showed that the proposed method (FNG-IE) performed well and scored near the machine learning approaches score.
© 2021 Tahir et al.

Entities:  

Keywords:  Graph-based keyword extraction; Keyword extraction; Programming

Year:  2021        PMID: 33817035      PMCID: PMC7959634          DOI: 10.7717/peerj-cs.389

Source DB:  PubMed          Journal:  PeerJ Comput Sci        ISSN: 2376-5992


  2 in total

1.  Impact analysis of keyword extraction using contextual word embedding.

Authors:  Muhammad Qasim Khan; Abdul Shahid; M Irfan Uddin; Muhammad Roman; Abdullah Alharbi; Wael Alosaimi; Jameel Almalki; Saeed M Alshahrani
Journal:  PeerJ Comput Sci       Date:  2022-05-30

2.  Automatic computer science domain multiple-choice questions generation based on informative sentences.

Authors:  Farah Maheen; Yazeed Yasin Ghadi; Muhammad Asif; Haseeb Ahmad; Shahbaz Ahmad; Fahad Alturise; Othman Asiry
Journal:  PeerJ Comput Sci       Date:  2022-08-16
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.