Literature DB >> 33489717

Querying knowledge graphs in natural language.

Shiqi Liang1, Kurt Stockinger2, Tarcisio Mendes de Farias3,4, Maria Anisimova2,3, Manuel Gil2,3.   

Abstract

Knowledge graphs are a powerful concept for querying large amounts of data. These knowledge graphs are typically enormous and are often not easily accessible to end-users because they require specialized knowledge in query languages such as SPARQL. Moreover, end-users need a deep understanding of the structure of the underlying data models often based on the Resource Description Framework (RDF). This drawback has led to the development of Question-Answering (QA) systems that enable end-users to express their information needs in natural language. While existing systems simplify user access, there is still room for improvement in the accuracy of these systems. In this paper we propose a new QA system for translating natural language questions into SPARQL queries. The key idea is to break up the translation process into 5 smaller, more manageable sub-tasks and use ensemble machine learning methods as well as Tree-LSTM-based neural network models to automatically learn and translate a natural language question into a SPARQL query. The performance of our proposed QA system is empirically evaluated using the two renowned benchmarks-the 7th Question Answering over Linked Data Challenge (QALD-7) and the Large-Scale Complex Question Answering Dataset (LC-QuAD). Experimental results show that our QA system outperforms the state-of-art systems by 15% on the QALD-7 dataset and by 48% on the LC-QuAD dataset, respectively. In addition, we make our source code available.
© The Author(s) 2021.

Entities:  

Keywords:  Knowledge graphs; Natural language processing; Query processing; SPARQL

Year:  2021        PMID: 33489717      PMCID: PMC7799375          DOI: 10.1186/s40537-020-00383-w

Source DB:  PubMed          Journal:  J Big Data        ISSN: 2196-1115


  2 in total

1.  UniProtKB/Swiss-Prot.

Authors:  Emmanuel Boutet; Damien Lieberherr; Michael Tognolli; Michel Schneider; Amos Bairoch
Journal:  Methods Mol Biol       Date:  2007

2.  Enabling semantic queries across federated bioinformatics databases.

Authors:  Ana Claudia Sima; Tarcisio Mendes de Farias; Erich Zbinden; Maria Anisimova; Manuel Gil; Heinz Stockinger; Kurt Stockinger; Marc Robinson-Rechavi; Christophe Dessimoz
Journal:  Database (Oxford)       Date:  2019-01-01       Impact factor: 3.451

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.