Towards an Open NLI LLM-based System for KGs: A Case Study of Wikidata

Jaycent Gunawan Ongris, Eduardus Tjitrahardja, Fariz Darari, Fajar J. Ekaputra

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-review

Abstract

The rise of large language models (LLMs) has significantly advanced information retrieval, yet challenges such as a limited ability to update knowledge, lack of openness, and hallucination persist. To address these, Retrieval-Augmented Generation (RAG) has been introduced, but it remains limited in interpretability due to its reliance on vector-based representations. This paper presents a question-answering (QA) system using GraphRAG, a RAG system with knowledge graphs (KGs) as its base. We develop a natural language interface (NLI) for QA over Wikidata, a popular, open, and crowdsourced KG. Our approach employs LLM chaining, i.e., a paradigm that makes multiple LLM calls in sequence, to generate SPARQL queries, with the aim of creating an open system that ensures transparency and allows direct inspection of its components. Using an experimental research approach, we evaluated the generated SPARQL queries and found that incorporating a broader set of property candidates into the prompts significantly boosts performance, achieving a Jaccard similarity score of 0.7806. These findings demonstrate the system's effectiveness in SPARQL query generation and highlight its potential for further development. However, we consider the LLM's limited context window and hallucination to be the major challenges that limit the system's performance.
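For readers unfamiliar with the reported metric, Jaccard similarity between two sets A and B is |A ∩ B| / |A ∪ B|. The sketch below is illustrative only and assumes the score compares the set of answers returned by a generated SPARQL query with the set returned by a reference query; the abstract does not state exactly what is compared. The function name, the entity IDs, and the handling of two empty sets are assumptions made for this example.

```python
def jaccard_similarity(predicted, reference):
    """Return |A ∩ B| / |A ∪ B| for two collections treated as sets."""
    a, b = set(predicted), set(reference)
    if not a and not b:
        # Assumption: two empty result sets are treated as a perfect match.
        return 1.0
    return len(a & b) / len(a | b)


# Hypothetical answer sets (placeholder Wikidata-style IDs, not real results).
generated_answers = ["Q1", "Q2", "Q3"]
reference_answers = ["Q2", "Q3", "Q4"]

score = jaccard_similarity(generated_answers, reference_answers)
print(score)  # 2 shared items out of 4 distinct items
```

A score of 1.0 means the two answer sets are identical, and 0.0 means they share no element.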

Original language: English
Title of host publication: 7th International Seminar on Research of Information Technology and Intelligent Systems
Subtitle of host publication: Advanced Intelligent Systems in Contemporary Society, ISRITI 2024 - Proceedings
Editors: Ferry Wahyu Wibowo
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 44-49
Number of pages: 6
ISBN (Electronic): 9798331519643
DOIs
Publication status: Published - 2024
Event: 7th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2024 - Hybrid, Yogyakarta, Indonesia
Duration: 11 Dec 2024 → …

Publication series

Name: 7th International Seminar on Research of Information Technology and Intelligent Systems: Advanced Intelligent Systems in Contemporary Society, ISRITI 2024 - Proceedings

Conference

Conference: 7th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2024
Country/Territory: Indonesia
City: Hybrid, Yogyakarta
Period: 11/12/24 → …

Keywords

  • GraphRAG
  • KG
  • LLM
  • RAG
  • Wikidata
