AutoAlign: Fully Automatic and Effective Knowledge Graph Alignment Enabled by Large Language Models

Research output: Contribution to journalArticlepeer-review

11 Citations (Scopus)

Abstract

The task of entity alignment between knowledge graphs (KGs) aims to identify every pair of entities from two different KGs that represent the same entity. Many machine learning-based methods have been proposed for this task. However, to our best knowledge, existing methods all require manually crafted seed alignments, which are expensive to obtain. In this paper, we propose the first fully automatic alignment method named AutoAlign, which does not require any manually crafted seed alignments. Specifically, for predicate embeddings, AutoAlign constructs a predicate-proximity-graph with the help of large language models to automatically capture the similarity between predicates across two KGs. For entity embeddings, AutoAlign first computes the entity embeddings of each KG independently using TransE, and then shifts the two KGs’ entity embeddings into the same vector space by computing the similarity between entities based on their attributes. Thus, both predicate alignment and entity alignment can be done without manually crafted seed alignments. AutoAlign is not only fully automatic, but also highly effective. Experiments using real-world KGs show that AutoAlign improves the performance of entity alignment significantly compared to state-of-the-art methods.
Original languageEnglish
Pages (from-to)2357-2371
JournalIEEE Transactions on Knowledge and Data Engineering
Volume36
Issue number6
DOIs
Publication statusPublished - 19 Oct 2023

Keywords

  • Attribute embeddings
  • deep learning
  • entity alignment
  • knowledge base
  • knowledge graph
  • knowledge graph alignment
  • large language model
  • predicate proximity graph
  • representation learning

Fingerprint

Dive into the research topics of 'AutoAlign: Fully Automatic and Effective Knowledge Graph Alignment Enabled by Large Language Models'. Together they form a unique fingerprint.

Cite this