Semi-supervised Named-Entity Recognition for Product Attribute Extraction in Book Domain

Hadi Syah Putra, Faisal Satrio Priatmadji, Rahmad Mahendra

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Citations (Scopus)

Abstract

Products sold in today’s marketplace are very numerous and varied. One of them is the book product. Detail information about the book, such as the title of the book, author, and publisher, is often presented in unstructured format in the product title. In order to be useful for the commercial applications, for example catalogs, search functions, and recommendation systems, the attributes need to be extracted from the product title. In this study, we apply Named-Entity Recognition model in semi-supervised style to extract the attributes of e-commerce products in book domain. We experiment with the number of features extraction, i.e. lexical, position, word shape, and embedding features. We extract the book attributes from near to 30K product title data with F-1 measure 65%.

Original languageEnglish
Title of host publicationDigital Libraries at Times of Massive Societal Transition - 22nd International Conference on Asia-Pacific Digital Libraries, ICADL 2020, Proceedings
EditorsEmi Ishita, Natalie Lee Pang, Lihong Zhou
PublisherSpringer Science and Business Media Deutschland GmbH
Pages43-51
Number of pages9
ISBN (Print)9783030644512
DOIs
Publication statusPublished - 2020
Event22nd International Conference on Asia-Pacific Digital Libraries, ICADL 2020 - Kyoto, Japan
Duration: 30 Nov 20201 Dec 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12504 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference22nd International Conference on Asia-Pacific Digital Libraries, ICADL 2020
Country/TerritoryJapan
CityKyoto
Period30/11/201/12/20

Keywords

  • Attribute extraction
  • Book
  • E-commerce
  • Named-Entity Recognition
  • Product title

Fingerprint

Dive into the research topics of 'Semi-supervised Named-Entity Recognition for Product Attribute Extraction in Book Domain'. Together they form a unique fingerprint.

Cite this