OpenTag: Open Attribute Value Extraction from Product Profiles
Guineng Zheng, Subhabrata Mukherjee, Xin Luna Dong, Feifei Li
pp. 1049–1058
TLDR
OpenTag leverages product profile information such as titles and descriptions to discover missing values of product attributes, and proposes a novel sampling strategy based on active learning to reduce the burden of human annotation.
Abstract
Extraction of missing attribute values is the task of finding values describing an attribute of interest in free-text input. Most prior work on this problem operates under a closed-world assumption, with the possible set of values known beforehand, or relies on dictionaries of values and hand-crafted features. How can we discover new attribute values that we have never seen before? Can we do this with limited human annotation or supervision? We study this problem in the context of product catalogs, which often have missing values for many attributes of interest. In this work, we leverage product profile information such as titles and descriptions to discover missing values of product attributes. We develop a novel deep tagging model, OpenTag, for this extraction problem with the following contributions: (1) we formalize the problem as a sequence-tagging task and propose a joint model that exploits recurrent neural networks (specifically, bidirectional LSTM) to capture context and semantics, and Conditional Random Fields (CRF) to enforce tagging consistency; (2) we develop a novel attention mechanism to provide interpretable explanations for our model's decisions; (3) we propose a novel sampling strategy based on active learning to reduce the burden of human annotation. OpenTag does not use any dictionary or hand-crafted features as in prior work. Extensive experiments on real-life datasets in different domains show that OpenTag with our active learning strategy discovers new attribute values from as few as 150 annotated samples (a 3.3x reduction in annotation effort) with a high F-score of 83%, outperforming state-of-the-art models.
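The sequence-tagging formulation in contribution (1) can be illustrated in pure Python. The sketch below uses a hypothetical `bio_tag` helper (not from the paper) that converts a known attribute value into {B, I, O} labels over the tokens of a product title; a BiLSTM-CRF model such as OpenTag's would then be trained on the resulting (token, tag) sequences.

```python
def bio_tag(title_tokens, value_tokens):
    """Label the tokens of a product title with B/I/O tags marking one
    attribute-value span (e.g. a flavor). Hypothetical illustration of
    the tagging formulation, not the paper's exact preprocessing."""
    tags = ["O"] * len(title_tokens)  # default: token is Outside any value
    n = len(value_tokens)
    # Scan the title for an exact token-level match of the value.
    for i in range(len(title_tokens) - n + 1):
        if title_tokens[i:i + n] == value_tokens:
            tags[i] = "B"              # Beginning of the value span
            for j in range(i + 1, i + n):
                tags[j] = "I"          # Inside the value span
    return tags

title = "duck flavor dog treats".split()
print(bio_tag(title, ["duck", "flavor"]))
# ['B', 'I', 'O', 'O']
```

Under this encoding, discovering a *new* attribute value reduces to the tagger predicting a B/I span over tokens it has never seen labeled before, which is why no closed value dictionary is required.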
Citations
Posted Content
Machine Knowledge: Creation and Curation of Comprehensive Knowledge Bases
TL;DR: In this article, the authors survey fundamental concepts and practical methods for creating and curating large-scale knowledge bases, including methods for discovering and canonicalizing entities and their semantic types and organizing them into clean taxonomies.
Proceedings ArticleDOI
Challenges and Innovations in Building a Product Knowledge Graph
TL;DR: Three advanced extraction technologies are developed to harvest product knowledge from semi-structured sources on the web and from textual product profiles; the OpenTag technique extends state-of-the-art methods such as Recurrent Neural Networks and Conditional Random Fields with attention and active learning.
Journal ArticleDOI
CASIE: Extracting Cybersecurity Event Information from Text
TL;DR: CASIE is a system that extracts information about cybersecurity events from text and populates a semantic model that can incorporate rich linguistic features and word embeddings and shows that each subsystem performs well in the event detection pipeline.
Proceedings ArticleDOI
Scaling up Open Tagging from Tens to Thousands: Comprehension Empowered Attribute Value Extraction from Product Title
TL;DR: A novel approach to support value extraction scaling up to thousands of attributes without losing performance, and explicitly model the semantic representations for attribute and title, and develop an attention mechanism to capture the interactive semantic relations in-between to enforce the framework to be attribute comprehensive.
Proceedings ArticleDOI
AutoKnow: Self-Driving Knowledge Collection for Products of Thousands of Types
Xin Luna Dong, Xiang He, Andrey Kan, Xian Li, Yan Liang, Jun Ma, Yifan Ethan Xu, Chenwei Zhang, Tong Zhao, Gabriel Blanco Saldana, Saurabh Deshpande, Alexandre Michetti Manduca, Jay Ren, Surender Pal Singh, Fan Xiao, Haw-Shiuan Chang, Giannis Karamanolakis, Yuning Mao, Yaqing Wang, Christos Faloutsos, Andrew McCallum, Jiawei Han
TL;DR: AutoKnow, an automatic (self-driving) system, addresses the challenges of organizing information about products: sparsity and noise of structured product data, the complexity of a domain with millions of product types and thousands of attributes, heterogeneity across a large number of categories, and a large and constantly growing number of products.
References
Proceedings Article
Adam: A Method for Stochastic Optimization
Diederik P. Kingma, Jimmy Ba
TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
Journal ArticleDOI
Long short-term memory
TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.
Journal Article
Dropout: a simple way to prevent neural networks from overfitting
TL;DR: It is shown that dropout improves the performance of neural networks on supervised learning tasks in vision, speech recognition, document classification and computational biology, obtaining state-of-the-art results on many benchmark data sets.
Proceedings ArticleDOI
GloVe: Global Vectors for Word Representation
TL;DR: A new global log-bilinear regression model that combines the advantages of the two major model families in the literature, global matrix factorization and local context window methods, and produces a vector space with meaningful substructure.
Proceedings Article
Distributed Representations of Words and Phrases and their Compositionality
TL;DR: This paper presents a simple method for finding phrases in text, and shows that learning good vector representations for millions of phrases is possible and describes a simple alternative to the hierarchical softmax called negative sampling.