Home
/
Authors
/
Sangwoo Seo

Author

Sangwoo Seo

Bio: Sangwoo Seo is an academic researcher from Hanyang University. The author has contributed to research in topics: Relationship extraction & WordNet. The author has an hindex of 2, co-authored 3 publications receiving 72 citations.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Semantic Relation Classification via Bidirectional LSTM Networks with Entity-Aware Attention Using Latent Entity Typing

[...]

Joohong Lee¹, Sangwoo Seo¹, Yong Suk Choi•Institutions (1)

Hanyang University¹

23 Jan 2019-Symmetry

TL;DR: This article proposed an end-to-end recurrent neural model that incorporates an entity-aware attention mechanism with a latent entity typing (LET) method, which not only effectively utilizes entities and their latent types as features, but also builds word representations by applying self-attention based on symmetrical similarity of a sentence itself.

...read moreread less

Abstract: Classifying semantic relations between entity pairs in sentences is an important task in natural language processing (NLP). Most previous models applied to relation classification rely on high-level lexical and syntactic features obtained by NLP tools such as WordNet, the dependency parser, part-of-speech (POS) tagger, and named entity recognizers (NER). In addition, state-of-the-art neural models based on attention mechanisms do not fully utilize information related to the entity, which may be the most crucial feature for relation classification. To address these issues, we propose a novel end-to-end recurrent neural model that incorporates an entity-aware attention mechanism with a latent entity typing (LET) method. Our model not only effectively utilizes entities and their latent types as features, but also builds word representations by applying self-attention based on symmetrical similarity of a sentence itself. Moreover, the model is interpretable by visualizing applied attention mechanisms. Experimental results obtained with the SemEval-2010 Task 8 dataset, which is one of the most popular relation classification tasks, demonstrate that our model outperforms existing state-of-the-art models without any high-level features.

...read moreread less

90 citations

Posted Content•

Semantic Relation Classification via Bidirectional LSTM Networks with Entity-aware Attention using Latent Entity Typing.

[...]

Joohong Lee¹, Sangwoo Seo¹, Yong Suk Choi¹•Institutions (1)

Hanyang University¹

23 Jan 2019-arXiv: Computation and Language

TL;DR: This work proposes a novel end-to-end recurrent neural model which incorporates an entity-aware attention mechanism with a latent entity typing (LET) method and demonstrates that the model outperforms existing state-of-the-art models without any high-level features.

...read moreread less

Abstract: Classifying semantic relations between entity pairs in sentences is an important task in Natural Language Processing (NLP). Most previous models for relation classification rely on the high-level lexical and syntactic features obtained by NLP tools such as WordNet, dependency parser, part-of-speech (POS) tagger, and named entity recognizers (NER). In addition, state-of-the-art neural models based on attention mechanisms do not fully utilize information of entity that may be the most crucial features for relation classification. To address these issues, we propose a novel end-to-end recurrent neural model which incorporates an entity-aware attention mechanism with a latent entity typing (LET) method. Our model not only utilizes entities and their latent types as features effectively but also is more interpretable by visualizing attention mechanisms applied to our model and results of LET. Experimental results on the SemEval-2010 Task 8, one of the most popular relation classification task, demonstrate that our model outperforms existing state-of-the-art models without any high-level features.

...read moreread less

25 citations

Posted Content•

KLUE: Korean Language Understanding Evaluation.

[...]

Sungjoon Park¹, Jihyung Moon, Sungdong Kim², Won Ik Cho³, Jiyoon Han⁴, Jang-Won Park, Chisung Song, Junseong Kim, Yongsook Song⁵, Tae-Hwan Oh⁴, Joohong Lee, Juhyun Oh³, Sungwon Lyu, Younghoon Jeong⁶, Inkwon Lee², Sangwoo Seo, Dongjun Lee, Hyunwoo Kim³, Myeonghwa Lee¹, Seongbo Jang, Seungwon Do, Sunkyoung Kim¹, KyungTae Lim⁷, Jongwon Lee, Kyumin Park¹, Jamin Shin, Seonghyun Kim, Lucy Park, Alice Oh, Jung-Woo Ha², Kyunghyun Cho⁸ - Show less +27 more•Institutions (8)

KAIST¹, Naver Corporation², Seoul National University³, Yonsei University⁴, Kyung Hee University⁵, Sogang University⁶, Hanbat National University⁷, New York University⁸

20 May 2021-arXiv: Computation and Language

TL;DR: The Korean Language Understanding Evaluation (KLUE) benchmark as mentioned in this paper is a collection of 8 Korean NLP tasks, including Topic Classification, SemanticTextual Similarity, Natural Language Inference, Named Entity Recognition, Relation Extraction, Dependency Parsing, Machine Reading Comprehension, and Dialogue State Tracking.

...read moreread less

Abstract: We introduce Korean Language Understanding Evaluation (KLUE) benchmark. KLUE is a collection of 8 Korean natural language understanding (NLU) tasks, including Topic Classification, SemanticTextual Similarity, Natural Language Inference, Named Entity Recognition, Relation Extraction, Dependency Parsing, Machine Reading Comprehension, and Dialogue State Tracking. We build all of the tasks from scratch from diverse source corpora while respecting copyrights, to ensure accessibility for anyone without any restrictions. With ethical considerations in mind, we carefully design annotation protocols. Along with the benchmark tasks and data, we provide suitable evaluation metrics and fine-tuning recipes for pretrained language models for each task. We furthermore release the pretrained language models (PLM), KLUE-BERT and KLUE-RoBERTa, to help reproducing baseline models on KLUE and thereby facilitate future research. We make a few interesting observations from the preliminary experiments using the proposed KLUE benchmark suite, already demonstrating the usefulness of this new benchmark suite. First, we find KLUE-RoBERTa-large outperforms other baselines, including multilingual PLMs and existing open-source Korean PLMs. Second, we see minimal degradation in performance even when we replace personally identifiable information from the pretraining corpus, suggesting that privacy and NLU capability are not at odds with each other. Lastly, we find that using BPE tokenization in combination with morpheme-level pre-tokenization is effective in tasks involving morpheme-level tagging, detection and generation. In addition to accelerating Korean NLP research, our comprehensive documentation on creating KLUE will facilitate creating similar resources for other languages in the future. KLUE is available at this https URL.

...read moreread less

7 citations

Journal Article•DOI•

Unsupervised Episode Generation for Graph Meta-learning

[...]

Sangwoo Seo, Chanyoung Park

27 Jun 2023-arXiv.org

TL;DR: In this paper , an unsupervised episode generation method was proposed to solve the label-scarcity problem in graph meta-learning without labels, which can be used to solve few-shot node classification.

...read moreread less

Abstract: In this paper, we investigate Unsupervised Episode Generation methods to solve Few-Shot Node-Classification (FSNC) problem via Meta-learning without labels. Dominant meta-learning methodologies for FSNC were developed under the existence of abundant labeled nodes for training, which however may not be possible to obtain in the real-world. Although few studies have been proposed to tackle the label-scarcity problem, they still rely on a limited amount of labeled data, which hinders the full utilization of the information of all nodes in a graph. Despite the effectiveness of Self-Supervised Learning (SSL) approaches on FSNC without labels, they mainly learn generic node embeddings without consideration on the downstream task to be solved, which may limit its performance. In this work, we propose unsupervised episode generation methods to benefit from their generalization ability for FSNC tasks while resolving label-scarcity problem. We first propose a method that utilizes graph augmentation to generate training episodes called g-UMTRA, which however has several drawbacks, i.e., 1) increased training time due to the computation of augmented features and 2) low applicability to existing baselines. Hence, we propose Neighbors as Queries (NaQ), which generates episodes from structural neighbors found by graph diffusion. Our proposed methods are model-agnostic, that is, they can be plugged into any existing graph meta-learning models, while not sacrificing much of their performance or sometimes even improving them. We provide theoretical insights to support why our unsupervised episode generation methodologies work, and extensive experimental results demonstrate the potential of our unsupervised episode generation methods for graph meta-learning towards FSNC problems.

...read moreread less

Proceedings Article•DOI•

Explanation-based Graph Neural Networks for Graph Classification

[...]

Sangwoo Seo, Seungjun Jung, Changick Kim

21 Aug 2022

TL;DR: This article proposed an Explanation-Based Graph Neural Networks (EBGNN) that utilizes contrastive learning at the instance level, by applying explanation components to improve the performance of graph classification.

...read moreread less

Abstract: Graph Neural Network models can be used to quickly analyze interactions between multiple data expressed in a graph structure, with high accuracy. Previous studies accurately extract subgraphs which have a significant influence on the whole graph, providing accurate explanations for predictions of GNN. We noted that explanation components could help improve classification performance as unique representations of each class. Therefore, we suggest the GNN performance can be further improved by using explanation components. In this paper, we propose an Explanation-Based Graph Neural Networks (EBGNN) that utilizes contrastive learning at the instance level, by applying explanation components. In EBGNN, the explanation components ensure similarity for instances within the same class, and promote separability for instances in different classes. Finally, we conducted an evaluation on five benchmark datasets (MUTAG, IMDB-BINARY, PROTEINS, NCI1, and DD). Our experiment showed a significant increase in graph classification performance compared to state-of-the-art methods.

...read moreread less

Cited by

PDF

Open Access

More filters

Proceedings Article•DOI•

Enriching Pre-trained Language Model with Entity Information for Relation Classification

[...]

Shanchan Wu¹, Yifan He¹•Institutions (1)

Alibaba Group¹

03 Nov 2019

TL;DR: This paper proposes a model that both leverages the pre-trained BERT language model and incorporates information from the target entities to tackle the relation classification task and achieves significant improvement over the state-of-the-art method on the SemEval-2010 task 8 relational dataset.

...read moreread less

Abstract: Relation classification is an important NLP task to extract relations between entities. The state-of-the-art methods for relation classification are primarily based on Convolutional or Recurrent Neural Networks. Recently, the pre-trained BERT model achieves very successful results in many NLP classification / sequence labeling tasks. Relation classification differs from those tasks in that it relies on information of both the sentence and the two target entities. In this paper, we propose a model that both leverages the pre-trained BERT language model and incorporates information from the target entities to tackle the relation classification task. We locate the target entities and transfer the information through the pre-trained architecture and incorporate the corresponding encoding of the two entities. We achieve significant improvement over the state-of-the-art method on the SemEval-2010 task 8 relational dataset.

...read moreread less

254 citations

Proceedings Article•DOI•

Chinese Relation Extraction with Multi-Grained Information and External Linguistic Knowledge.

[...]

Ziran Li¹, Ning Ding¹, Zhiyuan Liu¹, Hai-Tao Zheng¹, Ying Shen² - Show less +1 more•Institutions (2)

Tsinghua University¹, Peking University²

01 Jul 2019

TL;DR: A multi-grained lattice framework for Chinese relation extraction is proposed, which incorporates word-level information into character sequence inputs so that segmentation errors can be avoided and model multiple senses of polysemous words with the help of external linguistic knowledge to alleviate polysemy ambiguity.

...read moreread less

Abstract: Chinese relation extraction is conducted using neural networks with either character-based or word-based inputs, and most existing methods typically suffer from segmentation errors and ambiguity of polysemy. To address the issues, we propose a multi-grained lattice framework (MG lattice) for Chinese relation extraction to take advantage of multi-grained language information and external linguistic knowledge. In this framework, (1) we incorporate word-level information into character sequence inputs so that segmentation errors can be avoided. (2) We also model multiple senses of polysemous words with the help of external linguistic knowledge, so as to alleviate polysemy ambiguity. Experiments on three real-world datasets in distinct domains show consistent and significant superiority and robustness of our model, as compared with other baselines. We will release the source code of this paper in the future.

...read moreread less

65 citations

Proceedings Article•DOI•

Beyond Word Attention: Using Segment Attention in Neural Relation Extraction

[...]

Bowen Yu¹, Zhenyu Zhang¹, Tingwen Liu¹, Bin Wang², Sujian Li³, Quangang Li¹ - Show less +2 more•Institutions (3)

Chinese Academy of Sciences¹, Xiaomi², Peking University³

01 Aug 2019

TL;DR: Experimental results show that the method can attend to continuous relational expressions without explicit annotations, and achieve the state-of-the-art performance on the large-scale TACRED dataset.

...read moreread less

Abstract: Relation extraction studies the issue of predicting semantic relations between pairs of entities in sentences. Attention mechanisms are often used in this task to alleviate the inner-sentence noise by performing soft selections of words independently. Based on the observation that information pertinent to relations is usually contained within segments (continuous words in a sentence), it is possible to make use of this phenomenon for better extraction. In this paper, we aim to incorporate such segment information into neural relation extractor. Our approach views the attention mechanism as linear-chain conditional random fields over a set of latent variables whose edges encode the desired structure, and regards attention weight as the marginal distribution of each word being selected as a part of the relational expression. Experimental results show that our method can attend to continuous relational expressions without explicit annotations, and achieve the state-of-the-art performance on the large-scale TACRED dataset.

...read moreread less

50 citations

Journal Article•DOI•

A comprehensive exploration of semantic relation extraction via pre-trained CNNs

[...]

Qing Li¹, Lili Li¹, Weinan Wang², Qi Li³, Jiang Zhong¹ - Show less +1 more•Institutions (3)

Chongqing University¹, Peking University², Shaoxing University³

22 Apr 2020-Knowledge Based Systems

TL;DR: A new pre-trained network architecture for this task, called the XM-CNN, which utilizes word embedding and position embedding information and is designed to reinforce the contextual output from the MT-DNN K D pre- trained model.

...read moreread less

Abstract: Semantic relation extraction between entity pairs is a crucial task in information extraction from text. In this paper, we propose a new pre-trained network architecture for this task, and it is called the XM-CNN. The XM-CNN utilizes word embedding and position embedding information. It is designed to reinforce the contextual output from the MT-DNN K D pre-trained model. Our model effectively utilized an entity-aware attention mechanisms to detected the features and also adopts and applies more relation-specific pooling attention mechanisms applied to it. The experimental results show that the XM-CNN achieves state-of-the-art results on the SemEval-2010 task 8, and a thorough evaluation of the method is conducted.

...read moreread less

33 citations

Journal Article•DOI•

Relation extraction with convolutional network over learnable syntax-transport graph

[...]

Kai Sun¹, Richong Zhang¹, Yongyi Mao², Samuel Mensah¹, Xudong Liu¹ - Show less +1 more•Institutions (2)

Beihang University¹, University of Ottawa²

03 Apr 2020

TL;DR: This work learns to transform the dependency tree into a weighted graph by considering the syntax dependencies of the connected nodes and persisting the structure of the original dependency tree, and refers to this graph as a syntax-transport graph.

...read moreread less

Abstract: A large majority of approaches have been proposed to leverage the dependency tree in the relation classification task. Recent works have focused on pruning irrelevant information from the dependency tree. The state-of-the-art Attention Guided Graph Convolutional Networks (AGGCNs) transforms the dependency tree into a weighted-graph to distinguish the relevance of nodes and edges for relation classification. However, in their approach, the graph is fully connected, which destroys the structure information of the original dependency tree. How to effectively make use of relevant information while ignoring irrelevant information from the dependency trees remains a challenge in the relation classification task. In this work, we learn to transform the dependency tree into a weighted graph by considering the syntax dependencies of the connected nodes and persisting the structure of the original dependency tree. We refer to this graph as a syntax-transport graph. We further propose a learnable syntax-transport attention graph convolutional network (LST-AGCN) which operates on the syntax-transport graph directly to distill the final representation which is sufficient for classification. Experiments on Semeval-2010 Task 8 and Tacred show our approach outperforms previous methods.

...read moreread less

30 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21

Collapse