scispace - formally typeset
Open AccessPosted Content

Figurative Usage Detection of Symptom Words to Improve Personal Health Mention Detection

Reads0
Chats0
TLDR
In this paper, the authors combine a state-of-the-art figurative usage detection with CNN-based personal health mention detection for predicting whether or not a given sentence is a report of a health condition.
Abstract
Personal health mention detection deals with predicting whether or not a given sentence is a report of a health condition. Past work mentions errors in this prediction when symptom words, i.e. names of symptoms of interest, are used in a figurative sense. Therefore, we combine a state-of-the-art figurative usage detection with CNN-based personal health mention detection. To do so, we present two methods: a pipeline-based approach and a feature augmentation-based approach. The introduction of figurative usage detection results in an average improvement of 2.21% F-score of personal health mention detection, in the case of the feature augmentation-based approach. This paper demonstrates the promise of using figurative usage detection to improve personal health mention detection.

read more

Citations
More filters
Journal ArticleDOI

Can Large Language Models Transform Computational Social Science?

TL;DR: This article showed that large language models (LLMs) like ChatGPT are capable of performing many language processing tasks zero-shot (without the need for training data) and they could effectively transform Computational Social Science (CSS).
Proceedings ArticleDOI

IMPLI: Investigating NLI Models’ Performance on Figurative Language

TL;DR: IMPLI is introduced, an English dataset consisting of paired sentences spanning idioms and metaphors and it is shown that while NLI models can reliably detect entailment relationship between figurative phrases with their literal counterparts, they perform poorly on similarly structured examples where pairs are designed to be non-entailing.
Journal ArticleDOI

COVID-19 personal health mention detection from tweets using dual convolutional neural network

TL;DR: Wang et al. as mentioned in this paper built a COVID-19 PHM dataset containing more than 11,000 annotated tweets, and proposed a dual convolutional neural network (CNN) framework using this dataset.
Proceedings ArticleDOI

Identification of Disease or Symptom terms in Reddit to Improve Health Mention Classification

TL;DR: This work presents a Reddit health mention dataset (RHMD), a new dataset of multi-domain Reddit data for the health mention classification (HMC) task, and proposes HMCNET that combines a target keyword (disease or symptom term) identification and user behavior hierarchically to improve HMC.
Journal ArticleDOI

Performance Comparison of Transformer-Based Models on Twitter Health Mention Classification

TL;DR: In this paper , the authors compared nine widely used transformer methods and compared their performance on the personal health mention classification of tweet data, and analyzed the impact of model size on the classification task and provided a brief interpretation of the classification decision made by the best performing classifier.
References
More filters
Proceedings ArticleDOI

Glove: Global Vectors for Word Representation

TL;DR: A new global logbilinear regression model that combines the advantages of the two major model families in the literature: global matrix factorization and local context window methods and produces a vector space with meaningful substructure.
Journal ArticleDOI

WordNet: a lexical database for English

TL;DR: WordNet1 provides a more effective combination of traditional lexicographic information and modern computing, and is an online lexical database designed for use under program control.
Posted Content

Distributed Representations of Words and Phrases and their Compositionality

TL;DR: In this paper, the Skip-gram model is used to learn high-quality distributed vector representations that capture a large number of precise syntactic and semantic word relationships and improve both the quality of the vectors and the training speed.
Proceedings ArticleDOI

The Stanford CoreNLP Natural Language Processing Toolkit

TL;DR: The design and use of the Stanford CoreNLP toolkit is described, an extensible pipeline that provides core natural language analysis, and it is suggested that this follows from a simple, approachable design, straightforward interfaces, the inclusion of robust and good quality analysis components, and not requiring use of a large amount of associated baggage.
Proceedings Article

ConceptNet 5.5: An Open Multilingual Graph of General Knowledge

TL;DR: ConceptNet as mentioned in this paper is a knowledge graph that connects words and phrases of natural language with labeled edges to represent the general knowledge involved in understanding language, improving natural language applications by allowing the application to better understand the meanings behind the words people use.
Related Papers (5)