Figurative Usage Detection of Symptom Words to Improve Personal Health Mention Detection

Open Access, Posted Content

Adith Iyer, +4 more

- 13 Jun 2019 -

arXiv: Computation and Language

TLDR

In this paper, the authors combine a state-of-the-art figurative usage detection method with CNN-based personal health mention detection, the task of predicting whether or not a given sentence is a report of a health condition.

Abstract:

Personal health mention detection deals with predicting whether or not a given sentence is a report of a health condition. Past work mentions errors in this prediction when symptom words, i.e. names of symptoms of interest, are used in a figurative sense. Therefore, we combine a state-of-the-art figurative usage detection method with CNN-based personal health mention detection. To do so, we present two methods: a pipeline-based approach and a feature augmentation-based approach. With the feature augmentation-based approach, introducing figurative usage detection yields an average improvement of 2.21% in the F-score of personal health mention detection. This paper demonstrates the promise of using figurative usage detection to improve personal health mention detection.
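As a rough illustration of the two combination strategies named in the abstract, the following Python sketch contrasts a pipeline with feature augmentation. Everything in it is an assumption made for illustration: the abstract does not say how the two detectors are wired together, the keyword-based scorers `toy_figurative_score` and `toy_phm_score` are hypothetical stand-ins for the figurative usage detector and the CNN, and the 0.5 threshold is arbitrary.

```python
from typing import Callable, List

Scorer = Callable[[str], float]


def toy_figurative_score(sentence: str) -> float:
    """Hypothetical stand-in: score that a symptom word is used figuratively."""
    cues = ("fever pitch", "with excitement")
    return 0.9 if any(cue in sentence.lower() for cue in cues) else 0.1


def toy_phm_score(sentence: str) -> float:
    """Hypothetical stand-in for a health mention classifier's output score."""
    symptoms = ("cough", "fever", "headache")
    return 0.8 if any(word in sentence.lower() for word in symptoms) else 0.2


def pipeline_predict(sentence: str, figurative: Scorer, phm: Scorer,
                     threshold: float = 0.5) -> bool:
    """Pipeline idea: a sentence judged figurative is never a health mention;
    only the remaining sentences reach the health mention classifier."""
    if figurative(sentence) >= threshold:
        return False
    return phm(sentence) >= threshold


def augment_features(features: List[float], figurative_score: float) -> List[float]:
    """Feature augmentation idea: append the figurative signal to the
    classifier's input features instead of using it as a hard filter."""
    return features + [figurative_score]


literal = "I have had a fever since Monday"
figurative = "The crowd reached fever pitch"
print(pipeline_predict(literal, toy_figurative_score, toy_phm_score))
print(pipeline_predict(figurative, toy_figurative_score, toy_phm_score))
print(augment_features([0.2, 0.7], toy_figurative_score(figurative)))
```

In the pipeline variant, an error by the figurative detector cannot be recovered downstream, whereas the augmented variant leaves the final decision to the classifier that consumes the extra feature.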

Citations

Journal Article

Can Large Language Models Transform Computational Social Science?

Caleb Ziems, +5 more

- 12 Apr 2023 -

arXiv.org

TL;DR: This article showed that large language models (LLMs) like ChatGPT are capable of performing many language processing tasks zero-shot (without the need for training data) and they could effectively transform Computational Social Science (CSS).

Proceedings Article

IMPLI: Investigating NLI Models’ Performance on Figurative Language

Kimberly A. Stowe, +2 more

TL;DR: This work introduces IMPLI, an English dataset of paired sentences spanning idioms and metaphors, and shows that while NLI models can reliably detect entailment relationships between figurative phrases and their literal counterparts, they perform poorly on similarly structured examples where the pairs are designed to be non-entailing.

Journal Article

COVID-19 personal health mention detection from tweets using dual convolutional neural network

Linkai Luo, +2 more

- 01 Apr 2022 -

Expert systems with applications

TL;DR: This work builds a COVID-19 PHM dataset containing more than 11,000 annotated tweets and proposes a dual convolutional neural network (CNN) framework using this dataset.

Proceedings Article

Identification of Disease or Symptom terms in Reddit to Improve Health Mention Classification

Usman Naseem, +3 more

TL;DR: This work presents a Reddit health mention dataset (RHMD), a new dataset of multi-domain Reddit data for the health mention classification (HMC) task, and proposes HMCNET that combines a target keyword (disease or symptom term) identification and user behavior hierarchically to improve HMC.

Journal Article

Performance Comparison of Transformer-Based Models on Twitter Health Mention Classification

Pervaiz Iqbal Khan, +3 more

- 01 Jun 2023 -

IEEE Transactions on Computational Socia...

TL;DR: In this paper, the authors compare the performance of nine widely used transformer methods on personal health mention classification of tweet data, analyze the impact of model size on the classification task, and provide a brief interpretation of the classification decisions made by the best-performing classifier.

References

Proceedings Article

GloVe: Global Vectors for Word Representation

Jeffrey Pennington, +2 more

TL;DR: A new global log-bilinear regression model that combines the advantages of the two major model families in the literature, global matrix factorization and local context window methods, and produces a vector space with meaningful substructure.

Journal Article

WordNet: a lexical database for English

George A. Miller

- 01 Nov 1995 -

Communications of The ACM

TL;DR: WordNet is an online lexical database designed for use under program control; it provides a more effective combination of traditional lexicographic information and modern computing.

Posted Content

Distributed Representations of Words and Phrases and their Compositionality

Tomas Mikolov, +4 more

- 16 Oct 2013 -

arXiv: Computation and Language

TL;DR: This paper presents several extensions to the Skip-gram model, which learns high-quality distributed vector representations that capture a large number of precise syntactic and semantic word relationships; the extensions improve both the quality of the vectors and the training speed.

Proceedings Article

The Stanford CoreNLP Natural Language Processing Toolkit

Christopher D. Manning, +5 more

TL;DR: The design and use of the Stanford CoreNLP toolkit, an extensible pipeline that provides core natural language analysis, are described, and it is suggested that the toolkit's wide use follows from a simple, approachable design, straightforward interfaces, the inclusion of robust and good-quality analysis components, and not requiring use of a large amount of associated baggage.

Proceedings Article

ConceptNet 5.5: An Open Multilingual Graph of General Knowledge

Robert Speer, +2 more

TL;DR: ConceptNet is a knowledge graph that connects words and phrases of natural language with labeled edges to represent the general knowledge involved in understanding language, improving natural language applications by allowing them to better understand the meanings behind the words people use.

Information-an International Interdiscip...

Related Papers (5)

Figurative Usage Detection of Symptom Words to Improve Personal Health Mention Detection

Improving Personal Health Mention Detection on Twitter Using Permutation Based Word Representation Learning

Improving Automatic Categorization of Technical vs. Laymen Medical Words using FastText Word Embeddings

Topic Identification Challenge Based on Short Word History

FastText-Based Intent Detection for Inflected Languages