Summarization and sentiment analysis from user health posts

doi:10.1109/PERVASIVE.2015.7087087

Proceedings Article•DOI•

Vinod L. Mane¹, Suja S. Panicker¹, Vidya B. Patil¹•Institutions (1)

Massachusetts Institute of Technology¹

16 Apr 2015-pp 1-4

TL;DR: This work collects real-time health posts from reputed websites, where patients express their views, including their experiences with and the side effects of the drugs they use, and proposes to classify the users based on their 'emotional state of mind'.

Abstract: Online health communities continue to offer a huge variety of medical information useful for medical practitioners, system administrators and patients alike. In this work we collect real-time health posts from reputed websites, where patients express their views, including their experiences with and the side effects of the drugs they use. We propose to perform summarization of user posts per drug, and to present useful conclusions for the medical fraternity as well as the patient community at a glance. Further, we propose to classify the users based on their ‘emotional state of mind’. We also perform knowledge discovery from user posts, whereby useful ‘patterns’ about the triad ‘drugs-symptoms-medicine’ are extracted by Association Rule Mining.
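
The association rule mining step mentioned in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the toy `posts` data, the item names (`drugA`, `nausea`, and so on), the thresholds, and the restriction to single-item rules are hypothetical and are not taken from the paper.

```python
from itertools import combinations

# Toy, made-up transactions: each set holds the terms tagged in one post.
posts = [
    {"drugA", "nausea", "headache"},
    {"drugA", "nausea"},
    {"drugB", "dizziness"},
    {"drugA", "headache"},
    {"drugB", "nausea", "dizziness"},
]

def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

def confidence(lhs, rhs, transactions):
    """Estimated P(rhs | lhs): support(lhs and rhs together) / support(lhs)."""
    joint = support(set(lhs) | set(rhs), transactions)
    return joint / support(lhs, transactions)

def pair_rules(transactions, min_support=0.4, min_confidence=0.6):
    """Enumerate single-item rules lhs -> rhs that clear both thresholds."""
    items = sorted(set().union(*transactions))
    rules = []
    for a, b in combinations(items, 2):
        for lhs, rhs in ((a, b), (b, a)):
            s = support({lhs, rhs}, transactions)
            if s < min_support:
                continue
            c = confidence({lhs}, {rhs}, transactions)
            if c >= min_confidence:
                rules.append((lhs, rhs, s, c))
    return rules

for lhs, rhs, s, c in pair_rules(posts):
    print(f"{lhs} -> {rhs}: support={s:.2f}, confidence={c:.2f}")
```

A full miner such as Apriori would also consider larger itemsets and prune candidates by support; this sketch only shows how support and confidence decide which rules survive.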

Citations

Posted Content•

A Hierarchical End-to-End Model for Jointly Improving Text Summarization and Sentiment Classification

Shuming Ma¹, Xu Sun¹, Junyang Lin¹, Xuancheng Ren¹•Institutions (1)

Peking University¹

03 May 2018-arXiv: Computation and Language

TL;DR: This work proposes a hierarchical end-to-end model for joint learning of text summarization and sentiment classification, where the sentiment classification label is treated as the further "summarization" of the text summarization output.

Abstract: Text summarization and sentiment classification both aim to capture the main ideas of the text but at different levels. Text summarization is to describe the text within a few sentences, while sentiment classification can be regarded as a special type of summarization which "summarizes" the text in an even more abstract fashion, i.e., a sentiment class. Based on this idea, we propose a hierarchical end-to-end model for joint learning of text summarization and sentiment classification, where the sentiment classification label is treated as the further "summarization" of the text summarization output. Hence, the sentiment classification layer is put upon the text summarization layer, and a hierarchical structure is derived. Experimental results on Amazon online reviews datasets show that our model achieves better performance than the strong baseline systems on both abstractive summarization and sentiment classification.

39 citations

Journal Article•DOI•

A survey on opinion summarization techniques for social media

Mohammed Elsaid Moussa¹, Ensaf Hussein Mohamed¹, Mohamed H. Haggag¹•Institutions (1)

Helwan University¹

01 Jun 2018-Future Computing and Informatics Journal

TL;DR: This survey presents the current challenges of opinion summarization for social media, then covers the necessary pre-summarization steps such as preprocessing, feature extraction, noise elimination, and handling of synonym features.

38 citations

Proceedings Article•DOI•

Mining Twitter Data for Depression Detection

Priyanka Arora¹, Parul Arora¹•Institutions (1)

Jaypee Institute of Information Technology¹

07 Mar 2019

TL;DR: This paper tries to detect depression and anxiety in mixed health tweets by using Multinomial Naive Bayes and Support Vector Regression (SVR) algorithms as classifiers.

Abstract: Health care Twitter analysis examines health-related tweets, written by patients themselves, through sentiment analysis. The application of sentiment analysis has grown enormously, and its use in health care has great potential to analyze and improve the health of a country. In this paper, we try to detect depression and anxiety in mixed tweets by using Multinomial Naive Bayes and Support Vector Regression (SVR) algorithms as classifiers.
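
As a rough illustration of the Multinomial Naive Bayes classifier named in this abstract, the following is a minimal from-scratch sketch. The training sentences, the two labels, and the helper names (`train_multinomial_nb`, `predict`) are hypothetical; the abstract does not describe the authors' features, data, or implementation.

```python
import math
from collections import Counter, defaultdict

def train_multinomial_nb(docs, labels, alpha=1.0):
    """Fit class priors and Laplace-smoothed word likelihoods (log space)."""
    class_docs = Counter(labels)
    word_counts = defaultdict(Counter)
    for tokens, label in zip(docs, labels):
        word_counts[label].update(tokens)
    vocab = {w for counts in word_counts.values() for w in counts}
    model = {"vocab": vocab, "log_prior": {}, "log_lik": {}}
    for label, n_docs in class_docs.items():
        model["log_prior"][label] = math.log(n_docs / len(docs))
        total = sum(word_counts[label].values()) + alpha * len(vocab)
        model["log_lik"][label] = {
            w: math.log((word_counts[label][w] + alpha) / total) for w in vocab
        }
    return model

def predict(model, tokens):
    """Return the label with the highest posterior log-score."""
    scores = {}
    for label, prior in model["log_prior"].items():
        lik = model["log_lik"][label]
        # Words never seen during training are skipped.
        scores[label] = prior + sum(lik[w] for w in tokens if w in model["vocab"])
    return max(scores, key=scores.get)

# Made-up training examples, for illustration only.
docs = [
    "feel hopeless and tired".split(),
    "cannot sleep feel empty".split(),
    "great run this morning".split(),
    "feel great after lunch".split(),
]
labels = ["depressed", "depressed", "neutral", "neutral"]
model = train_multinomial_nb(docs, labels)
print(predict(model, "feel hopeless".split()))
```

Working in log space avoids numerical underflow when many word probabilities are multiplied, and the `alpha` smoothing term keeps a word that is absent from one class from forcing that class's score to minus infinity.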

36 citations

Cites methods from "Summarization and sentiment analysi..."

...[12] have used Summarization approach using simplified Lesk algorithm to arrange sentences in descending order....
[...]
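
The simplified Lesk idea referred to in the excerpt above can be sketched as follows: each word is assigned the sense whose dictionary gloss shares the most words with its context, and sentences are then ordered by their summed overlap in descending order. This is an illustrative assumption about how such a ranking might work, not the cited authors' implementation; the `GLOSSES` dictionary, the stopword list, and the scoring rule are all hypothetical stand-ins for a real lexical resource such as WordNet.

```python
# Hypothetical mini gloss dictionary standing in for a real lexical resource.
GLOSSES = {
    "pain": ["physical suffering caused by illness or injury",
             "a person who is annoying"],
    "dose": ["measured quantity of a medicine taken at one time"],
    "rash": ["red irritated area of skin caused by illness or medicine",
             "acting without careful thought"],
}
STOPWORDS = {"a", "an", "the", "of", "or", "by", "is", "at", "who",
             "i", "my", "and", "was", "in", "after"}

def tokens(text):
    """Lowercase whitespace tokens with trailing punctuation stripped."""
    return [w.strip(".,!?").lower() for w in text.split()]

def best_sense_overlap(word, context):
    """Simplified Lesk: largest gloss/context overlap over the word's senses."""
    overlaps = [
        len((set(tokens(gloss)) - STOPWORDS) & context)
        for gloss in GLOSSES.get(word, [])
    ]
    return max(overlaps, default=0)

def rank_sentences(sentences):
    """Score each sentence by summed overlap and sort in descending order."""
    scored = []
    for sentence in sentences:
        words = tokens(sentence)
        context = set(words) - STOPWORDS
        score = sum(best_sense_overlap(w, context - {w}) for w in words)
        scored.append((score, sentence))
    return sorted(scored, key=lambda pair: pair[0], reverse=True)

sentences = [
    "The weather was nice.",
    "I took a dose of the medicine.",
    "The rash on my skin was caused by the medicine.",
]
ranked = rank_sentences(sentences)
for score, sentence in ranked:
    print(score, sentence)
```

A summary would then keep the top-ranked sentences up to some length budget; that cutoff is a separate design choice not shown here.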

Proceedings Article•DOI•

A Unified Dual-view Model for Review Summarization and Sentiment Classification with Inconsistency Loss

Hou Pong Chan¹, Wang Chen¹, Irwin King¹•Institutions (1)

The Chinese University of Hong Kong¹

25 Jul 2020

TL;DR: A novel dual-view model is proposed that jointly improves the performance of these two tasks, review summarization and sentiment classification, and helps the decoder to generate a summary to have a consistent sentiment tendency with the review and also helps the two sentiment classifiers learn from each other.

Abstract: Acquiring accurate summarization and sentiment from user reviews is an essential component of modern e-commerce platforms. Review summarization aims at generating a concise summary that describes the key opinions and sentiment of a review, while sentiment classification aims to predict a sentiment label indicating the sentiment attitude of a review. To effectively leverage the shared sentiment information in both review summarization and sentiment classification tasks, we propose a novel dual-view model that jointly improves the performance of these two tasks. In our model, an encoder first learns a context representation for the review, then a summary decoder generates a review summary word by word. After that, a source-view sentiment classifier uses the encoded context representation to predict a sentiment label for the review, while a summary-view sentiment classifier uses the decoder hidden states to predict a sentiment label for the generated summary. During training, we introduce an inconsistency loss to penalize the disagreement between these two classifiers. It helps the decoder to generate a summary to have a consistent sentiment tendency with the review and also helps the two sentiment classifiers learn from each other. Experiment results on four real-world datasets from different domains demonstrate the effectiveness of our model.
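
To make the idea of an inconsistency loss concrete, here is a minimal sketch that penalizes disagreement between two classifiers' predicted label distributions. The abstract does not state how disagreement is measured; using the KL divergence between the two softmax outputs is an assumption made here for illustration, and the function names are hypothetical.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of raw scores."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))

def inconsistency_loss(source_logits, summary_logits):
    """Penalty that grows as the two classifiers' predictions diverge."""
    return kl_divergence(softmax(source_logits), softmax(summary_logits))

# Identical predictions give no penalty; opposite predictions are penalized.
print(inconsistency_loss([1.0, 2.0, 3.0], [1.0, 2.0, 3.0]))
print(inconsistency_loss([2.0, 0.0], [0.0, 2.0]))
```

In a joint training setup, a term like this would be added to the summarization and classification losses, typically with a weighting coefficient, so that gradients push the source-view and summary-view predictions toward agreement.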

25 citations

Cites methods from "Summarization and sentiment analysi..."

...Though some previous methods [16, 25] can predict both the sentiment label and the summary for a social media text, the sentiment classification and summarization modules are trained separately and they rely on rich hand-crafted features....
[...]
...Two models [16, 25] were proposed to jointly extract a summary and predict the sentiment label for a social media post, but the summarization module and classification module of these models are trained separately and they require rich hand-crafted features....
[...]

Proceedings Article•

A Self-Attentive Hierarchical Model for Jointly Improving Text Summarization and Sentiment Classification

Hongli Wang, Jiangtao Ren

04 Nov 2018

TL;DR: A Self-Attentive Hierarchical model for jointly improving text Summarization and Sentiment Classification (SAHSSC), which outperforms the state-of-the-art baselines on both abstractive text summarization and sentiment classification by a considerable margin.

Abstract: Text summarization and sentiment classification, in NLP, are two main tasks implemented on text analysis, focusing on extracting the major idea of a text at different levels. Based on the characteristics of both, sentiment classification can be regarded as a more abstractive summarization task. According to the scheme, a Self-Attentive Hierarchical model for jointly improving text Summarization and Sentiment Classification (SAHSSC) is proposed in this paper. This model jointly performs abstractive text summarization and sentiment classification within a hierarchical end-to-end neural framework, in which the sentiment classification layer on top of the summarization layer predicts the sentiment label in the light of the text and the generated summary. Furthermore, a self-attention layer is also proposed in the hierarchical framework, which is the bridge that connects the summarization layer and the sentiment classification layer and aims at capturing emotional information at text-level as well as summary-level. The proposed model can generate a more relevant summary and lead to a more accurate summary-aware sentiment prediction. Experimental results evaluated on SNAP Amazon online review datasets show that our model outperforms the state-of-the-art baselines on both abstractive text summarization and sentiment classification by a considerable margin.

17 citations

Cites background or methods from "Summarization and sentiment analysi..."

...In past years, there have been several systems of text analysis [Hole and Takalikar (2013); Mane et al. (2015)], which have been able to produce the summary and the sentiment label from the source content by lots of hand-crafted features....
[...]
...Hole and Takalikar (2013) and Mane et al. (2015) jointed the text summarization and the sentiment classification into a text analysis system as two independent function modules....
[...]

References

Journal Article•DOI•

Introduction to WordNet: An On-line Lexical Database

George A. Miller¹, Richard Beckwith¹, Christiane Fellbaum¹, Derek Gross², Katherine J. Miller¹ +1 more•Institutions (2)

Princeton University¹, University of Rochester²

01 Dec 1990-International Journal of Lexicography

TL;DR: Standard alphabetical procedures for organizing lexical information put together words that are spelled alike and scatter words with similar or related meanings haphazardly through the list.

Abstract: Standard alphabetical procedures for organizing lexical information put together words that are spelled alike and scatter words with similar or related meanings haphazardly through the list. Unfortunately, there is no obvious alternative, no other simple way for lexicographers to keep track of what has been done or for readers to find the word they are looking for. But a frequent objection to this solution is that finding things on an alphabetical list can be tedious and time-consuming. Many people who would like to refer to a dictionary decide not to bother with it because finding the information would interrupt their work and break their train of thought.

5,038 citations

"Summarization and sentiment analysi..." refers methods in this paper

...We are using WordNet dictionary [13] to detect correct sense of word....
[...]

Journal Article•DOI•

Sentiment analysis algorithms and applications: A survey

Walaa Medhat¹, Ahmed Hassan², Hoda Korashy²•Institutions (2)

Hodges University¹, Ain Shams University²

01 Dec 2014-Ain Shams Engineering Journal

TL;DR: This survey paper tackles a comprehensive overview of the last update in this field of sentiment analysis with sophisticated categorizations of a large number of recent articles and the illustration of the recent trend of research in the sentiment analysis and its related areas.

2,152 citations

Journal Article•DOI•

Association rule mining to detect factors which contribute to heart disease in males and females

Jesmin Nahar¹, Tasadduq Imam¹, Kevin S. Tickle¹, Yi-Ping Phoebe Chen²•Institutions (2)

Central Queensland University¹, La Trobe University²

01 Mar 2013-Expert Systems With Applications

TL;DR: It is seen that factors such as chest pain being asymptomatic and the presence of exercise-induced angina indicate the likely existence of heart disease for both men and women, and resting ECG status is a key distinct factor for heart disease prediction.

Abstract: This paper investigates the sick and healthy factors which contribute to heart disease for males and females. Association rule mining, a computational intelligence approach, is used to identify these factors and the UCI Cleveland dataset, a biological database, is considered along with the three rule generation algorithms - Apriori, Predictive Apriori and Tertius. Analyzing the information available on sick and healthy individuals and taking confidence as an indicator, females are seen to have less chance of coronary heart disease than males. Also, the attributes indicating healthy and sick conditions were identified. It is seen that factors such as chest pain being asymptomatic and the presence of exercise-induced angina indicate the likely existence of heart disease for both men and women. However, resting ECG being either normal or hyper and slope being flat are potential high risk factors for women only. For men, on the other hand, only a single rule expressing resting ECG being hyper was shown to be a significant factor. This means, for women, resting ECG status is a key distinct factor for heart disease prediction. Comparing the healthy status of men and women, slope being up, number of coloured vessels being zero, and oldpeak being less than or equal to 0.56 indicate a healthy status for both genders.

329 citations

"Summarization and sentiment analysi..." refers background in this paper

...Their research shows how computational intelligence can be used to identify important factors responsible for disease [3]....
[...]
...The rule says that RHS is likely to occur whenever the LHS set occurs [3]....
[...]

Journal Article•DOI•

Mining opinion components from unstructured reviews: A review

Khairullah Khan¹, Baharum Baharudin¹, Aurangzeb Khan², Ashraf Ullah²•Institutions (2)

Universiti Teknologi Petronas¹, University of Science and Technology²

01 Sep 2014-Journal of King Saud University - Computer and Information Sciences

TL;DR: This study presents a systematic literature survey regarding the computational techniques, models and algorithms for mining opinion components from unstructured reviews.

118 citations

"Summarization and sentiment analysi..." refers background in this paper

...The major challenges in opinion mining including ambiguity, semantic relatedness, and context dependency are addressed in [12]....
[...]

Proceedings Article•DOI•

People on drugs: credibility of user statements in health communities

Subhabrata Mukherjee¹, Gerhard Weikum¹, Cristian Danescu-Niculescu-Mizil¹•Institutions (1)

Max Planck Society¹

24 Aug 2014

TL;DR: The authors proposed a method for automatically establishing the credibility of user-generated medical statements and the trustworthiness of their authors by exploiting linguistic cues and distant supervision from expert sources, which can reliably extract side-effects and filter out false statements, while identifying trustworthy users that are likely to contribute valuable medical information.

Abstract: Online health communities are a valuable source of information for patients and physicians. However, such user-generated resources are often plagued by inaccuracies and misinformation. In this work we propose a method for automatically establishing the credibility of user-generated medical statements and the trustworthiness of their authors by exploiting linguistic cues and distant supervision from expert sources. To this end we introduce a probabilistic graphical model that jointly learns user trustworthiness, statement credibility, and language objectivity. We apply this methodology to the task of extracting rare or unknown side-effects of medical drugs, this being one of the problems where large scale non-expert data has the potential to complement expert medical knowledge. We show that our method can reliably extract side-effects and filter out false statements, while identifying trustworthy users that are likely to contribute valuable medical information.

105 citations