scispace - formally typeset
Search or ask a question
Proceedings ArticleDOI

Summarization and sentiment analysis from user health posts

TL;DR: This work collects real time health posts from reputed websites, where patients express their views, including their experiences and side-effects on drugs used by them, and proposes to classify the users based on their `emotional state of mind'.
Abstract: Online health communities continue to offer huge variety of medical information useful for medical practitioners, system administrators and patients alike. In this work we collect real time health posts from reputed websites, where patients express their views, including their experiences and side-effects on drugs used by them. We propose to perform Summarization of user posts per drug, and come out with useful conclusions for medical fraternity as well as patient community at a glance. Further, we propose to classify the users based on their ‘emotional state of mind’. Also, we shall perform knowledge discovery from user posts, whereby useful ‘patterns’ about the triad ‘drugs-symptoms-medicine’ is done by Association Rule Mining.
Citations
More filters
Posted Content
TL;DR: This work proposes a hierarchical end-to-end model for joint learning of text summarization and sentiment classification, where the sentiment classification label is treated as the further ``summarization'' of theText summarization output.
Abstract: Text summarization and sentiment classification both aim to capture the main ideas of the text but at different levels. Text summarization is to describe the text within a few sentences, while sentiment classification can be regarded as a special type of summarization which "summarizes" the text into a even more abstract fashion, i.e., a sentiment class. Based on this idea, we propose a hierarchical end-to-end model for joint learning of text summarization and sentiment classification, where the sentiment classification label is treated as the further "summarization" of the text summarization output. Hence, the sentiment classification layer is put upon the text summarization layer, and a hierarchical structure is derived. Experimental results on Amazon online reviews datasets show that our model achieves better performance than the strong baseline systems on both abstractive summarization and sentiment classification.

39 citations

Journal ArticleDOI
TL;DR: This survey shows the current opinion summarization challenges for social media, then the necessary pre-summarization steps like preprocessing, features extraction, noise elimination, and handling of synonym features are shown.

38 citations

Proceedings ArticleDOI
07 Mar 2019
TL;DR: This paper tries to analyze health tweets for Depression, Anxiety from the mixed tweets by using Multinomial Naive Bayes and Support Vector Regression (SVR) Algorithm as a classifier.
Abstract: Health care twitter analysis deals with the health related tweets through sentimental analysis by the patients themselves. The application of sentiment analysis has grown enormously. Its application in health care has great potential to analyze and improve the health of a country. In this paper, we try to analyze health tweets for Depression, Anxiety from the mixed tweets by using Multinomial Naive Bayes and Support Vector Regression (SVR) Algorithm as a classifier.

36 citations


Cites methods from "Summarization and sentiment analysi..."

  • ...[12] have used Summarization approach using simplified Lesk algorithm to arrange sentences in descending order....

    [...]

Proceedings ArticleDOI
25 Jul 2020
TL;DR: A novel dual-view model is proposed that jointly improves the performance of these two tasks, review summarization and sentiment classification, and helps the decoder to generate a summary to have a consistent sentiment tendency with the review and also helps the two sentiment classifiers learn from each other.
Abstract: Acquiring accurate summarization and sentiment from user reviews is an essential component of modern e-commerce platforms. Review summarization aims at generating a concise summary that describes the key opinions and sentiment of a review, while sentiment classification aims to predict a sentiment label indicating the sentiment attitude of a review. To effectively leverage the shared sentiment information in both review summarization and sentiment classification tasks, we propose a novel dual-view model that jointly improves the performance of these two tasks. In our model, an encoder first learns a context representation for the review, then a summary decoder generates a review summary word by word. After that, a source-view sentiment classifier uses the encoded context representation to predict a sentiment label for the review, while a summary-view sentiment classifier uses the decoder hidden states to predict a sentiment label for the generated summary. During training, we introduce an inconsistency loss to penalize the disagreement between these two classifiers. It helps the decoder to generate a summary to have a consistent sentiment tendency with the review and also helps the two sentiment classifiers learn from each other. Experiment results on four real-world datasets from different domains demonstrate the effectiveness of our model.

25 citations


Cites methods from "Summarization and sentiment analysi..."

  • ...Though some previous methods [16, 25] can predict both the sentiment label and the summary for a social media text, the sentiment classification and summarization modules are trained separately and they rely on rich hand-crafted features....

    [...]

  • ...Two models [16, 25] were proposed to jointly extract a summary and predict the sentiment label for a social media post, but the summarization module and classification module of these models are trained separately and they require rich hand-crafted features....

    [...]

Proceedings Article
04 Nov 2018
TL;DR: A Self-Attentive Hierarchical model for jointly improving text Summarization and Sentiment Classification (SAHSSC), which outperforms the state-of-the-art baselines on both abstractive text summarization and sentiment classification by a considerable margin.
Abstract: Text summarization and sentiment classification, in NLP, are two main tasks implemented on text analysis, focusing on extracting the major idea of a text at different levels. Based on the characteristics of both, sentiment classification can be regarded as a more abstractive summarization task. According to the scheme, a Self-Attentive Hierarchical model for jointly improving text Summarization and Sentiment Classification (SAHSSC) is proposed in this paper. This model jointly performs abstractive text summarization and sentiment classification within a hierarchical end-to-end neural framework, in which the sentiment classification layer on top of the summarization layer predicts the sentiment label in the light of the text and the generated summary. Furthermore, a self-attention layer is also proposed in the hierarchical framework, which is the bridge that connects the summarization layer and the sentiment classification layer and aims at capturing emotional information at text-level as well as summary-level. The proposed model can generate a more relevant summary and lead to a more accurate summary-aware sentiment prediction. Experimental results evaluated on SNAP amazon online review datasets show that our model outperforms the state-of-the-art baselines on both abstractive text summarization and sentiment classification by a considerable margin.

17 citations


Cites background or methods from "Summarization and sentiment analysi..."

  • ...In past years, there have been several systems of text analysis [Hole and Takalikar (2013); Mane et al. (2015)], which have been able to produce the summary and the sentiment label from the source content by lots of hand-crafted features....

    [...]

  • ...Hole and Takalikar (2013) and Mane et al. (2015) jointed the text summarization and the sentiment classification into a text analysis system as two independent function modules....

    [...]

References
More filters
Journal ArticleDOI
TL;DR: Standard alphabetical procedures for organizing lexical information put together words that are spelled alike and scatter words with similar or related meanings haphazardly through the list.
Abstract: Standard alphabetical procedures for organizing lexical information put together words that are spelled alike and scatter words with similar or related meanings haphazardly through the list. Unfortunately, there is no obvious alternative, no other simple way for lexicographers to keep track of what has been done or for readers to find the word they are looking for. But a frequent objection to this solution is that finding things on an alphabetical list can be tedious and time-consuming. Many people who would like to refer to a dictionary decide not to bother with it because finding the information would interrupt their work and break their train of thought.

5,038 citations


"Summarization and sentiment analysi..." refers methods in this paper

  • ...We are using Wordnetdictionary [13] to detect correct sense of word....

    [...]

Journal ArticleDOI
TL;DR: This survey paper tackles a comprehensive overview of the last update in this field of sentiment analysis with sophisticated categorizations of a large number of recent articles and the illustration of the recent trend of research in the sentiment analysis and its related areas.

2,152 citations

Journal ArticleDOI
TL;DR: It is seen that factors such as chest pain being asymptomatic and the presence of exercise-induced angina indicate the likely existence of heart disease for both men and women, and resting ECG status is a key distinct factor for heart disease prediction.
Abstract: This paper investigates the sick and healthy factors which contribute to heart disease for males and females. Association rule mining, a computational intelligence approach, is used to identify these factors and the UCI Cleveland dataset, a biological database, is considered along with the three rule generation algorithms - Apriori, Predictive Apriori and Tertius. Analyzing the information available on sick and healthy individuals and taking confidence as an indicator, females are seen to have less chance of coronary heart disease then males. Also, the attributes indicating healthy and sick conditions were identified. It is seen that factors such as chest pain being asymptomatic and the presence of exercise-induced angina indicate the likely existence of heart disease for both men and women. However, resting ECG being either normal or hyper and slope being flat are potential high risk factors for women only. For men, on the other hand, only a single rule expressing resting ECG being hyper was shown to be a significant factor. This means, for women, resting ECG status is a key distinct factor for heart disease prediction. Comparing the healthy status of men and women, slope being up, number of coloured vessels being zero, and oldpeak being less than or equal to 0.56 indicate a healthy status for both genders.

329 citations


"Summarization and sentiment analysi..." refers background in this paper

  • ...Their research shows that how computational intelligence can be used to identify important factors responsible for disease [3]....

    [...]

  • ...The rule says that RHS is likely to occur whenever the LHS set occurs [3]....

    [...]

Journal ArticleDOI
TL;DR: This study presents a systematic literature survey regarding the computational techniques, models and algorithms for mining opinion components from unstructured reviews.

118 citations


"Summarization and sentiment analysi..." refers background in this paper

  • ...The major challenges in opinion mining including ambiguity, semantic relatedness, and context dependency are addressed in [12]....

    [...]

Proceedings ArticleDOI
24 Aug 2014
TL;DR: The authors proposed a method for automatically establishing the credibility of user-generated medical statements and the trustworthiness of their authors by exploiting linguistic cues and distant supervision from expert sources, which can reliably extract side-effects and filter out false statements, while identifying trustworthy users that are likely to contribute valuable medical information.
Abstract: Online health communities are a valuable source of information for patients and physicians. However, such user-generated resources are often plagued by inaccuracies and misinformation. In this work we propose a method for automatically establishing the credibility of user-generated medical statements and the trustworthiness of their authors by exploiting linguistic cues and distant supervision from expert sources. To this end we introduce a probabilistic graphical model that jointly learns user trustworthiness, statement credibility, and language objectivity.We apply this methodology to the task of extracting rare or unknown side-effects of medical drugs --- this being one of the problems where large scale non-expert data has the potential to complement expert medical knowledge. We show that our method can reliably extract side-effects and filter out false statements, while identifying trustworthy users that are likely to contribute valuable medical information.

105 citations