An artificial neural network based approach for sentiment analysis of opinionated text

doi:10.1145/2401603.2401611

Home
/
Papers
/
An artificial neural network based approach for sentiment analysis of opinionated text

Proceedings Article•DOI•

An artificial neural network based approach for sentiment analysis of opinionated text

Anuj Sharma¹, Shubhamoy Dey¹•Institutions (1)

Indian Institute of Management Ahmedabad¹

23 Oct 2012-pp 37-42

TL;DR: A sentiment classification model using back-propagation artificial neural network (BPANN) is proposed that combines the strength of BPANN in classification accuracy with utilizing intrinsic domain knowledge available in the sentiment lexicons.

read less

Abstract: The Internet and Web 2.0 social media have emerged as an important medium for expressing sentiments, opinions, evaluations, and reviews. Sentiment analysis or opinion mining is becoming an open research domain due to the abundance of discussion forums, Weblogs, e-commerce portals, social networking and content sharing sites where people tend to express their opinions. Sentiment Analysis involves classifying text documents based on the opinion expressed being positive or negative about a given topic. This paper proposes a sentiment classification model using back-propagation artificial neural network (BPANN). Information Gain and three popular sentiment lexicons are used to extract sentiment representing features that are then used to train and test the BPANN. This novel approach combines the strength of BPANN in classification accuracy with utilizing intrinsic domain knowledge available in the sentiment lexicons. The results obtained on the movie-review corpora have shown that the proposed approach has been able to reduce dimensionality, while producing accurate sentiment based classification of text.

...read moreread less

Citations

PDF

Open Access

More filters

Journal Article•DOI•

Wave2Vec: Vectorizing Electroencephalography Bio-Signal for Prediction of Brain Disease

[...]

Seonho Kim¹, Seonho Kim², Jungjoon Kim¹, Jungjoon Kim², Hong-Woo Chun², Hong-Woo Chun¹ - Show less +2 more•Institutions (2)

Korea Institute of Science and Technology Information¹, Korea Institute of Science and Technology²

15 Aug 2018-International Journal of Environmental Research and Public Health

TL;DR: An encoding-based Wave2vec time series classifier model, which combines signal-processing and deep learning-based natural language processing techniques, is provided, which facilitates intuitive and easy recognition, and identification of influential patterns.

...read moreread less

Abstract: Interest in research involving health-medical information analysis based on artificial intelligence, especially for deep learning techniques, has recently been increasing. Most of the research in this field has been focused on searching for new knowledge for predicting and diagnosing disease by revealing the relation between disease and various information features of data. These features are extracted by analyzing various clinical pathology data, such as EHR (electronic health records), and academic literature using the techniques of data analysis, natural language processing, etc. However, still needed are more research and interest in applying the latest advanced artificial intelligence-based data analysis technique to bio-signal data, which are continuous physiological records, such as EEG (electroencephalography) and ECG (electrocardiogram). Unlike the other types of data, applying deep learning to bio-signal data, which is in the form of time series of real numbers, has many issues that need to be resolved in preprocessing, learning, and analysis. Such issues include leaving feature selection, learning parts that are black boxes, difficulties in recognizing and identifying effective features, high computational complexities, etc. In this paper, to solve these issues, we provide an encoding-based Wave2vec time series classifier model, which combines signal-processing and deep learning-based natural language processing techniques. To demonstrate its advantages, we provide the results of three experiments conducted with EEG data of the University of California Irvine, which are a real-world benchmark bio-signal dataset. After converting the bio-signals (in the form of waves), which are a real number time series, into a sequence of symbols or a sequence of wavelet patterns that are converted into symbols, through encoding, the proposed model vectorizes the symbols by learning the sequence using deep learning-based natural language processing. The models of each class can be constructed through learning from the vectorized wavelet patterns and training data. The implemented models can be used for prediction and diagnosis of diseases by classifying the new data. The proposed method enhanced data readability and intuition of feature selection and learning processes by converting the time series of real number data into sequences of symbols. In addition, it facilitates intuitive and easy recognition, and identification of influential patterns. Furthermore, real-time large-capacity data analysis is facilitated, which is essential in the development of real-time analysis diagnosis systems, by drastically reducing the complexity of calculation without deterioration of analysis performance by data simplification through the encoding process.

...read moreread less

22 citations

Cites methods from "An artificial neural network based ..."

...Document classification, another instance of the sequence classification problem, is performed by using the order of words as a major feature [25], a sentimental analysis is performed by using the sentiment lexicons of words [26], and the pattern-based sequence classification is studied in more general areas [27]....
[...]

Analyzing sentiment in Indian languages micro text using recurrent neural network

[...]

S.a b Seshadri, A.K.a b Madasamy, S.K.a b Padannayil, M. Anand Kumar

01 Jan 2016

TL;DR: The system performs well for recurrent neural network when compared with the system submitted to the shared task as the accuracy of the system had increased and the network seeks to pursue sentiment oriented feature which improves in analyzing the sentiments on tweets.

...read moreread less

Abstract: This paper aims at improving the system which is submitted to the shared task on Sentiment Analysis in Indian Languages (SAIL2015) at MIKE 2015. In this work the tweets are classified into three polarity category namely positive, negative and neutral. Twitter data of three languages namely Tamil, Hindi and Bengali are already provided by SAIL 2015 task organizers as we have participated in the contest. Recurrent neural network is used for analyzing the sentiment in the tweets. The system performs well for recurrent neural network when compared with the system submitted to the shared task as the accuracy of the system had increased. This is due to the fact that the recurrent neural network concentrates more on language specific feature. In training, the recurrent neural network tries to learn based on the error that are generated as intermediate output. By this way the network seeks to pursue sentiment oriented feature which improves in analyzing the sentiments on tweets. We have obtained a state accuracy for the proposed system, where we achieved an accuracy of 88%, 72.01% and 65.16% for Tamil, Hindi and Bengali languages respectively for SAIL 2015 dataset.

...read moreread less

19 citations

Cites methods from "An artificial neural network based ..."

...In this work along with BPANN, it uses domain knowledge which are available in sentiment lexicon [8]....
[...]

Journal Article•DOI•

Sexual harassment in academe is underreported, especially by students in the life and physical sciences.

[...]

Stephen J. Aguilar¹, Clare Baek¹•Institutions (1)

University of Southern California¹

10 Mar 2020-PLOS ONE

TL;DR: The results suggest that institutional and departmental barriers driven by power asymmetries play a large role in the underreporting of sexual harassment among students—especially those in STEM disciplines.

...read moreread less

Abstract: What factors predict the underreporting of sexual harassment in academe? We used logistic regression and sentiment analysis to examine 2,343 reports of sexual harassment involving members of university communities. Results indicate students were 1.6 times likely to not report their experiences when compared to faculty. Respondents in the life and physical sciences were 1.7 times more likely to not report their experiences when compared to respondents in other disciplines. Men represented 90% of the reported perpetrators of sexual harassment. Analysis of respondents' written accounts show variation of overall sentiment based on discipline, student type, and the type of institution attended, particularly with regard to mental health. Our results suggest that institutional and departmental barriers driven by power asymmetries play a large role in the underreporting sexual harassment among students-especially those in STEM disciplines.

...read moreread less

19 citations

Proceedings Article•DOI•

Topic Model Based Opinion Mining and Sentiment Analysis

[...]

Krishna B Vamshi¹, Ajeet Kumar Pandey, Kumar A. P. Siva¹•Institutions (1)

Jawaharlal Nehru Technological University, Anantapur¹

01 Jan 2018

TL;DR: A new topic model based approach for opinion mining and sentiment analysis of text reviews posted in web forums or social media site which are mostly in unstructured in nature is discussed.

...read moreread less

Abstract: This paper discusses a new topic model based approach for opinion mining and sentiment analysis of text reviews posted in web forums or social media site which are mostly in unstructured in nature. In recent years, opinions are exchanged in clouds about any product, person, event or any interested topic. These opinions help in decision making for choosing a product or getting feedback about any topic. Opinion mining and sentiment analysis are related in a sense that opining mining deals with analyzing and summarizing expressed opinions whereas sentiment analysis classifies opinionated text into positive and negative. Aspect extraction is a crucial problem in sentiment analysis. Model proposed in the paper utilizes topic model for aspect extraction and support vector machine learning technique for sentiment classification of textual reviews. The goal is to automate the process of mining attitudes, opinions and hidden emotions from text.

...read moreread less

12 citations

Cites methods from "An artificial neural network based ..."

...On reviewing literature, it is found that various supervised machine learning algorithms like Naïve Bayes [12] [13] Support Vector Machines[14] [15] and Neural Networks [16] have been used for opinion mining of text to classify positive and negative sentiment....
[...]

Proceedings Article•DOI•

An experimental study based on Fuzzy Systems and Artificial Neural Networks to estimate the importance of reviews about product and services

[...]

Roney Lira de Sales Santos¹, Rogerio F. de Sousa¹, Ricardo A. L. Rabelo¹, Raimundo Santos Moura¹•Institutions (1)

Federal University of Piauí¹

01 Jul 2016

TL;DR: This work proposes an experimental study between their approach using Fuzzy Systems and an execution using Artificial Neural Network to verify which is the most appropriate to solve the problem to estimate the importance of reviews.

...read moreread less

Abstract: With the evolution of e-commerce and Online Social Networks, the web information has constantly increased, so the relevance to create methods for automatic knowledge extraction and data mining earned notoriety. Information as opinion evaluation is a point studied by Sentiment Analysis area, which is becoming important nowadays. Be aware of the best reviews is a factor that must be taken into account. Sousa et al. proposed an approach to estimate the degree of importance of reviews about product and services using Fuzzy System, reporting good results. This work proposes an experimental study between their approach using Fuzzy Systems and an execution using Artificial Neural Network to verify which is the most appropriate to solve the problem to estimate the importance of reviews.

...read moreread less

12 citations

Cites methods from "An artificial neural network based ..."

...Sharma and Dey [10] proposed a sentiment classification model using backpropagation artificial neural network, using information gain and three popular sentiment lexicons to extract sentiment representing features that are then used to train and test the network....
[...]

1
2
3
4
…
5
6
7

Collapse

References

PDF

Open Access

More filters

Proceedings Article•DOI•

Mining and summarizing customer reviews

[...]

Minqing Hu¹, Bing Liu¹•Institutions (1)

University of Illinois at Chicago¹

22 Aug 2004

TL;DR: This research aims to mine and to summarize all the customer reviews of a product, and proposes several novel techniques to perform these tasks.

...read moreread less

Abstract: Merchants selling products on the Web often ask their customers to review the products that they have purchased and the associated services. As e-commerce is becoming more and more popular, the number of customer reviews that a product receives grows rapidly. For a popular product, the number of reviews can be in hundreds or even thousands. This makes it difficult for a potential customer to read them to make an informed decision on whether to purchase the product. It also makes it difficult for the manufacturer of the product to keep track and to manage customer opinions. For the manufacturer, there are additional difficulties because many merchant sites may sell the same product and the manufacturer normally produces many kinds of products. In this research, we aim to mine and to summarize all the customer reviews of a product. This summarization task is different from traditional text summarization because we only mine the features of the product on which the customers have expressed their opinions and whether the opinions are positive or negative. We do not summarize the reviews by selecting a subset or rewrite some of the original sentences from the reviews to capture the main points as in the classic text summarization. Our task is performed in three steps: (1) mining product features that have been commented on by customers; (2) identifying opinion sentences in each review and deciding whether each opinion sentence is positive or negative; (3) summarizing the results. This paper proposes several novel techniques to perform these tasks. Our experimental results using reviews of a number of products sold online demonstrate the effectiveness of the techniques.

...read moreread less

7,330 citations

"An artificial neural network based ..." refers background in this paper

...Positive and negative sentiment based summaries for product features from reviews were proposed by Hu and Liu (2004)....
[...]

Thumbs up? Sentiment Classiflcation using Machine Learning Techniques

[...]

Bo Pang, Lillian Lee, Shivakumar Vaithyanathan

01 Jan 2002

TL;DR: In this paper, the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative, was considered and three machine learning methods (Naive Bayes, maximum entropy classiflcation, and support vector machines) were employed.

...read moreread less

Abstract: We consider the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative. Using movie reviews as data, we flnd that standard machine learning techniques deflnitively outperform human-produced baselines. However, the three machine learning methods we employed (Naive Bayes, maximum entropy classiflcation, and support vector machines) do not perform as well on sentiment classiflcation as on traditional topic-based categorization. We conclude by examining factors that make the sentiment classiflcation problem more challenging.

...read moreread less

6,980 citations

Proceedings Article•DOI•

Thumbs up? Sentiment Classification using Machine Learning Techniques

[...]

Bo Pang¹, Lillian Lee¹, Shivakumar Vaithyanathan²•Institutions (2)

Cornell University¹, IBM²

06 Jul 2002

TL;DR: This work considers the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative, and concludes by examining factors that make the sentiment classification problem more challenging.

...read moreread less

Abstract: We consider the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative. Using movie reviews as data, we find that standard machine learning techniques definitively outperform human-produced baselines. However, the three machine learning methods we employed (Naive Bayes, maximum entropy classification, and support vector machines) do not perform as well on sentiment classification as on traditional topic-based categorization. We conclude by examining factors that make the sentiment classification problem more challenging.

...read moreread less

6,626 citations

Posted Content•

Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews

[...]

Peter D. Turney¹•Institutions (1)

National Research Council¹

11 Dec 2002-arXiv: Learning

TL;DR: A simple unsupervised learning algorithm for classifying reviews as recommended (thumbs up) or not recommended (Thumbs down) if the average semantic orientation of its phrases is positive.

...read moreread less

Abstract: This paper presents a simple unsupervised learning algorithm for classifying reviews as recommended (thumbs up) or not recommended (thumbs down). The classification of a review is predicted by the average semantic orientation of the phrases in the review that contain adjectives or adverbs. A phrase has a positive semantic orientation when it has good associations (e.g., "subtle nuances") and a negative semantic orientation when it has bad associations (e.g., "very cavalier"). In this paper, the semantic orientation of a phrase is calculated as the mutual information between the given phrase and the word "excellent" minus the mutual information between the given phrase and the word "poor". A review is classified as recommended if the average semantic orientation of its phrases is positive. The algorithm achieves an average accuracy of 74% when evaluated on 410 reviews from Epinions, sampled from four different domains (reviews of automobiles, banks, movies, and travel destinations). The accuracy ranges from 84% for automobile reviews to 66% for movie reviews.

...read moreread less

4,526 citations

Journal Article•DOI•

The Effect of Word of Mouth on Sales: Online Book Reviews

[...]

Judith A. Chevalier¹, Dina Mayzlin¹•Institutions (1)

Yale University¹

01 Aug 2006-Journal of Marketing Research

TL;DR: The authors examine the effect of consumer reviews on relative sales of books at Amazon.com and Barnesandnoble.com, and find that reviews are overwhelmingly positive at both sites, but there are more reviews and longer reviews at Amazon and that an improvement in a book's reviews leads to an increase in relative sales.

...read moreread less

Abstract: The authors examine the effect of consumer reviews on relative sales of books at Amazon.com and Barnesandnoble.com. The authors find that (1) reviews are overwhelmingly positive at both sites, but there are more reviews and longer reviews at Amazon.com; (2) an improvement in a book's reviews leads to an increase in relative sales at that site; (3) for most samples in the study, the impact of one-star reviews is greater than the impact of five-star reviews; and (4) evidence from review-length data suggests that customers read review text rather than relying only on summary statistics.

...read moreread less

4,180 citations