scispace - formally typeset
Search or ask a question
Proceedings ArticleDOI

An artificial neural network based approach for sentiment analysis of opinionated text

23 Oct 2012-pp 37-42
TL;DR: A sentiment classification model using back-propagation artificial neural network (BPANN) is proposed that combines the strength of BPANN in classification accuracy with utilizing intrinsic domain knowledge available in the sentiment lexicons.
Abstract: The Internet and Web 2.0 social media have emerged as an important medium for expressing sentiments, opinions, evaluations, and reviews. Sentiment analysis or opinion mining is becoming an open research domain due to the abundance of discussion forums, Weblogs, e-commerce portals, social networking and content sharing sites where people tend to express their opinions. Sentiment Analysis involves classifying text documents based on the opinion expressed being positive or negative about a given topic. This paper proposes a sentiment classification model using back-propagation artificial neural network (BPANN). Information Gain and three popular sentiment lexicons are used to extract sentiment representing features that are then used to train and test the BPANN. This novel approach combines the strength of BPANN in classification accuracy with utilizing intrinsic domain knowledge available in the sentiment lexicons. The results obtained on the movie-review corpora have shown that the proposed approach has been able to reduce dimensionality, while producing accurate sentiment based classification of text.
Citations
More filters
Journal ArticleDOI
TL;DR: An encoding-based Wave2vec time series classifier model, which combines signal-processing and deep learning-based natural language processing techniques, is provided, which facilitates intuitive and easy recognition, and identification of influential patterns.
Abstract: Interest in research involving health-medical information analysis based on artificial intelligence, especially for deep learning techniques, has recently been increasing. Most of the research in this field has been focused on searching for new knowledge for predicting and diagnosing disease by revealing the relation between disease and various information features of data. These features are extracted by analyzing various clinical pathology data, such as EHR (electronic health records), and academic literature using the techniques of data analysis, natural language processing, etc. However, still needed are more research and interest in applying the latest advanced artificial intelligence-based data analysis technique to bio-signal data, which are continuous physiological records, such as EEG (electroencephalography) and ECG (electrocardiogram). Unlike the other types of data, applying deep learning to bio-signal data, which is in the form of time series of real numbers, has many issues that need to be resolved in preprocessing, learning, and analysis. Such issues include leaving feature selection, learning parts that are black boxes, difficulties in recognizing and identifying effective features, high computational complexities, etc. In this paper, to solve these issues, we provide an encoding-based Wave2vec time series classifier model, which combines signal-processing and deep learning-based natural language processing techniques. To demonstrate its advantages, we provide the results of three experiments conducted with EEG data of the University of California Irvine, which are a real-world benchmark bio-signal dataset. After converting the bio-signals (in the form of waves), which are a real number time series, into a sequence of symbols or a sequence of wavelet patterns that are converted into symbols, through encoding, the proposed model vectorizes the symbols by learning the sequence using deep learning-based natural language processing. The models of each class can be constructed through learning from the vectorized wavelet patterns and training data. The implemented models can be used for prediction and diagnosis of diseases by classifying the new data. The proposed method enhanced data readability and intuition of feature selection and learning processes by converting the time series of real number data into sequences of symbols. In addition, it facilitates intuitive and easy recognition, and identification of influential patterns. Furthermore, real-time large-capacity data analysis is facilitated, which is essential in the development of real-time analysis diagnosis systems, by drastically reducing the complexity of calculation without deterioration of analysis performance by data simplification through the encoding process.

22 citations


Cites methods from "An artificial neural network based ..."

  • ...Document classification, another instance of the sequence classification problem, is performed by using the order of words as a major feature [25], a sentimental analysis is performed by using the sentiment lexicons of words [26], and the pattern-based sequence classification is studied in more general areas [27]....

    [...]

01 Jan 2016
TL;DR: The system performs well for recurrent neural network when compared with the system submitted to the shared task as the accuracy of the system had increased and the network seeks to pursue sentiment oriented feature which improves in analyzing the sentiments on tweets.
Abstract: This paper aims at improving the system which is submitted to the shared task on Sentiment Analysis in Indian Languages (SAIL2015) at MIKE 2015. In this work the tweets are classified into three polarity category namely positive, negative and neutral. Twitter data of three languages namely Tamil, Hindi and Bengali are already provided by SAIL 2015 task organizers as we have participated in the contest. Recurrent neural network is used for analyzing the sentiment in the tweets. The system performs well for recurrent neural network when compared with the system submitted to the shared task as the accuracy of the system had increased. This is due to the fact that the recurrent neural network concentrates more on language specific feature. In training, the recurrent neural network tries to learn based on the error that are generated as intermediate output. By this way the network seeks to pursue sentiment oriented feature which improves in analyzing the sentiments on tweets. We have obtained a state accuracy for the proposed system, where we achieved an accuracy of 88%, 72.01% and 65.16% for Tamil, Hindi and Bengali languages respectively for SAIL 2015 dataset.

19 citations


Cites methods from "An artificial neural network based ..."

  • ...In this work along with BPANN, it uses domain knowledge which are available in sentiment lexicon [8]....

    [...]

Journal ArticleDOI
10 Mar 2020-PLOS ONE
TL;DR: The results suggest that institutional and departmental barriers driven by power asymmetries play a large role in the underreporting of sexual harassment among students—especially those in STEM disciplines.
Abstract: What factors predict the underreporting of sexual harassment in academe? We used logistic regression and sentiment analysis to examine 2,343 reports of sexual harassment involving members of university communities. Results indicate students were 1.6 times likely to not report their experiences when compared to faculty. Respondents in the life and physical sciences were 1.7 times more likely to not report their experiences when compared to respondents in other disciplines. Men represented 90% of the reported perpetrators of sexual harassment. Analysis of respondents' written accounts show variation of overall sentiment based on discipline, student type, and the type of institution attended, particularly with regard to mental health. Our results suggest that institutional and departmental barriers driven by power asymmetries play a large role in the underreporting sexual harassment among students-especially those in STEM disciplines.

19 citations

Proceedings ArticleDOI
01 Jan 2018
TL;DR: A new topic model based approach for opinion mining and sentiment analysis of text reviews posted in web forums or social media site which are mostly in unstructured in nature is discussed.
Abstract: This paper discusses a new topic model based approach for opinion mining and sentiment analysis of text reviews posted in web forums or social media site which are mostly in unstructured in nature. In recent years, opinions are exchanged in clouds about any product, person, event or any interested topic. These opinions help in decision making for choosing a product or getting feedback about any topic. Opinion mining and sentiment analysis are related in a sense that opining mining deals with analyzing and summarizing expressed opinions whereas sentiment analysis classifies opinionated text into positive and negative. Aspect extraction is a crucial problem in sentiment analysis. Model proposed in the paper utilizes topic model for aspect extraction and support vector machine learning technique for sentiment classification of textual reviews. The goal is to automate the process of mining attitudes, opinions and hidden emotions from text.

12 citations


Cites methods from "An artificial neural network based ..."

  • ...On reviewing literature, it is found that various supervised machine learning algorithms like Naïve Bayes [12] [13] Support Vector Machines[14] [15] and Neural Networks [16] have been used for opinion mining of text to classify positive and negative sentiment....

    [...]

Proceedings ArticleDOI
01 Jul 2016
TL;DR: This work proposes an experimental study between their approach using Fuzzy Systems and an execution using Artificial Neural Network to verify which is the most appropriate to solve the problem to estimate the importance of reviews.
Abstract: With the evolution of e-commerce and Online Social Networks, the web information has constantly increased, so the relevance to create methods for automatic knowledge extraction and data mining earned notoriety. Information as opinion evaluation is a point studied by Sentiment Analysis area, which is becoming important nowadays. Be aware of the best reviews is a factor that must be taken into account. Sousa et al. proposed an approach to estimate the degree of importance of reviews about product and services using Fuzzy System, reporting good results. This work proposes an experimental study between their approach using Fuzzy Systems and an execution using Artificial Neural Network to verify which is the most appropriate to solve the problem to estimate the importance of reviews.

12 citations


Cites methods from "An artificial neural network based ..."

  • ...Sharma and Dey [10] proposed a sentiment classification model using backpropagation artificial neural network, using information gain and three popular sentiment lexicons to extract sentiment representing features that are then used to train and test the network....

    [...]

References
More filters
Proceedings ArticleDOI
22 Aug 2004
TL;DR: This research aims to mine and to summarize all the customer reviews of a product, and proposes several novel techniques to perform these tasks.
Abstract: Merchants selling products on the Web often ask their customers to review the products that they have purchased and the associated services. As e-commerce is becoming more and more popular, the number of customer reviews that a product receives grows rapidly. For a popular product, the number of reviews can be in hundreds or even thousands. This makes it difficult for a potential customer to read them to make an informed decision on whether to purchase the product. It also makes it difficult for the manufacturer of the product to keep track and to manage customer opinions. For the manufacturer, there are additional difficulties because many merchant sites may sell the same product and the manufacturer normally produces many kinds of products. In this research, we aim to mine and to summarize all the customer reviews of a product. This summarization task is different from traditional text summarization because we only mine the features of the product on which the customers have expressed their opinions and whether the opinions are positive or negative. We do not summarize the reviews by selecting a subset or rewrite some of the original sentences from the reviews to capture the main points as in the classic text summarization. Our task is performed in three steps: (1) mining product features that have been commented on by customers; (2) identifying opinion sentences in each review and deciding whether each opinion sentence is positive or negative; (3) summarizing the results. This paper proposes several novel techniques to perform these tasks. Our experimental results using reviews of a number of products sold online demonstrate the effectiveness of the techniques.

7,330 citations


"An artificial neural network based ..." refers background in this paper

  • ...Positive and negative sentiment based summaries for product features from reviews were proposed by Hu and Liu (2004)....

    [...]

01 Jan 2002
TL;DR: In this paper, the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative, was considered and three machine learning methods (Naive Bayes, maximum entropy classiflcation, and support vector machines) were employed.
Abstract: We consider the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative. Using movie reviews as data, we flnd that standard machine learning techniques deflnitively outperform human-produced baselines. However, the three machine learning methods we employed (Naive Bayes, maximum entropy classiflcation, and support vector machines) do not perform as well on sentiment classiflcation as on traditional topic-based categorization. We conclude by examining factors that make the sentiment classiflcation problem more challenging.

6,980 citations

Proceedings ArticleDOI
06 Jul 2002
TL;DR: This work considers the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative, and concludes by examining factors that make the sentiment classification problem more challenging.
Abstract: We consider the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative. Using movie reviews as data, we find that standard machine learning techniques definitively outperform human-produced baselines. However, the three machine learning methods we employed (Naive Bayes, maximum entropy classification, and support vector machines) do not perform as well on sentiment classification as on traditional topic-based categorization. We conclude by examining factors that make the sentiment classification problem more challenging.

6,626 citations

Posted Content
TL;DR: A simple unsupervised learning algorithm for classifying reviews as recommended (thumbs up) or not recommended (Thumbs down) if the average semantic orientation of its phrases is positive.
Abstract: This paper presents a simple unsupervised learning algorithm for classifying reviews as recommended (thumbs up) or not recommended (thumbs down). The classification of a review is predicted by the average semantic orientation of the phrases in the review that contain adjectives or adverbs. A phrase has a positive semantic orientation when it has good associations (e.g., "subtle nuances") and a negative semantic orientation when it has bad associations (e.g., "very cavalier"). In this paper, the semantic orientation of a phrase is calculated as the mutual information between the given phrase and the word "excellent" minus the mutual information between the given phrase and the word "poor". A review is classified as recommended if the average semantic orientation of its phrases is positive. The algorithm achieves an average accuracy of 74% when evaluated on 410 reviews from Epinions, sampled from four different domains (reviews of automobiles, banks, movies, and travel destinations). The accuracy ranges from 84% for automobile reviews to 66% for movie reviews.

4,526 citations

Journal ArticleDOI
TL;DR: The authors examine the effect of consumer reviews on relative sales of books at Amazon.com and Barnesandnoble.com, and find that reviews are overwhelmingly positive at both sites, but there are more reviews and longer reviews at Amazon and that an improvement in a book's reviews leads to an increase in relative sales.
Abstract: The authors examine the effect of consumer reviews on relative sales of books at Amazon.com and Barnesandnoble.com. The authors find that (1) reviews are overwhelmingly positive at both sites, but there are more reviews and longer reviews at Amazon.com; (2) an improvement in a book's reviews leads to an increase in relative sales at that site; (3) for most samples in the study, the impact of one-star reviews is greater than the impact of five-star reviews; and (4) evidence from review-length data suggests that customers read review text rather than relying only on summary statistics.

4,180 citations