scispace - formally typeset
Author

Jin Wang

Other affiliations: Yunnan University
Bio: Jin Wang is an academic researcher from Yuan Ze University. The author has contributed to research in topics: Computer science & Sentiment analysis. The author has an h-index of 12, co-authored 24 publications receiving 893 citations. Previous affiliations of Jin Wang include Yunnan University.

Papers
Proceedings ArticleDOI
07 Aug 2016
TL;DR: A regional CNN-LSTM model consisting of two parts, a regional CNN and an LSTM, is proposed to predict the VA ratings of texts; experiments show that the proposed method outperforms lexicon-based, regression-based, and NN-based methods proposed in previous studies.
Abstract: Dimensional sentiment analysis aims to recognize continuous numerical values in multiple dimensions such as the valence-arousal (VA) space. Compared to the categorical approach that focuses on sentiment classification such as binary classification (i.e., positive and negative), the dimensional approach can provide more fine-grained sentiment analysis. This study proposes a regional CNN-LSTM model consisting of two parts: regional CNN and LSTM to predict the VA ratings of texts. Unlike a conventional CNN which considers a whole text as input, the proposed regional CNN uses an individual sentence as a region, dividing an input text into several regions such that the useful affective information in each region can be extracted and weighted according to its contribution to the VA prediction. Such regional information is sequentially integrated across regions using LSTM for VA prediction. By combining the regional CNN and LSTM, both local (regional) information within sentences and long-distance dependency across sentences can be considered in the prediction process. Experimental results show that the proposed method outperforms lexicon-based, regression-based, and NN-based methods proposed in previous studies.
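The data flow described above can be sketched as follows. This is a toy illustration only, not the paper's trained model: each sentence (region) is encoded by a CNN-like windowing-and-pooling step, then the region vectors are integrated sequentially by an LSTM-like blending step. All word vectors and constants are fabricated; real models use learned convolution filters, gates, and a final regression layer.

```python
# Toy sketch of the regional CNN-LSTM data flow (illustration only).

def encode_region(word_vecs, window=2):
    """CNN-like step: slide a window over the words, then max-pool."""
    dim = len(word_vecs[0])
    feats = []
    for i in range(len(word_vecs) - window + 1):
        # windowed average stands in for a learned convolution filter
        feats.append([sum(v[d] for v in word_vecs[i:i + window]) / window
                      for d in range(dim)])
    if not feats:  # region shorter than the window
        feats = [word_vecs[0]]
    return [max(f[d] for f in feats) for d in range(dim)]  # max pooling

def integrate_regions(region_vecs, decay=0.5):
    """LSTM-like step: blend region vectors sequentially into one state."""
    state = [0.0] * len(region_vecs[0])
    for r in region_vecs:
        state = [decay * s + (1 - decay) * x for s, x in zip(state, r)]
    return state

# Two "sentences", each a list of 3-d word vectors (made-up numbers).
text = [
    [[0.2, 0.8, 0.1], [0.9, 0.3, 0.4], [0.5, 0.5, 0.5]],
    [[0.1, 0.2, 0.9], [0.4, 0.7, 0.6]],
]
regions = [encode_region(s) for s in text]
state = integrate_regions(regions)
# A final regression layer would map `state` to a (valence, arousal) pair.
print([round(x, 4) for x in state])
```

The key design point this mirrors is that each region is encoded independently before sequential integration, so intra-sentence features and cross-sentence order are handled by separate components.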

428 citations

Proceedings ArticleDOI
01 Sep 2017
TL;DR: The proposed word vector refinement model is based on adjusting the vector representations of words such that they can be closer to both semantically and sentimentally similar words and further away from sentimentally dissimilar words.
Abstract: Word embeddings that can capture semantic and syntactic information from contexts have been extensively used for various natural language processing tasks. However, existing methods for learning context-based word embeddings typically fail to capture sufficient sentiment information. This may result in words with similar vector representations having an opposite sentiment polarity (e.g., good and bad), thus degrading sentiment analysis performance. Therefore, this study proposes a word vector refinement model that can be applied to any pre-trained word vectors (e.g., Word2vec and GloVe). The refinement model is based on adjusting the vector representations of words such that they can be closer to both semantically and sentimentally similar words and further away from sentimentally dissimilar words. Experimental results show that the proposed method can improve conventional word embeddings and outperform previously proposed sentiment embeddings for both binary and fine-grained classification on Stanford Sentiment Treebank (SST).
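A minimal sketch of the refinement idea follows: nudge each pre-trained word vector toward the average of its sentimentally consistent neighbours, so that "good" drifts away from "bad" even if they start close. The vectors, neighbour lists, step size, and iteration count are all fabricated for illustration; the paper's actual objective and weighting scheme differ in detail.

```python
# Sketch: iterative refinement pulls each vector toward its
# sentimentally consistent neighbours (fabricated data).

def refine(vectors, neighbours, alpha=0.1, iters=10):
    """One step per iteration: v <- (1-alpha)*v + alpha*mean(neighbours)."""
    vecs = {w: list(v) for w, v in vectors.items()}
    for _ in range(iters):
        new = {}
        for w, v in vecs.items():
            nbrs = neighbours.get(w, [])
            if not nbrs:
                new[w] = v
                continue
            mean = [sum(vecs[n][d] for n in nbrs) / len(nbrs)
                    for d in range(len(v))]
            new[w] = [(1 - alpha) * x + alpha * m for x, m in zip(v, mean)]
        vecs = new
    return vecs

# "good" starts near "bad" (a typical failure of context-based embeddings)
# but its neighbours pull it toward other positive words.
vectors = {
    "good":  [0.50, 0.50],
    "bad":   [0.52, 0.48],
    "great": [0.90, 0.10],
    "nice":  [0.85, 0.15],
    "awful": [0.10, 0.90],
}
neighbours = {"good": ["great", "nice"], "bad": ["awful"]}
refined = refine(vectors, neighbours)

def dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

before = dist(vectors["good"], vectors["bad"])
after = dist(refined["good"], refined["bad"])
print(before < after)  # refinement separates "good" from "bad"
```

Because the update only mixes existing vectors, the method applies to any pre-trained embedding, which is the property the abstract emphasizes.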

183 citations

Journal ArticleDOI
TL;DR: A word vector refinement model is proposed to refine existing pretrained word vectors using real-valued sentiment intensity scores provided by sentiment lexicons, improving each word vector so that it is closer to semantically and sentimentally similar words in the lexicon.
Abstract: Word embeddings that provide continuous low-dimensional vector representations of words have been extensively used for various natural language processing tasks. However, existing context-based word embeddings such as Word2vec and GloVe typically fail to capture sufficient sentiment information, which may result in words with similar vector representations having an opposite sentiment polarity (e.g., good and bad ), thus degrading sentiment analysis performance. To tackle this problem, recent studies have suggested learning sentiment embeddings to incorporate the sentiment polarity (positive and negative) information from labeled corpora. This study adopts another strategy to learn sentiment embeddings. Instead of creating a new word embedding from labeled corpora, we propose a word vector refinement model to refine existing pretrained word vectors using real-valued sentiment intensity scores provided by sentiment lexicons. The idea of the refinement model is to improve each word vector such that it can be closer to both semantically and sentimentally similar words in the lexicon (i.e., those with similar intensity scores) and further away from sentimentally dissimilar words (i.e., those with dissimilar intensity scores). An obvious advantage of the proposed method is that it can be applied to any pretrained word embeddings. In addition, the intensity scores can provide more fine-grained (real-valued) sentiment information than binary polarity labels to guide the refinement process. Experimental results show that the proposed refinement model can improve both conventional word embeddings and previously proposed sentiment embeddings for binary, ternary, and fine-grained sentiment classification on the SemEval and Stanford Sentiment Treebank datasets.
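The role of the real-valued intensity scores can be sketched as a neighbour re-ranking step: among a word's semantic candidates, keep only those whose lexicon intensity is closest to the word's own, so refinement pulls it toward words that are both semantically and sentimentally similar. The scores, candidate list, and function name `sentiment_neighbours` are invented for illustration and simplify the paper's actual ranking.

```python
# Sketch: re-rank semantic neighbour candidates by closeness of
# real-valued sentiment intensity (fabricated 1-9 valence-style scores).

def sentiment_neighbours(word, candidates, intensity, k=2):
    """Keep the k candidates whose intensity is closest to the word's."""
    ranked = sorted(candidates,
                    key=lambda c: abs(intensity[c] - intensity[word]))
    return ranked[:k]

intensity = {"good": 7.0, "great": 8.2, "nice": 7.5, "bad": 2.5, "poor": 3.0}
# candidates as a context-based embedding might rank them semantically
candidates = ["bad", "great", "poor", "nice"]
print(sentiment_neighbours("good", candidates, intensity))
# → ['nice', 'great']
```

The real-valued scores let the ranking distinguish degrees of positivity (e.g., "nice" vs. "great"), which binary polarity labels cannot, matching the abstract's argument for intensity-guided refinement.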

135 citations

Proceedings ArticleDOI
01 Jun 2016
TL;DR: A corpus cleanup procedure is used to remove outlier ratings and improper texts, and experiments using CVAW words to predict the VA ratings of the CVAT corpus show results comparable to those obtained using English affective resources.
Abstract: An increasing amount of research has recently focused on representing affective states as continuous numerical values on multiple dimensions, such as the valence-arousal (VA) space. Compared to the categorical approach that represents affective states as several classes (e.g., positive and negative), the dimensional approach can provide more fine-grained sentiment analysis. However, affective resources with valence-arousal ratings are still very rare, especially for the Chinese language. Therefore, this study builds 1) an affective lexicon called Chinese valence-arousal words (CVAW) containing 1,653 words, and 2) an affective corpus called Chinese valence-arousal text (CVAT) containing 2,009 sentences extracted from web texts. To improve the annotation quality, a corpus cleanup procedure is used to remove outlier ratings and improper texts. Experiments using CVAW words to predict the VA ratings of the CVAT corpus show results comparable to those obtained using English affective resources.
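Using lexicon words to predict the VA ratings of a text can be sketched as the simplest baseline: average the lexicon's (valence, arousal) ratings of the words that appear in the sentence. The tiny English-keyed lexicon below is fabricated for illustration; CVAW itself holds 1,653 annotated Chinese words, and the paper's experiments go beyond plain averaging.

```python
# Sketch of a lexicon-based VA baseline: average the (valence, arousal)
# ratings of in-lexicon words (fabricated mini-lexicon).

def predict_va(tokens, lexicon):
    hits = [lexicon[t] for t in tokens if t in lexicon]
    if not hits:
        return None  # no lexicon coverage for this text
    valence = sum(h[0] for h in hits) / len(hits)
    arousal = sum(h[1] for h in hits) / len(hits)
    return (valence, arousal)

lexicon = {"happy": (7.5, 6.0), "calm": (6.5, 2.0), "angry": (2.0, 7.8)}
print(predict_va(["so", "happy", "and", "calm"], lexicon))  # → (7.0, 4.0)
```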

134 citations

Journal ArticleDOI
TL;DR: A tree-structured regional CNN-LSTM model consisting of two parts, a regional CNN and an LSTM, is proposed to predict the VA ratings of texts; experiments show that the proposed method outperforms lexicon-based, regression-based, conventional NN, and other structured NN methods proposed in previous studies.
Abstract: Dimensional sentiment analysis aims to recognize continuous numerical values in multiple dimensions such as the valence-arousal (VA) space. Compared to the categorical approach that focuses on sentiment classification such as binary classification (i.e., positive and negative), the dimensional approach can provide a more fine-grained sentiment analysis. This article proposes a tree-structured regional CNN-LSTM model consisting of two parts: regional CNN and LSTM to predict the VA ratings of texts. Unlike a conventional CNN which considers a whole text as input, the proposed regional CNN uses a part of the text as a region, dividing an input text into several regions such that the useful affective information in each region can be extracted and weighted according to its contribution to the VA prediction. Such regional information is sequentially integrated across regions using LSTM for VA prediction. By combining the regional CNN and LSTM, both local (regional) information within sentences and long-distance dependencies across sentences can be considered in the prediction process. To further improve performance, a region division strategy is proposed to discover task-relevant phrases and clauses to incorporate structured information into VA prediction. Experimental results on different corpora show that the proposed method outperforms lexicon-based, regression-based, conventional NN, and other structured NN methods proposed in previous studies.
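One plausible region-division step can be sketched as splitting a text into clause-level regions at punctuation, so each region can be encoded separately before sequential integration. Note the paper's strategy discovers task-relevant phrases and clauses; this rule-based split is only an illustration of the input/output shape of such a step.

```python
# Sketch: rule-based division of a text into clause-level regions
# (a stand-in for the paper's learned region division strategy).
import re

def divide_regions(text):
    parts = re.split(r"[,.;!?]", text)
    return [p.strip() for p in parts if p.strip()]

text = "The plot was slow at first, but the ending was thrilling."
print(divide_regions(text))
# → ['The plot was slow at first', 'but the ending was thrilling']
```

Splitting at clause boundaries matters for VA prediction because the two clauses above carry opposite valence, and encoding them as separate regions lets the sequential model weight each one's contribution.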

66 citations


Cited by
Journal ArticleDOI
TL;DR: This paper reviews significant deep learning-related models and methods that have been employed for numerous NLP tasks and provides a walk-through of their evolution.
Abstract: Deep learning methods employ multiple processing layers to learn hierarchical representations of data, and have produced state-of-the-art results in many domains. Recently, a variety of model designs and methods have blossomed in the context of natural language processing (NLP). In this paper, we review significant deep learning related models and methods that have been employed for numerous NLP tasks and provide a walk-through of their evolution. We also summarize, compare and contrast the various models and put forward a detailed understanding of the past, present and future of deep learning in NLP.

2,466 citations

Proceedings ArticleDOI
01 Aug 2017
TL;DR: Crowdsourcing on Amazon Mechanical Turk was used to label a large Twitter training dataset along with additional test sets of Twitter and SMS messages for two subtasks: A, an expression-level subtask, and B, a message-level subtask.
Abstract: This paper describes the fifth year of the Sentiment Analysis in Twitter task. SemEval-2017 Task 4 continues with a rerun of the subtasks of SemEval-2016 Task 4, which include identifying the overall sentiment of the tweet, sentiment towards a topic with classification on a two-point and on a five-point ordinal scale, and quantification of the distribution of sentiment towards a topic across a number of tweets: again on a two-point and on a five-point ordinal scale. Compared to 2016, we made two changes: (i) we introduced a new language, Arabic, for all subtasks, and (ii) we made available information from the profiles of the Twitter users who posted the target tweets. The task continues to be very popular, with a total of 48 teams participating this year.

1,107 citations

Journal ArticleDOI
TL;DR: Deep learning has emerged as a powerful machine learning technique that learns multiple layers of representations or features of the data and produces state-of-the-art prediction results, and in recent years it has also been widely applied to sentiment analysis.
Abstract: Deep learning has emerged as a powerful machine learning technique that learns multiple layers of representations or features of the data and produces state-of-the-art prediction results. Along with the success of deep learning in many other application domains, deep learning is also popularly used in sentiment analysis in recent years. This paper first gives an overview of deep learning and then provides a comprehensive survey of its current applications in sentiment analysis.

917 citations

Proceedings ArticleDOI
01 Jun 2016
TL;DR: This paper discusses the fourth year of the Sentiment Analysis in Twitter task: SemEval-2016 Task 4 comprises five subtasks, three of which represent a significant departure from previous editions, with the new subtasks focusing on two variants of the basic sentiment classification in Twitter task.
Abstract: This paper discusses the fourth year of the “Sentiment Analysis in Twitter” task. SemEval-2016 Task 4 comprises five subtasks, three of which represent a significant departure from previous editions. The first two subtasks are reruns from prior years and ask to predict the overall sentiment, and the sentiment towards a topic in a tweet. The three new subtasks focus on two variants of the basic “sentiment classification in Twitter” task. The first variant adopts a five-point scale, which confers an ordinal character to the classification task. The second variant focuses on the correct estimation of the prevalence of each class of interest, a task which has been called quantification in the supervised learning literature. The task continues to be very popular, attracting a total of 43 teams.
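The quantification variant mentioned above can be sketched with its simplest baseline, often called "classify and count": classify each tweet individually and report the class proportions. The keyword classifier and tweets below are fabricated for illustration; participating systems used far stronger classifiers and adjusted counting methods.

```python
# Sketch of the "classify and count" quantification baseline:
# estimate class prevalence by classifying each tweet and counting.
from collections import Counter

def classify(tweet):
    # trivial keyword rule, purely for illustration
    return "positive" if "love" in tweet or "great" in tweet else "negative"

def prevalence(tweets):
    counts = Counter(classify(t) for t in tweets)
    total = len(tweets)
    return {label: counts[label] / total
            for label in ("positive", "negative")}

tweets = ["love this phone", "great battery", "screen cracked", "love it"]
print(prevalence(tweets))  # → {'positive': 0.75, 'negative': 0.25}
```

The output is a distribution over classes rather than per-tweet labels, which is exactly what distinguishes quantification from ordinary classification.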

702 citations

Journal ArticleDOI
01 Sep 2019-Energy
TL;DR: This paper proposes a CNN-LSTM neural network that extracts spatial and temporal features to effectively predict housing energy consumption, achieving almost perfect prediction performance for electric energy consumption that was previously difficult to predict.

677 citations