Author

Jiangming Liu

Bio: Jiangming Liu is an academic researcher from the University of Edinburgh. The author has contributed to research on topics including parsing and tree structure. The author has an h-index of 13 and has co-authored 27 publications receiving 699 citations. Previous affiliations of Jiangming Liu include Beijing Jiaotong University and the Singapore University of Technology and Design.

Papers
Proceedings ArticleDOI
01 Nov 2020
TL;DR: Proposes a more rigorous annotation paradigm for NLP that helps to close systematic gaps in the test data: after a dataset is constructed, its authors manually perturb the test instances in small but meaningful ways that (typically) change the gold label, creating contrast sets.
Abstract: Standard test sets for supervised learning evaluate in-distribution generalization. Unfortunately, when a dataset has systematic gaps (e.g., annotation artifacts), these evaluations are misleading: a model can learn simple decision rules that perform well on the test set but do not capture the abilities a dataset is intended to test. We propose a more rigorous annotation paradigm for NLP that helps to close systematic gaps in the test data. In particular, after a dataset is constructed, we recommend that the dataset authors manually perturb the test instances in small but meaningful ways that (typically) change the gold label, creating contrast sets. Contrast sets provide a local view of a model’s decision boundary, which can be used to more accurately evaluate a model’s true linguistic capabilities. We demonstrate the efficacy of contrast sets by creating them for 10 diverse NLP datasets (e.g., DROP reading comprehension, UD parsing, and IMDb sentiment analysis). Although our contrast sets are not explicitly adversarial, model performance is significantly lower on them than on the original test sets—up to 25% in some cases. We release our contrast sets as new evaluation benchmarks and encourage future dataset construction efforts to follow similar annotation processes.
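
To make the paradigm concrete, below is a minimal sketch of contrast-set evaluation in Python; the instances and the predict() stub are hypothetical illustrations, not drawn from the released benchmarks:

```python
# Minimal sketch of contrast-set evaluation; the instances and the
# predict() stub are hypothetical, not the released data.

def predict(text):
    # Stand-in for a trained sentiment classifier with a shallow heuristic.
    return "positive" if "great" in text else "negative"

# Each original test instance is paired with small, manual perturbations
# that (typically) flip the gold label.
contrast_sets = [
    {
        "original": ("The acting was great and the plot held together.", "positive"),
        "contrasts": [
            ("The acting was great but the plot fell apart.", "negative"),
        ],
    },
]

# Contrast consistency: the fraction of sets in which the model labels
# the original AND every perturbation correctly.
consistent = sum(
    all(predict(text) == gold for text, gold in [s["original"]] + s["contrasts"])
    for s in contrast_sets
)
print(f"contrast consistency: {consistent / len(contrast_sets):.2f}")
# -> 0.00 here: the heuristic passes the original but fails the perturbation.
```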

250 citations

Proceedings ArticleDOI
01 Apr 2017
TL;DR: Results show that using attention to model the contribution of each word in a sentence with respect to the target gives significantly improved results on two standard benchmarks.
Abstract: Neural network models have been used for target-dependent sentiment analysis. Previous work focuses on learning a target-specific representation for a given input sentence, which is used for classification. However, it does not explicitly model the contribution of each word in a sentence with respect to targeted sentiment polarities. We investigate an attention model to this end. In particular, a vanilla LSTM model is used to induce an attention value over the whole sentence. The model is further extended to differentiate left and right contexts given a certain target, following previous work. Results show that by using attention to model the contribution of each word with respect to the target, our model gives significantly improved results on two standard benchmarks. We report the best accuracy for this task.
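
A rough sketch of the core idea, target-conditioned attention over LSTM states; the bidirectional encoder, dimensions, and scoring layer below are illustrative assumptions rather than the paper's exact architecture:

```python
import torch
import torch.nn as nn

class TargetAttention(nn.Module):
    """Attention over LSTM states, conditioned on a target representation."""

    def __init__(self, emb_dim=100, hid_dim=100):
        super().__init__()
        self.lstm = nn.LSTM(emb_dim, hid_dim, batch_first=True, bidirectional=True)
        # Scores each hidden state against the target vector.
        self.score = nn.Linear(2 * hid_dim + emb_dim, 1)

    def forward(self, words, target):
        # words:  (batch, seq_len, emb_dim) embedded sentence
        # target: (batch, emb_dim) e.g. averaged target-word embeddings
        h, _ = self.lstm(words)                            # (B, T, 2H)
        t = target.unsqueeze(1).expand(-1, h.size(1), -1)  # (B, T, E)
        scores = self.score(torch.cat([h, t], dim=-1)).squeeze(-1)
        attn = torch.softmax(scores, dim=-1)               # per-word contribution
        return torch.bmm(attn.unsqueeze(1), h).squeeze(1)  # (B, 2H) sentence vector
```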

194 citations

Posted Content
06 Apr 2020
TL;DR: A new annotation paradigm for NLP is proposed that helps to close systematic gaps in the test data, and it is recommended that after a dataset is constructed, the dataset authors manually perturb the test instances in small but meaningful ways that change the gold label, creating contrast sets.
Abstract: Standard test sets for supervised learning evaluate in-distribution generalization. Unfortunately, when a dataset has systematic gaps (e.g., annotation artifacts), these evaluations are misleading: a model can learn simple decision rules that perform well on the test set but do not capture a dataset's intended capabilities. We propose a new annotation paradigm for NLP that helps to close systematic gaps in the test data. In particular, after a dataset is constructed, we recommend that the dataset authors manually perturb the test instances in small but meaningful ways that (typically) change the gold label, creating contrast sets. Contrast sets provide a local view of a model's decision boundary, which can be used to more accurately evaluate a model's true linguistic capabilities. We demonstrate the efficacy of contrast sets by creating them for 10 diverse NLP datasets (e.g., DROP reading comprehension, UD parsing, IMDb sentiment analysis). Although our contrast sets are not explicitly adversarial, model performance is significantly lower on them than on the original test sets---up to 25\% in some cases. We release our contrast sets as new evaluation benchmarks and encourage future dataset construction efforts to follow similar annotation processes.

82 citations

Journal ArticleDOI
TL;DR: Proposes a novel parsing system based on in-order traversal over syntactic trees, designing a set of transition actions that find a compromise between bottom-up constituent information and top-down lookahead information.
Abstract: Both bottom-up and top-down strategies have been used for neural transition-based constituent parsing. The parsing strategies differ in terms of the order in which they recognize productions in the derivation tree, where bottom-up strategies and top-down strategies take post-order and pre-order traversal over trees, respectively. Bottom-up parsers benefit from rich features from readily built partial parses, but lack lookahead guidance in the parsing process; top-down parsers benefit from non-local guidance for local decisions, but rely on a strong encoder over the input to predict a constituent hierarchy before its construction. To mitigate both issues, we propose a novel parsing system based on in-order traversal over syntactic trees, designing a set of transition actions to find a compromise between bottom-up constituent information and top-down lookahead information. Based on stack-LSTM, our psycholinguistically motivated constituent parsing system achieves 91.8 F1 on the WSJ benchmark. Furthermore, the system achieves 93.6 F1 with supervised reranking and 94.2 F1 with semi-supervised reranking, which are the best results on the WSJ benchmark.
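
The in-order system is easiest to see through a small oracle that reads actions off a tree; the encoding and action names (SHIFT, PJ-X, REDUCE) below are simplified for exposition, not the paper's exact transition system:

```python
# Illustrative oracle: convert a constituent tree into an in-order action
# sequence. The first child is processed bottom-up before the constituent's
# label is projected (PJ-X), giving the compromise between strategies.

def in_order_actions(tree):
    # A tree is either a word (str) or a pair (label, [children]).
    if isinstance(tree, str):
        return ["SHIFT"]                       # terminals are shifted
    label, children = tree
    actions = in_order_actions(children[0])    # first child, bottom-up
    actions.append(f"PJ-{label}")              # then project the label
    for child in children[1:]:                 # remaining children follow
        actions += in_order_actions(child)
    actions.append("REDUCE")                   # close the constituent
    return actions

# (S (NP the cat) (VP sleeps))
tree = ("S", [("NP", ["the", "cat"]), ("VP", ["sleeps"])])
print(in_order_actions(tree))
# ['SHIFT', 'PJ-NP', 'SHIFT', 'REDUCE', 'PJ-S', 'SHIFT', 'PJ-VP', 'REDUCE', 'REDUCE']
```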

80 citations


Cited by
Journal ArticleDOI
TL;DR: Deep learning has emerged as a powerful machine learning technique that learns multiple layers of representations or features of the data and produces state-of-the-art prediction results; as this paper surveys, it has also become widely used in sentiment analysis in recent years.
Abstract: Deep learning has emerged as a powerful machine learning technique that learns multiple layers of representations or features of the data and produces state-of-the-art prediction results. Along with its success in many other application domains, deep learning has also been widely used in sentiment analysis in recent years. This paper first gives an overview of deep learning and then provides a comprehensive survey of its current applications in sentiment analysis.

917 citations

Proceedings ArticleDOI
02 May 2018
TL;DR: Replacing an LSTM encoder with a self-attentive architecture yields a state-of-the-art discriminative constituency parser, achieving 93.55 F1 on the Penn Treebank without the use of any external data.
Abstract: We demonstrate that replacing an LSTM encoder with a self-attentive architecture can lead to improvements to a state-of-the-art discriminative constituency parser. The use of attention makes explicit the manner in which information is propagated between different locations in the sentence, which we use to both analyze our model and propose potential improvements. For example, we find that separating positional and content information in the encoder can lead to improved parsing accuracy. Additionally, we evaluate different approaches for lexical representation. Our parser achieves new state-of-the-art results for single models trained on the Penn Treebank: 93.55 F1 without the use of any external data, and 95.13 F1 when using pre-trained word representations. Our parser also outperforms the previous best-published accuracy figures on 8 of the 9 languages in the SPMRL dataset.
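
One way to picture the "separating positional and content information" finding: give the two vector types independent attention scores and sum them. The factorization below is a simplified assumption, not the parser's exact formulation:

```python
import torch

def factored_attention(content, position, w_c, w_p):
    # content, position: (T, d) per-token content / position vectors
    # w_c, w_p: (d, d) separate projections for the two information types
    score_c = content @ w_c @ content.T    # content attends to content
    score_p = position @ w_p @ position.T  # position attends to position
    attn = torch.softmax(score_c + score_p, dim=-1)
    # Values also keep the two halves separate rather than mixing them.
    return attn @ torch.cat([content, position], dim=-1)

T, d = 5, 8
out = factored_attention(torch.randn(T, d), torch.randn(T, d),
                         torch.randn(d, d), torch.randn(d, d))
print(out.shape)  # torch.Size([5, 16])
```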

380 citations

Proceedings ArticleDOI
01 Jan 2018
TL;DR: Experimental results show that the multi-grained attention network consistently outperforms state-of-the-art methods on all three datasets, and the effectiveness of the aspect alignment loss indicates that aspect-level interactions bring extra useful information and further improve performance.
Abstract: We propose a novel multi-grained attention network (MGAN) model for aspect level sentiment classification. Existing approaches mostly adopt coarse-grained attention mechanism, which may bring information loss if the aspect has multiple words or larger context. We propose a fine-grained attention mechanism, which can capture the word-level interaction between aspect and context. And then we leverage the fine-grained and coarse-grained attention mechanisms to compose the MGAN framework. Moreover, unlike previous works which train each aspect with its context separately, we design an aspect alignment loss to depict the aspect-level interactions among the aspects that have the same context. We evaluate the proposed approach on three datasets: laptop and restaurant are from SemEval 2014, and the last one is a twitter dataset. Experimental results show that the multi-grained attention network consistently outperforms the state-of-the-art methods on all three datasets. We also conduct experiments to evaluate the effectiveness of aspect alignment loss, which indicates the aspect-level interactions can bring extra useful information and further improve the performance.
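
A minimal sketch of the fine-grained (word-level) attention between aspect and context described above; the dot-product scoring and max-pooling directions are illustrative assumptions, not the exact MGAN equations:

```python
import torch

def fine_grained_attention(context, aspect):
    # context: (n, d) context word vectors; aspect: (m, d) aspect word vectors
    u = context @ aspect.T                                          # (n, m) word-pair scores
    # Each context word attends via its best-matching aspect word, and
    # vice versa, capturing word-level aspect-context interaction.
    ctx_pool = torch.softmax(u.max(dim=1).values, dim=0) @ context  # (d,)
    asp_pool = torch.softmax(u.max(dim=0).values, dim=0) @ aspect   # (d,)
    return torch.cat([ctx_pool, asp_pool])

print(fine_grained_attention(torch.randn(6, 50), torch.randn(2, 50)).shape)
# torch.Size([100])
```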

314 citations

Proceedings ArticleDOI
03 May 2018
TL;DR: The authors propose a new model that employs a CNN layer to extract salient features from transformed word representations originating from a bi-directional RNN layer, achieving new state-of-the-art results.
Abstract: Target-oriented sentiment classification aims at classifying sentiment polarities over individual opinion targets in a sentence. RNN with attention seems a good fit for the characteristics of this task, and indeed it achieves the state-of-the-art performance. After re-examining the drawbacks of attention mechanism and the obstacles that block CNN to perform well in this classification task, we propose a new model that achieves new state-of-the-art results on a few benchmarks. Instead of attention, our model employs a CNN layer to extract salient features from the transformed word representations originated from a bi-directional RNN layer. Between the two layers, we propose a component which first generates target-specific representations of words in the sentence, and then incorporates a mechanism for preserving the original contextual information from the RNN layer.
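
A hedged sketch of the pipeline the abstract describes: a bi-directional RNN, a component producing target-specific word representations, then a CNN extracting salient features. Layer sizes and the tanh transformation are assumptions, not the authors' exact design:

```python
import torch
import torch.nn as nn

class TargetCNN(nn.Module):
    def __init__(self, emb_dim=100, hid=50, n_classes=3):
        super().__init__()
        self.rnn = nn.LSTM(emb_dim, hid, batch_first=True, bidirectional=True)
        # Produces target-specific representations between the two layers.
        self.transform = nn.Linear(2 * hid + emb_dim, 2 * hid)
        self.conv = nn.Conv1d(2 * hid, 2 * hid, kernel_size=3, padding=1)
        self.out = nn.Linear(2 * hid, n_classes)

    def forward(self, words, target):
        # words: (B, T, emb_dim) sentence; target: (B, emb_dim) opinion target
        h, _ = self.rnn(words)                                 # (B, T, 2*hid)
        t = target.unsqueeze(1).expand(-1, h.size(1), -1)
        z = torch.tanh(self.transform(torch.cat([h, t], -1)))  # target-specific reps
        c = torch.relu(self.conv(z.transpose(1, 2)))           # CNN over the sequence
        return self.out(c.max(dim=2).values)                   # max-pool salient features
```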

282 citations