Home
/
Authors
/
An Thanh Nguyen

Author

An Thanh Nguyen

Bio: An Thanh Nguyen is an academic researcher from University of Texas at Austin. The author has contributed to research in topics: Deep learning & Artificial neural network. The author has an hindex of 7, co-authored 10 publications receiving 369 citations.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Neural information retrieval: at the end of the early years

[...]

Kezban Dilek Onal¹, Kezban Dilek Onal², Ye Zhang³, Ismail Sengor Altingovde², Md. Mustafizur Rahman³, Pinar Karagoz², Alexander Braylan³, Brandon Dang³, Heng-Lu Chang³, Henna Kim³, Quinten McNamara³, Aaron Angert⁴, Edward Banner⁵, Vivek Khetan³, Tyler McDonnell³, An Thanh Nguyen³, Dan Xu³, Byron C. Wallace⁵, Maarten de Rijke¹, Matthew Lease³ - Show less +16 more•Institutions (5)

University of Amsterdam¹, Middle East Technical University², University of Texas at Austin³, IBM⁴, Northeastern University⁵

01 Jun 2018-Information Retrieval

TL;DR: The successes of neural IR thus far are highlighted, obstacles to its wider adoption are cataloged, and potentially promising directions for future research are suggested.

...read moreread less

Abstract: A recent “third wave” of neural network (NN) approaches now delivers state-of-the-art performance in many machine learning tasks, spanning speech recognition, computer vision, and natural language processing. Because these modern NNs often comprise multiple interconnected layers, work in this area is often referred to as deep learning. Recent years have witnessed an explosive growth of research into NN-based approaches to information retrieval (IR). A significant body of work has now been created. In this paper, we survey the current landscape of Neural IR research, paying special attention to the use of learned distributed representations of textual units. We highlight the successes of neural IR thus far, catalog obstacles to its wider adoption, and suggest potentially promising directions for future research.

...read moreread less

124 citations

Proceedings Article•DOI•

Aggregating and Predicting Sequence Labels from Crowd Annotations.

[...]

An Thanh Nguyen¹, Byron C. Wallace², Junyi Jessy Li³, Ani Nenkova³, Matthew Lease¹ - Show less +1 more•Institutions (3)

University of Texas at Austin¹, Northeastern University², University of Pennsylvania³

01 Jan 2017

TL;DR: A suite of methods for aggregating sequential crowd labels to infer a best single set of consensus annotations and using crowd annotations as training data for a model that can predict sequences in unannotated text are evaluated.

...read moreread less

Abstract: Despite sequences being core to NLP, scant work has considered how to handle noisy sequence labels from multiple annotators for the same text. Given such annotations, we consider two complementary tasks: (1) aggregating sequential crowd labels to infer a best single set of consensus annotations; and (2) using crowd annotations as training data for a model that can predict sequences in unannotated text. For aggregation, we propose a novel Hidden Markov Model variant. To predict sequences in unannotated text, we propose a neural approach using Long Short Term Memory. We evaluate a suite of methods across two different applications and text genres: Named-Entity Recognition in news articles and Information Extraction from biomedical abstracts. Results show improvement over strong baselines. Our source code and data are available online.

...read moreread less

81 citations

Proceedings Article•

Combining Crowd and Expert Labels Using Decision Theoretic Active Learning

[...]

An Thanh Nguyen¹, Byron C. Wallace¹, Matthew Lease¹•Institutions (1)

University of Texas at Austin¹

23 Sep 2015

TL;DR: A hybrid human-machine approach which blends automatic machine learning with human labeling across a tiered workforce composed of domain experts and crowd workers is adopted to effectively achieve high-accuracy labels over the instances in the pool at minimal cost.

...read moreread less

Abstract: We consider a finite-pool data categorization scenario which requires exhaustively classifying a given set of examples with a limited budget. We adopt a hybrid human-machine approach which blends automatic machine learning with human labeling across a tiered workforce composed of domain experts and crowd workers. To effectively achieve high-accuracy labels over the instances in the pool at minimal cost, we develop a novel approach based on decision-theoretic active learning. On the important task of biomedical citation screening for systematic reviews, results on real data show that our method achieves consistent improvements over baseline strategies. To foster further research by others, we have made our data available online.

...read moreread less

74 citations

Proceedings Article•DOI•

Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact-Checking

[...]

An Thanh Nguyen¹, Aditya Kharosekar¹, Saumyaa Krishnan¹, Siddhesh Krishnan¹, Elizabeth Tate¹, Byron C. Wallace², Matthew Lease¹ - Show less +3 more•Institutions (2)

University of Texas at Austin¹, Northeastern University²

11 Oct 2018

TL;DR: In a user study in which participants used the design and evaluation of the system to aid their own assessment of claims, the results suggest that individuals tend to trust the system: participant accuracy assessing claims improved when exposed to correct model predictions.

...read moreread less

Abstract: Fact-checking, the task of assessing the veracity of claims, is an important, timely, and challenging problem. While many automated fact-checking systems have been recently proposed, the human side of the partnership has been largely neglected: how might people understand, interact with, and establish trust with an AI fact-checking system? Does such a system actually help people better assess the factuality of claims? In this paper, we present the design and evaluation of a mixed-initiative approach to fact-checking, blending human knowledge and experience with the efficiency and scalability of automated information retrieval and ML. In a user study in which participants used our system to aid their own assessment of claims, our results suggest that individuals tend to trust the system: participant accuracy assessing claims improved when exposed to correct model predictions. However, this trust perhaps goes too far: when the model was wrong, exposure to its predictions often degraded human accuracy. Participants given the option to interact with these incorrect predictions were often able improve their own performance. This suggests that transparent models are key to facilitating effective human interaction with fallible AI models.

...read moreread less

63 citations

Posted Content•

Neural Information Retrieval: A Literature Review

[...]

Ye Zhang, Md. Mustafizur Rahman, Alexander Braylan, Brandon Dang, Heng-Lu Chang, Henna Kim, Quinten McNamara, Aaron Angert, Edward Banner, Vivek Khetan, Tyler McDonnell, An Thanh Nguyen, Dan Xu, Byron C. Wallace, Matthew Lease - Show less +11 more

18 Nov 2016-arXiv: Information Retrieval

TL;DR: The current landscape of Neural IR research is surveyed, paying special attention to the use of learned representations of queries and documents (i.e., neural embeddings), to highlight the successes and obstacles to its wider adoption, and suggest potentially promising directions for future research.

...read moreread less

Abstract: A recent "third wave" of Neural Network (NN) approaches now delivers state-of-the-art performance in many machine learning tasks, spanning speech recognition, computer vision, and natural language processing. Because these modern NNs often comprise multiple interconnected layers, this new NN research is often referred to as deep learning. Stemming from this tide of NN work, a number of researchers have recently begun to investigate NN approaches to Information Retrieval (IR). While deep NNs have yet to achieve the same level of success in IR as seen in other areas, the recent surge of interest and work in NNs for IR suggest that this state of affairs may be quickly changing. In this work, we survey the current landscape of Neural IR research, paying special attention to the use of learned representations of queries and documents (i.e., neural embeddings). We highlight the successes of neural IR thus far, catalog obstacles to its wider adoption, and suggest potentially promising directions for future research.

...read moreread less

45 citations

Cited by

PDF

Open Access

More filters

Posted Content•

Deep Reinforcement Learning: An Overview

[...]

Yuxi Li

25 Jan 2017-arXiv: Learning

TL;DR: This work discusses core RL elements, including value function, in particular, Deep Q-Network (DQN), policy, reward, model, planning, and exploration, and important mechanisms for RL, including attention and memory, unsupervised learning, transfer learning, multi-agent RL, hierarchical RL, and learning to learn.

...read moreread less

Abstract: We give an overview of recent exciting achievements of deep reinforcement learning (RL). We discuss six core elements, six important mechanisms, and twelve applications. We start with background of machine learning, deep learning and reinforcement learning. Next we discuss core RL elements, including value function, in particular, Deep Q-Network (DQN), policy, reward, model, planning, and exploration. After that, we discuss important mechanisms for RL, including attention and memory, unsupervised learning, transfer learning, multi-agent RL, hierarchical RL, and learning to learn. Then we discuss various applications of RL, including games, in particular, AlphaGo, robotics, natural language processing, including dialogue systems, machine translation, and text generation, computer vision, neural architecture design, business management, finance, healthcare, Industry 4.0, smart grid, intelligent transportation systems, and computer systems. We mention topics not reviewed yet, and list a collection of RL resources. After presenting a brief summary, we close with discussions. Please see Deep Reinforcement Learning, arXiv:1810.06339, for a significant update.

...read moreread less

935 citations

Posted Content•

Learning from Noisy Labels with Deep Neural Networks: A Survey

[...]

Hwanjun Song¹, Minseok Kim², Dongmin Park², Jae-Gil Lee²•Institutions (2)

Naver Corporation¹, KAIST²

16 Jul 2020-arXiv: Learning

TL;DR: A comprehensive review of 62 state-of-the-art robust training methods, all of which are categorized into five groups according to their methodological difference, followed by a systematic comparison of six properties used to evaluate their superiority.

...read moreread less

Abstract: Deep learning has achieved remarkable success in numerous domains with help from large amounts of big data. However, the quality of data labels is a concern because of the lack of high-quality labels in many real-world scenarios. As noisy labels severely degrade the generalization performance of deep neural networks, learning from noisy labels (robust training) is becoming an important task in modern deep learning applications. In this survey, we first describe the problem of learning with label noise from a supervised learning perspective. Next, we provide a comprehensive review of 46 state-of-the-art robust training methods, all of which are categorized into seven groups according to their methodological difference, followed by a systematic comparison of six properties used to evaluate their superiority. Subsequently, we summarize the typically used evaluation methodology, including public noisy datasets and evaluation metrics. Finally, we present several promising research directions that can serve as a guideline for future studies.

...read moreread less

474 citations

Journal Article•

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

[...]

Adithya Renduchintala, Rebecca Knowles, Philipp Koehn, Jason Eisner

01 Aug 2016-The Association for Computational Linguistics

434 citations

Posted Content•

Pretrained Transformers for Text Ranking: BERT and Beyond

[...]

Jimmy Lin¹, Rodrigo Nogueira¹, Andrew Yates²•Institutions (2)

University of Waterloo¹, University of Amsterdam²

13 Oct 2020-arXiv: Information Retrieval

TL;DR: This tutorial provides an overview of text ranking with neural network architectures known as transformers, of which BERT (Bidirectional Encoder Representations from Transformers) is the best-known example, and covers a wide range of techniques.

...read moreread less

Abstract: The goal of text ranking is to generate an ordered list of texts retrieved from a corpus in response to a query. Although the most common formulation of text ranking is search, instances of the task can also be found in many natural language processing applications. This survey provides an overview of text ranking with neural network architectures known as transformers, of which BERT is the best-known example. The combination of transformers and self-supervised pretraining has been responsible for a paradigm shift in natural language processing (NLP), information retrieval (IR), and beyond. In this survey, we provide a synthesis of existing work as a single point of entry for practitioners who wish to gain a better understanding of how to apply transformers to text ranking problems and researchers who wish to pursue work in this area. We cover a wide range of modern techniques, grouped into two high-level categories: transformer models that perform reranking in multi-stage architectures and dense retrieval techniques that perform ranking directly. There are two themes that pervade our survey: techniques for handling long documents, beyond typical sentence-by-sentence processing in NLP, and techniques for addressing the tradeoff between effectiveness (i.e., result quality) and efficiency (e.g., query latency, model and index size). Although transformer architectures and pretraining techniques are recent innovations, many aspects of how they are applied to text ranking are relatively well understood and represent mature techniques. However, there remain many open research questions, and thus in addition to laying out the foundations of pretrained transformers for text ranking, this survey also attempts to prognosticate where the field is heading.

...read moreread less

315 citations

Journal Article•

Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

[...]

Ankur P. Parikh, Shay B. Cohen, Eric P. Xing

01 Jun 2014-The Association for Computational Linguistics

297 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91

Collapse