Home
/
Authors
/
Inho Kang

Author

Inho Kang

Other affiliations: KAIST, Samsung

Bio: Inho Kang is an academic researcher from Naver Corporation. The author has contributed to research in topics: Computer science & Sentence. The author has an hindex of 10, co-authored 27 publications receiving 984 citations. Previous affiliations of Inho Kang include KAIST & Samsung.

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Query type classification for web document retrieval

[...]

Inho Kang, Gil-Chang Kim

28 Jul 2003

TL;DR: A user query classification scheme that uses the difference of distribution, mutual information, the usage rate as anchor texts, and the POS information for the classification and could get the best performance when the OKAPI scoring algorithm was used.

...read moreread less

Abstract: The heterogeneous Web exacerbates IR problems and short user queries make them worse. The contents of web documents are not enough to find good answer documents. Link information and URL information compensates for the insufficiencies of content information. However, static combination of multiple evidences may lower the retrieval performance. We need different strategies to find target documents according to a query type. We can classify user queries as three categories, the topic relevance task, the homepage finding task, and the service finding task. In this paper, a user query classification scheme is proposed. This scheme uses the difference of distribution, mutual information, the usage rate as anchor texts, and the POS information for the classification. After we classified a user query, we apply different algorithms and information for the better results. For the topic relevance task, we emphasize the content information, on the other hand, for the homepage finding task, we emphasize the Link information and the URL information. We could get the best performance when our proposed classification method with the OKAPI scoring algorithm was used.

...read moreread less

295 citations

Patent•

Apparatus for providing voice dialogue service and method of operating the same

[...]

Byung Kwan Kwak¹, Cho Jeong Mi¹, Inho Kang¹•Institutions (1)

Samsung¹

28 Aug 2006

TL;DR: In this paper, a speech dialogue service apparatus including a language analysis module tagging a part of speech (POS) of each word included in a sentence recorded in a predetermined text, syntactically analyzing the sentence by classifying a meaning of each respective word, and generating at least one semantic frame corresponding to the sentence according to a result of the syntactical analysis was presented.

...read moreread less

Abstract: A speech dialogue service apparatus including: a language analysis module tagging a part of speech (POS) of each respective word included in a sentence recorded in a predetermined text, syntactically analyzing the sentence by classifying a meaning of each respective word, and generating at least one semantic frame corresponding to the sentence according to a result of the syntactical analysis; and a dialogue management module analyzing an intention of the sentence corresponding to the at least one respective semantic frame, and generating a system response corresponding to the sentence intention by selecting a predetermined sentence intention according to whether an action corresponding to the intention of the respective sentence can be performed.

...read moreread less

220 citations

Journal Article•DOI•

Semantic Sentence Matching with Densely-connected Recurrent and Co-attentive Information

[...]

Seonhoon Kim¹, Inho Kang¹, Nojun Kwak²•Institutions (2)

Naver Corporation¹, Seoul National University²

17 Jul 2019

TL;DR: A densely-connected co-attentive recurrent neural network, each layer of which uses concatenated information of attentive features as well as hidden features of all the preceding recurrent layers, which achieves state-of-the-art performances for most of the tasks.

...read moreread less

Abstract: Sentence matching is widely used in various natural language tasks such as natural language inference, paraphrase identification, and question answering. For these tasks, understanding logical and semantic relationship between two sentences is required but it is yet challenging. Although attention mechanism is useful to capture the semantic relationship and to properly align the elements of two sentences, previous methods of attention mechanism simply use a summation operation which does not retain original features enough. Inspired by DenseNet, a densely connected convolutional network, we propose a densely-connected co-attentive recurrent neural network, each layer of which uses concatenated information of attentive features as well as hidden features of all the preceding recurrent layers. It enables preserving the original and the co-attentive feature information from the bottommost word embedding layer to the uppermost recurrent layer. To alleviate the problem of an ever-increasing size of feature vectors due to dense concatenation operations, we also propose to use an autoencoder after dense concatenation. We evaluate our proposed architecture on highly competitive benchmark datasets related to sentence matching. Experimental results show that our architecture, which retains recurrent and attentive features, achieves state-of-the-art performances for most of the tasks.

...read moreread less

142 citations

Patent•

Apparatus and method for detecting named entity

[...]

Jae-won Lee¹, Inho Kang¹, Jeong-Su Kim¹, Haechang Rim¹•Institutions (1)

Samsung¹

03 Aug 2006

TL;DR: In this article, an apparatus and method for detecting a named entity is presented, which includes a candidate-named-entity extraction module that detects a candidate named entity based on an initial learning example and feature information regarding morphemes constituting an inputted sentence, a storage module that stores information regarding a named-entity dictionary and a rule, and a learning-example-regeneration module for finally determining whether the candidate-entity included in the provided sentence is a valid named entity.

...read moreread less

Abstract: An apparatus and method for detecting a named-entity. The apparatus includes a candidate-named-entity extraction module that detects a candidate-named-entity based on an initial learning example and feature information regarding morphemes constituting an inputted sentence, the candidate-named-entity extraction module providing a tagged sentence including the detected candidate-named-entity; a storage module that stores information regarding a named-entity dictionary and a rule; and a learning-example-regeneration module for finally determining whether the candidate-named-entity included in the provided sentence is a valid named-entity, based on the named-entity dictionary and the rule, the learning-example-regeneration module providing the sentence as a learning example, based on a determination result, so that a probability of candidate-named-entity detection is gradually updated.

...read moreread less

124 citations

Posted Content•

Semantic Sentence Matching with Densely-connected Recurrent and Co-attentive Information

[...]

Seonhoon Kim¹, Inho Kang¹, Nojun Kwak²•Institutions (2)

Naver Corporation¹, Seoul National University²

29 May 2018-arXiv: Computation and Language

TL;DR: The authors proposed a densely-connected co-attentive recurrent neural network (C-RNN), which uses concatenated information of attentive features as well as hidden features of all the preceding recurrent layers.

...read moreread less

107 citations

1
2
3
4
…
5
6
7

Collapse

Cited by

PDF

Open Access

More filters

Book•

Learning to Rank for Information Retrieval

[...]

Tie-Yan Liu¹•Institutions (1)

Microsoft¹

27 Jun 2009

TL;DR: Three major approaches to learning to rank are introduced, i.e., the pointwise, pairwise, and listwise approaches, the relationship between the loss functions used in these approaches and the widely-used IR evaluation measures are analyzed, and the performance of these approaches on the LETOR benchmark datasets is evaluated.

...read moreread less

Abstract: This tutorial is concerned with a comprehensive introduction to the research area of learning to rank for information retrieval. In the first part of the tutorial, we will introduce three major approaches to learning to rank, i.e., the pointwise, pairwise, and listwise approaches, analyze the relationship between the loss functions used in these approaches and the widely-used IR evaluation measures, evaluate the performance of these approaches on the LETOR benchmark datasets, and demonstrate how to use these approaches to solve real ranking applications. In the second part of the tutorial, we will discuss some advanced topics regarding learning to rank, such as relational ranking, diverse ranking, semi-supervised ranking, transfer ranking, query-dependent ranking, and training data preprocessing. In the third part, we will briefly mention the recent advances on statistical learning theory for ranking, which explain the generalization ability and statistical consistency of different ranking methods. In the last part, we will conclude the tutorial and show several future research directions.

...read moreread less

2,515 citations

Patent•

Intelligent Automated Assistant

[...]

Thomas R. Gruber¹, Adam Cheyer¹, Dag Kittlaus¹, Didier Rene Guzzoni¹, Christopher Dean Brigham¹, Richard Donald Giuli¹, Marcello Bastea-Forte¹, Harry J. Saddler¹ - Show less +4 more•Institutions (1)

Apple Inc.¹

11 Jan 2011

TL;DR: In this article, an intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions.

...read moreread less

Abstract: An intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions. The system can be implemented using any of a number of different platforms, such as the web, email, smartphone, and the like, or any combination thereof. In one embodiment, the system is based on sets of interrelated domains and tasks, and employs additional functionally powered by external services with which the system can interact.

...read moreread less

1,462 citations

Patent•

Automated Response to and Sensing of User Activity in Portable Devices

[...]

Brian Q. Huppi¹, Anthony M. Fadell¹, Derek Boyd Barrentine¹, Daniel Freeman¹•Institutions (1)

Apple Inc.¹

19 Oct 2007

TL;DR: In this paper, various methods and devices described herein relate to devices which, in at least certain embodiments, may include one or more sensors for providing data relating to user activity and at least one processor for causing the device to respond based on the user activity which was determined, at least in part, through the sensors.

...read moreread less

Abstract: The various methods and devices described herein relate to devices which, in at least certain embodiments, may include one or more sensors for providing data relating to user activity and at least one processor for causing the device to respond based on the user activity which was determined, at least in part, through the sensors. The response by the device may include a change of state of the device, and the response may be automatically performed after the user activity is determined.

...read moreread less

844 citations

Journal Article•DOI•

Translation techniques in cross-language information retrieval

[...]

Dong Zhou¹, Mark Truran², Tim Brailsford³, Vincent Wade⁴, Helen Ashman⁵ - Show less +1 more•Institutions (5)

Hunan University of Science and Technology¹, Teesside University², University of Nottingham³, Trinity College, Dublin⁴, University of South Australia⁵

07 Dec 2012-ACM Computing Surveys

TL;DR: Over the last 15 years, the CLIR community has developed a wide range of techniques and models supporting free text translation, with a special emphasis on recent developments.

...read moreread less

Abstract: Cross-language information retrieval (CLIR) is an active sub-domain of information retrieval (IR). Like IR, CLIR is centered on the search for documents and for information contained within those documents. Unlike IR, CLIR must reconcile queries and documents that are written in different languages. The usual solution to this mismatch involves translating the query and/or the documents before performing the search. Translation is therefore a pivotal activity for CLIR engines. Over the last 15 years, the CLIR community has developed a wide range of techniques and models supporting free text translation. This article presents an overview of those techniques, with a special emphasis on recent developments.

...read moreread less

720 citations

Proceedings Article•DOI•

Multi-Task Deep Neural Networks for Natural Language Understanding

[...]

Xiaodong Liu¹, Pengcheng He¹, Weizhu Chen¹, Jianfeng Gao¹•Institutions (1)

Microsoft¹

31 Jan 2019

TL;DR: The authors proposed a multi-task deep neural network (MT-DNN) for learning representations across multiple natural language understanding (NLU) tasks, which not only leverages large amounts of cross-task data, but also benefits from a regularization effect that leads to more general representations to help adapt to new tasks and domains.

...read moreread less

Abstract: In this paper, we present a Multi-Task Deep Neural Network (MT-DNN) for learning representations across multiple natural language understanding (NLU) tasks. MT-DNN not only leverages large amounts of cross-task data, but also benefits from a regularization effect that leads to more general representations to help adapt to new tasks and domains. MT-DNN extends the model proposed in Liu et al. (2015) by incorporating a pre-trained bidirectional transformer language model, known as BERT (Devlin et al., 2018). MT-DNN obtains new state-of-the-art results on ten NLU tasks, including SNLI, SciTail, and eight out of nine GLUE tasks, pushing the GLUE benchmark to 82.7% (2.2% absolute improvement) as of February 25, 2019 on the latest GLUE test set. We also demonstrate using the SNLI and SciTail datasets that the representations learned by MT-DNN allow domain adaptation with substantially fewer in-domain labels than the pre-trained BERT representations. Our code and pre-trained models will be made publicly available.

...read moreread less

647 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186

Collapse