Author

Glen Jeh

Bio: Glen Jeh is an academic researcher at Google who has contributed to research on web search queries and search engines. The author has an h-index of 21 and has co-authored 33 publications, which have received 5,570 citations. Previous affiliations of Glen Jeh include Stanford University.

Papers

Proceedings Article•DOI•

SimRank: a measure of structural-context similarity

Glen Jeh¹, Jennifer Widom¹•Institutions (1)

Stanford University¹

23 Jul 2002

TL;DR: Proposes a complementary approach, applicable in any domain with object-to-object relationships, that measures the similarity of the structural context in which objects occur, based on their relationships with other objects.

Abstract: The problem of measuring "similarity" of objects arises in many applications, and many domain-specific measures have been developed, e.g., matching text across documents or computing overlap among item-sets. We propose a complementary approach, applicable in any domain with object-to-object relationships, that measures similarity of the structural context in which objects occur, based on their relationships with other objects. Effectively, we compute a measure that says "two objects are similar if they are related to similar objects." This general similarity measure, called SimRank, is based on a simple and intuitive graph-theoretic model. For a given domain, SimRank can be combined with other domain-specific similarity measures. We suggest techniques for efficient computation of SimRank scores, and provide experimental results on two application domains showing the computational feasibility and effectiveness of our approach.

2,036 citations

Proceedings Article•DOI•

Scaling personalized web search

Glen Jeh¹, Jennifer Widom¹•Institutions (1)

Stanford University¹

20 May 2003

TL;DR: The approach enables incremental computation, so that the construction of personalized views from partial vectors is practical at query time, and experimental results demonstrate the effectiveness and scalability of the techniques.

Abstract: Recent web search techniques augment traditional text matching with a global notion of "importance" based on the linkage structure of the web, such as in Google's PageRank algorithm. For more refined searches, this global notion of importance can be specialized to create personalized views of importance--for example, importance scores can be biased according to a user-specified set of initially-interesting pages. Computing and storing all possible personalized views in advance is impractical, as is computing personalized views at query time, since the computation of each view requires an iterative computation over the web graph. We present new graph-theoretical results, and a new technique based on these results, that encode personalized views as partial vectors. Partial vectors are shared across multiple personalized views, and their computation and storage costs scale well with the number of views. Our approach enables incremental computation, so that the construction of personalized views from partial vectors is practical at query time. We present efficient dynamic programming algorithms for computing partial vectors, an algorithm for constructing personalized views from partial vectors, and experimental results demonstrating the effectiveness and scalability of our techniques.

1,370 citations

Patent•

Anticipated query generation and processing in a search engine

Sepandar D. Kamvar¹, Taher H. Haveliwala¹, Glen Jeh¹•Institutions (1)

Google¹

22 Jun 2004

TL;DR: A search system monitors the input of a search query by a user and, before the user finishes entering it, sends a portion of the query as a partial query to the search engine, which returns predicted queries for possible selection.

Abstract: A search system monitors the input of a search query by a user. Before the user finishes entering the search query, the search system identifies and sends a portion of the query as a partial query to the search engine. Based on the partial query, the search engine creates a set of predicted queries. This process may take into account prior queries submitted by a community of users, and may take into account a user profile. The predicted queries are sent back to the user for possible selection. The search system may also cache search results corresponding to one or more of the predicted queries in anticipation of the user selecting one of the predicted queries. The search engine may also return at least a portion of the search results corresponding to one or more of the predicted queries.

545 citations

Patent•

Results based personalization of advertisements in a search engine

Taher H. Haveliwala¹, Glen Jeh¹, Sepandar D. Kamvar¹•Institutions (1)

Google¹

21 Jun 2005

TL;DR: Personalized advertisements are provided to a user using a search engine to obtain documents relevant to a search query; the advertisements are personalized in response to a search profile that is derived from personalized search results.

Abstract: Personalized advertisements are provided to a user using a search engine to obtain documents relevant to a search query. The advertisements are personalized in response to a search profile that is derived from personalized search results. The search results are personalized based on a user profile of the user providing the query. The user profile describes interests of the user, and can be derived from a variety of sources, including prior search queries, prior search results, expressed interests, demographic, geographic, psychographic, and activity information.

387 citations

An Analytical Comparison of Approaches to Personalizing PageRank

Taher H. Haveliwala, Sepandar D. Kamvar, Glen Jeh

20 Jun 2003

TL;DR: This work analytically compares three recent approaches to personalizing PageRank and discusses the tradeoffs of each one.

Abstract: PageRank, the popular link-analysis algorithm for ranking web pages, assigns a query- and user-independent estimate of "importance" to web pages. Query- and user-sensitive extensions of PageRank, which use a basis set of biased PageRank vectors, have been proposed in order to personalize the ranking function in a tractable way. We analytically compare three recent approaches to personalizing PageRank and discuss the tradeoffs of each one.

231 citations

Cited by

Book•

Data Mining: Concepts and Techniques

Jiawei Han¹, Micheline Kamber², Jian Pei²•Institutions (2)

University of Illinois at Urbana–Champaign¹, Simon Fraser University²

08 Sep 2000

TL;DR: This book presents dozens of algorithms and implementation examples, all in pseudo-code and suitable for use in real-world, large-scale data mining projects, and provides a comprehensive, practical look at the concepts and techniques you need to get the most out of real business data.

Abstract: The increasing volume of data in modern business and science calls for more complex and sophisticated tools. Although advances in data mining technology have made extensive data collection much easier, it's still always evolving and there is a constant need for new techniques and tools that can help us transform this data into useful information and knowledge. Since the previous edition's publication, great advances have been made in the field of data mining. Not only does the third edition of Data Mining: Concepts and Techniques continue the tradition of equipping you with an understanding and application of the theory and practice of discovering patterns hidden in large data sets, it also focuses on new, important topics in the field: data warehouses and data cube technology, mining stream, mining social networks, and mining spatial, multimedia and other complex data. Each chapter is a stand-alone guide to a critical topic, presenting proven algorithms and sound implementations ready to be used directly or with strategic modification against live data. This is the resource you need if you want to apply today's most powerful data mining techniques to meet real business challenges. * Presents dozens of algorithms and implementation examples, all in pseudo-code and suitable for use in real-world, large-scale data mining projects. * Addresses advanced topics such as mining object-relational databases, spatial databases, multimedia databases, time-series databases, text databases, the World Wide Web, and applications in several fields. * Provides a comprehensive, practical look at the concepts and techniques you need to get the most out of real business data.

23,600 citations

Journal Article•DOI•

The Graph Neural Network Model

Franco Scarselli¹, Marco Gori¹, Ah Chung Tsoi², Markus Hagenbuchner³, Gabriele Monfardini¹•Institutions (3)

University of Siena¹, Hong Kong Baptist University², University of Wollongong³

01 Jan 2009-IEEE Transactions on Neural Networks

TL;DR: A new neural network model, called graph neural network (GNN) model, that extends existing neural network methods for processing the data represented in graph domains, and implements a function $\tau(G,n) \in \mathbb{R}^m$ that maps a graph G and one of its nodes n into an m-dimensional Euclidean space.

Abstract: Many underlying relationships among data in several areas of science and engineering, e.g., computer vision, molecular chemistry, molecular biology, pattern recognition, and data mining, can be represented in terms of graphs. In this paper, we propose a new neural network model, called graph neural network (GNN) model, that extends existing neural network methods for processing the data represented in graph domains. This GNN model, which can directly process most of the practically useful types of graphs, e.g., acyclic, cyclic, directed, and undirected, implements a function $\tau(G,n) \in \mathbb{R}^m$ that maps a graph G and one of its nodes n into an m-dimensional Euclidean space. A supervised learning algorithm is derived to estimate the parameters of the proposed GNN model. The computational cost of the proposed algorithm is also considered. Some experimental results are shown to validate the proposed learning algorithm, and to demonstrate its generalization capabilities.

5,701 citations

Journal Issue•DOI•

The link-prediction problem for social networks

David Liben-Nowell¹, Jon Kleinberg²•Institutions (2)

Carleton College¹, Cornell University²

01 May 2007-Journal of the Association for Information Science and Technology

TL;DR: Experiments on large coauthorship networks suggest that information about future interactions can be extracted from network topology alone, and that fairly subtle measures for detecting node proximity can outperform more direct measures.

Abstract: Given a snapshot of a social network, can we infer which new interactions among its members are likely to occur in the near future? We formalize this question as the link-prediction problem, and we develop approaches to link prediction based on measures for analyzing the “proximity” of nodes in a network. Experiments on large coauthorship networks suggest that information about future interactions can be extracted from network topology alone, and that fairly subtle measures for detecting node proximity can outperform more direct measures. © 2007 Wiley Periodicals, Inc.

4,181 citations

Journal Article•DOI•

Link prediction in complex networks: A survey

Linyuan Lü¹,²,³, Tao Zhou²,⁴•Institutions (4)

University of Shanghai for Science and Technology¹, University of Electronic Science and Technology of China², University of Fribourg³, University of Science and Technology of China⁴

15 Mar 2011-Physica A-statistical Mechanics and Its Applications

TL;DR: Recent progress in link prediction algorithms is summarized, emphasizing the contributions from physical perspectives and approaches, such as the random-walk-based methods and the maximum likelihood methods.

Abstract: Link prediction in complex networks has attracted increasing attention from both physical and computer science communities. The algorithms can be used to extract missing information, identify spurious interactions, evaluate network evolving mechanisms, and so on. This article summarizes recent progress in link prediction algorithms, emphasizing the contributions from physical perspectives and approaches, such as the random-walk-based methods and the maximum likelihood methods. We also introduce three typical applications: reconstruction of networks, evaluation of network evolving mechanism and classification of partially labeled networks. Finally, we introduce some applications and outline future challenges of link prediction algorithms.

2,530 citations
