J. Shane Culpepper

Other affiliations: University of Melbourne

Bio: J. Shane Culpepper is an academic researcher at RMIT University. The author has contributed to research on ranking in information retrieval. The author has an h-index of 24 and has co-authored 122 publications that have received 1803 citations. Previous affiliations of J. Shane Culpepper include the University of Melbourne.

Papers published on a yearly basis (chart not reproduced; years listed: 1995, 2005 to 2007, 2009 to 2023)

Papers

Journal Article•DOI•

Efficient set intersection for inverted indexing

J. Shane Culpepper¹, Alistair Moffat¹•Institutions (1)

University of Melbourne¹

27 Dec 2010-ACM Transactions on Information Systems

TL;DR: This article investigates intersection techniques that make use of both uncompressed “integer” representations, as well as compressed arrangements, and proposes a simple hybrid method that provides both compact storage and faster intersection computations for conjunctive querying than is possible even with uncompressed representations.

Abstract: Conjunctive Boolean queries are a key component of modern information retrieval systems, especially when Web-scale repositories are being searched. A conjunctive query q is equivalent to a |q|-way intersection over ordered sets of integers, where each set represents the documents containing one of the terms, and each integer in each set is an ordinal document identifier. As is the case with many computing applications, there is tension between the way in which the data is represented, and the ways in which it is to be manipulated. In particular, the sets representing index data for typical document collections are highly compressible, but are processed using random access techniques, meaning that methods for carrying out set intersections must be alert to issues to do with access patterns and data representation. Our purpose in this article is to explore these trade-offs, by investigating intersection techniques that make use of both uncompressed “integer” representations, as well as compressed arrangements. We also propose a simple hybrid method that provides both compact storage, and also faster intersection computations for conjunctive querying than is possible even with uncompressed representations.
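
To make the idea of a conjunctive query as a multi-way intersection over ordered sets of document identifiers concrete, here is a minimal Python sketch. It is not the hybrid method proposed in the article; it is a generic smallest-list-first intersection over uncompressed sorted lists, using a galloping (exponential) search to skip ahead in the longer list. The function names `gallop`, `intersect_pair` and `conjunctive_query`, and the sample postings, are hypothetical and chosen only for illustration.

```python
from bisect import bisect_left
from typing import List, Sequence


def gallop(large: Sequence[int], target: int, start: int) -> int:
    """Return the smallest index i >= start with large[i] >= target,
    or len(large) if no such index exists.

    Assumes every element before `start` is already known to be < target.
    """
    step = 1
    lo = start
    hi = start
    # Probe at doubling distances until we overshoot the target or the list.
    while hi < len(large) and large[hi] < target:
        lo = hi + 1
        hi += step
        step *= 2
    # Finish with a binary search inside the bracketed range.
    return bisect_left(large, target, lo, min(hi, len(large)))


def intersect_pair(small: Sequence[int], large: Sequence[int]) -> List[int]:
    """Intersect two sorted lists of distinct document identifiers."""
    out: List[int] = []
    pos = 0
    for doc_id in small:
        pos = gallop(large, doc_id, pos)
        if pos == len(large):
            break  # the longer list is exhausted
        if large[pos] == doc_id:
            out.append(doc_id)
            pos += 1
    return out


def conjunctive_query(postings: Sequence[Sequence[int]]) -> List[int]:
    """Intersect all term lists, starting from the shortest one."""
    if not postings:
        return []
    ordered = sorted(postings, key=len)
    candidates = list(ordered[0])
    for plist in ordered[1:]:
        if not candidates:
            break  # an empty intermediate result cannot grow again
        candidates = intersect_pair(candidates, plist)
    return candidates


if __name__ == "__main__":
    # Hypothetical postings lists for a three-term query.
    lists = [
        [1, 4, 7, 9, 12, 20],
        [4, 9, 10, 12, 33],
        [2, 4, 12, 40, 41, 42, 50],
    ]
    print(conjunctive_query(lists))
```

Processing the shortest list first keeps the candidate set small, and the galloping search reflects the access-pattern concern the abstract raises: it relies on random access into the longer list, which is straightforward for uncompressed arrays but needs extra support when the lists are stored compressed.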

173 citations

Journal Article•DOI•

Research Frontiers in Information Retrieval: Report from the Third Strategic Workshop on Information Retrieval in Lorne (SWIRL 2018)

J. Shane Culpepper¹, Fernando Diaz², Mark D. Smucker³•Institutions (3)

RMIT University¹, Microsoft², University of Waterloo³

31 Aug 2018

TL;DR: The intent is that this description of open problems will help to inspire researchers and graduate students to address the questions, and will provide funding agencies data to focus and coordinate support for information retrieval research.

Abstract: The purpose of the Strategic Workshop in Information Retrieval in Lorne is to explore the long-range issues of the Information Retrieval field, to recognize challenges that are on - or even over - the horizon, to build consensus on some of the key challenges, and to disseminate the resulting information to the research community. The intent is that this description of open problems will help to inspire researchers and graduate students to address the questions, and will provide funding agencies data to focus and coordinate support for information retrieval research.

116 citations

Posted Content•

A Survey on Trajectory Data Management, Analytics, and Learning

Sheng Wang¹, Zhifeng Bao², J. Shane Culpepper², Gao Cong³•Institutions (3)

New York University¹, RMIT University², Nanyang Technological University³

25 Mar 2020-arXiv: Databases

TL;DR: This survey comprehensively reviews recent research trends in trajectory data management, ranging from trajectory pre-processing and storage to common trajectory analytic tools, such as querying spatial-only and spatial-textual trajectory data and trajectory clustering, and explores four closely related analytical tasks commonly used with trajectory data in interactive or real-time processing.

Abstract: Recent advances in sensor and mobile devices have enabled an unprecedented increase in the availability and collection of urban trajectory data, thus increasing the demand for more efficient ways to manage and analyze the data being produced. In this survey, we comprehensively review recent research trends in trajectory data management, ranging from trajectory pre-processing, storage, common trajectory analytic tools, such as querying spatial-only and spatial-textual trajectory data, and trajectory clustering. We also explore four closely related analytical tasks commonly used with trajectory data in interactive or real-time processing. Deep trajectory learning is also reviewed for the first time. Finally, we outline the essential qualities that a trajectory data management system should possess in order to maximize flexibility.

85 citations

Book Chapter•DOI•

Top-k ranked document search in general text databases

J. Shane Culpepper¹, Gonzalo Navarro², Simon J. Puglisi¹, Andrew Turpin¹•Institutions (2)

RMIT University¹, University of Chile²

06 Sep 2010

TL;DR: This paper presents two new algorithms for ranking documents against a query without making any assumptions on the structure of the underlying text, significantly faster than existing methods in RAM and even three times faster than a state-of-the-art inverted file implementation for English text when word queries are issued.

Abstract: Text search engines return a set of k documents ranked by similarity to a query. Typically, documents and queries are drawn from natural language text, which can readily be partitioned into words, allowing optimizations of data structures and algorithms for ranking. However, in many new search domains (DNA, multimedia, OCR texts, Far East languages) there is often no obvious definition of words and traditional indexing approaches are not so easily adapted, or break down entirely. We present two new algorithms for ranking documents against a query without making any assumptions on the structure of the underlying text. We build on existing theoretical techniques, which we have implemented and compared empirically with new approaches introduced in this paper. Our best approach is significantly faster than existing methods in RAM, and is even three times faster than a state-of-the-art inverted file implementation for English text when word queries are issued.

81 citations

Proceedings Article•DOI•

Neural Query Performance Prediction using Weak Supervision from Multiple Signals

Hamed Zamani¹, W. Bruce Croft¹, J. Shane Culpepper²•Institutions (2)

University of Massachusetts Amherst¹, RMIT University²

27 Jun 2018

TL;DR: This paper proposes a general end-to-end query performance prediction framework based on neural networks, called NeuralQPP, which significantly outperforms state-of-the-art baselines, in nearly every case.

Abstract: Predicting the performance of a search engine for a given query is a fundamental and challenging task in information retrieval. Accurate performance predictors can be used in various ways, such as triggering an action, choosing the most effective ranking function per query, or selecting the best variant from multiple query formulations. In this paper, we propose a general end-to-end query performance prediction framework based on neural networks, called NeuralQPP. Our framework consists of multiple components, each learning a representation suitable for performance prediction. These representations are then aggregated and fed into a prediction sub-network. We train our models with multiple weak supervision signals, which is an unsupervised learning approach that uses the existing unsupervised performance predictors using weak labels. We also propose a simple yet effective component dropout technique to regularize our model. Our experiments on four newswire and web collections demonstrate that NeuralQPP significantly outperforms state-of-the-art baselines, in nearly every case. Furthermore, we thoroughly analyze the effectiveness of each component, each weak supervision signal, and all resulting combinations in our experiments.

80 citations

Cited by

Journal Article•DOI•

Marine natural products.

John W. Blunt¹, Brent R. Copp², Murray H. G. Munro¹, Peter T. Northcote³, Michèle R. Prinsep⁴, +1 more•Institutions (4)

University of Canterbury¹, University of Auckland², Victoria University of Wellington³, University of Waikato⁴

01 Feb 2011-Natural Product Reports

TL;DR: This review covers the literature published in 2014 for marine natural products, with 1116 citations referring to compounds isolated from marine microorganisms and phytoplankton, green, brown and red algae, sponges, cnidarians, bryozoans, molluscs, tunicates, echinoderms, mangroves and other intertidal plants and microorganisms.

4,649 citations

On robust estimation of the location parameter

Frederick R. Forst

01 Jan 1980

3,652 citations

Journal Article•DOI•

Organic Azides: An Exploding Diversity of a Unique Class of Compounds

Stefan Bräse¹, Carmen Gil², Kerstin Knepper², V. Zimmermann²•Institutions (2)

Karlsruhe Institute of Technology¹, University of Bonn²

19 Aug 2005-Angewandte Chemie

TL;DR: In this Review, the fundamental characteristics of azide chemistry and current developments are presented and the focus will be placed on cycloadditions (Huisgen reaction), aza ylide chemistry, and the synthesis of heterocycles.

Abstract: Since the discovery of organic azides by Peter Griess more than 140 years ago, numerous syntheses of these energy-rich molecules have been developed. In more recent times in particular, completely new perspectives have been developed for their use in peptide chemistry, combinatorial chemistry, and heterocyclic synthesis. Organic azides have assumed an important position at the interface between chemistry, biology, medicine, and materials science. In this Review, the fundamental characteristics of azide chemistry and current developments are presented. The focus will be placed on cycloadditions (Huisgen reaction), aza ylide chemistry, and the synthesis of heterocycles. Further reactions such as the aza-Wittig reaction, the Sundberg rearrangement, the Staudinger ligation, the Boyer and Boyer-Aube rearrangements, the Curtius rearrangement, the Schmidt rearrangement, and the Hemetsberger rearrangement bear witness to the versatility of modern azide chemistry.

1,766 citations

Journal Article•DOI•

Inverted files for text search engines

Justin Zobel¹, Alistair Moffat²•Institutions (2)

RMIT University¹, University of Melbourne²

25 Jul 2006-ACM Computing Surveys

TL;DR: This tutorial introduces the key techniques in the area of text indexing, describing both a core implementation and how the core can be enhanced through a range of extensions.

Abstract: The technology underlying text search engines has advanced dramatically in the past decade. The development of a family of new index representations has led to a wide range of innovations in index storage, index construction, and query evaluation. While some of these developments have been consolidated in textbooks, many specific techniques are not widely known or the textbook descriptions are out of date. In this tutorial, we introduce the key techniques in the area, describing both a core implementation and how the core can be enhanced through a range of extensions. We conclude with a comprehensive bibliography of text indexing literature.

1,218 citations

C4.5: Programs for Machine Learning (book review)

重郎金田

01 May 1995

1,164 citations
