Author

R. Nigel Horspool

Bio: R. Nigel Horspool is an academic researcher from the University of Victoria. The author has contributed to research in the areas of parsing and compilers. The author has an h-index of 21, co-authored 59 publications receiving 1728 citations. Previous affiliations of R. Nigel Horspool include McGill University & University of Illinois at Urbana–Champaign.


Papers
Journal ArticleDOI
TL;DR: It is discovered that a method developed by Boyer and Moore can outperform even special-purpose search instructions that may be built into the computer hardware, except for very short substrings.
Abstract: The problem of searching through text to find a specified substring is considered in a practical setting. It is discovered that a method developed by Boyer and Moore can outperform even special-purpose search instructions that may be built into the computer hardware. For very short substrings, however, these special-purpose instructions are fastest, provided that they are used in an optimal way.

649 citations
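The method described in this paper is now widely known as the Boyer–Moore–Horspool algorithm. A minimal Python sketch of its bad-character shift (an illustration of the technique, not the paper's original implementation):

```python
def horspool_search(text: str, pattern: str) -> int:
    """Return the index of the first occurrence of pattern in text, or -1.

    Uses the bad-character shift table from Horspool's simplification
    of the Boyer-Moore algorithm.
    """
    m, n = len(pattern), len(text)
    if m == 0 or m > n:
        return -1 if m > n else 0
    # For each pattern character except the last: how far the window may
    # shift when that character is aligned with the pattern's last position.
    shift = {c: m - i - 1 for i, c in enumerate(pattern[:-1])}
    i = 0
    while i <= n - m:
        if text[i:i + m] == pattern:
            return i
        # Shift by the table entry for the text character under the last
        # pattern position (default: the whole pattern length).
        i += shift.get(text[i + m - 1], m)
    return -1

assert horspool_search("special-purpose search instructions", "search") == 16
```

The speed comes from the shift table: on a mismatch the window usually jumps several positions at once, so many text characters are never examined at all.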

Proceedings ArticleDOI
09 Oct 1997
TL;DR: Three new encodings of the subtype relation are presented: the packed encoding, the bit-packed encoding and the compact encoding. These encodings have different characteristics and are compared with other constant-time type inclusion tests.
Abstract: A type inclusion test determines whether one type is a subtype of another. Efficient type testing techniques exist for single subtyping, but not for languages with multiple subtyping. To date, the fast constant-time technique relies on a binary matrix encoding of the subtype relation with quadratic space requirements. In this paper, we present three new encodings of the subtype relation: the packed encoding, the bit-packed encoding and the compact encoding. These encodings have different characteristics. The bit-packed encoding delivers the best compression rates: on average 85% for real life programs. The packed encoding performs type inclusion tests in only 4 machine instructions. We present a fast algorithm for computing these encodings which runs in less than 13 milliseconds for PE and BPE, and 23 milliseconds for CE on an Alpha processor. Finally, we compare our results with other constant-time type inclusion tests on a suite of 11 large benchmark hierarchies.

85 citations
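For context, the quadratic-space binary-matrix baseline that these encodings compress can be sketched in a few lines of Python. The hierarchy below is hypothetical, and the paper's packed and bit-packed layouts are not reproduced here:

```python
# Hypothetical hierarchy with multiple subtyping:
# E is a subtype of C and B; C of A; D of B.
parents = {"A": [], "B": [], "C": ["A"], "D": ["B"], "E": ["C", "B"]}

def transitive_supertypes(t):
    """All supertypes of t, including t itself (reflexive closure)."""
    seen, stack = set(), [t]
    while stack:
        u = stack.pop()
        if u not in seen:
            seen.add(u)
            stack.extend(parents[u])
    return seen

types = sorted(parents)
index = {t: i for i, t in enumerate(types)}
# Quadratic-space binary matrix: matrix[sub][sup] is True iff sub <: sup.
matrix = [[False] * len(types) for _ in types]
for t in types:
    for s in transitive_supertypes(t):
        matrix[index[t]][index[s]] = True

def is_subtype(sub, sup):
    # Constant-time test: a single table lookup.
    return matrix[index[sub]][index[sup]]

assert is_subtype("E", "A") and is_subtype("E", "B")
assert not is_subtype("D", "A")
```

The paper's contribution is to shrink this matrix while keeping the test constant-time; the sketch only shows the baseline being compressed.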

Journal ArticleDOI
TL;DR: A new framework called MARD is presented to protect end points, which are often the last line of defense against metamorphic malware; it provides automation, platform independence, optimizations for real-time performance, and modularity.

80 citations

Book ChapterDOI
09 Jun 1997
TL;DR: A new algorithm based on graph coloring which computes a near optimal hierarchical encoding of type hierarchies is presented which improves significantly on previous results - it is faster, simpler and generates smaller bit vectors.
Abstract: A type inclusion test is a procedure to decide whether two types are related by a given subtyping relationship. An efficient implementation of the type inclusion test plays an important role in the performance of object-oriented programming languages with multiple subtyping like C++, Eiffel or Java. There are well-known methods for performing fast constant-time type inclusion tests that use a hierarchical bit vector encoding of the partially ordered set representing the type hierarchy. The number of instructions required by the type inclusion test is proportional to the length of those bit vectors. We present a new algorithm based on graph coloring which computes a near optimal hierarchical encoding of type hierarchies. The new algorithm improves significantly on previous results: it is faster, simpler and generates smaller bit vectors.

70 citations
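The underlying bit-vector test can be illustrated with a naive encoding that spends one dedicated bit per type. The hierarchy and names below are hypothetical, and the paper's graph-coloring algorithm, which lets types share bit positions to shorten the vectors, is deliberately not attempted here:

```python
# Hypothetical hierarchy; same shape as the earlier sketch.
parents = {"A": [], "B": [], "C": ["A"], "D": ["B"], "E": ["C", "B"]}

# Naive encoding: one bit per type.  Graph coloring would instead let
# types with no common subtype reuse the same bit position.
bit = {t: 1 << i for i, t in enumerate(sorted(parents))}

vector = {}
def encode(t):
    """Vector of t = its own bit OR-ed with all of its ancestors' vectors."""
    if t not in vector:
        v = bit[t]
        for p in parents[t]:
            v |= encode(p)
        vector[t] = v
    return vector[t]

for t in parents:
    encode(t)

def is_subtype(x, y):
    # x <: y  iff  every bit of y's vector is also set in x's vector.
    return vector[x] & vector[y] == vector[y]

assert is_subtype("E", "A")       # inherited through C
assert not is_subtype("C", "B")
```

The test costs a handful of AND/compare instructions per machine word of the vector, which is why shortening the vectors matters.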

Journal ArticleDOI
TL;DR: Huffman's algorithm makes it possible to generate minimum-redundancy codes for a finite set of messages with known transmission frequencies, but the binary system certainly remains the best suited to computing applications.

67 citations
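For context, a sketch of the standard binary Huffman construction over known frequencies (the frequencies below are the usual textbook example, not taken from the paper):

```python
import heapq

def huffman_code(freq):
    """Build a minimum-redundancy binary code for symbols with known
    frequencies (standard Huffman construction).  Returns {symbol: bits}."""
    # Heap entries: (weight, unique tiebreak, {symbol: codeword so far}).
    heap = [(w, i, {sym: ""}) for i, (sym, w) in enumerate(freq.items())]
    heapq.heapify(heap)
    count = len(heap)
    while len(heap) > 1:
        w1, _, c1 = heapq.heappop(heap)
        w2, _, c2 = heapq.heappop(heap)
        # Merge the two lightest subtrees, prefixing 0/1 to their codewords.
        merged = {s: "0" + c for s, c in c1.items()}
        merged.update({s: "1" + c for s, c in c2.items()})
        heapq.heappush(heap, (w1 + w2, count, merged))
        count += 1
    return heap[0][2]

code = huffman_code({"a": 45, "b": 13, "c": 12, "d": 16, "e": 9, "f": 5})
# More frequent symbols receive shorter codewords:
assert len(code["a"]) < len(code["f"])
```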


Cited by
Journal ArticleDOI
TL;DR: The state of the art in data compression is arithmetic coding, not the better-known Huffman method: arithmetic coding gives greater compression, is faster for adaptive models, and clearly separates the model from the channel encoding.
Abstract: The state of the art in data compression is arithmetic coding, not the better-known Huffman method. Arithmetic coding gives greater compression, is faster for adaptive models, and clearly separates the model from the channel encoding.

3,188 citations
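The interval-narrowing idea behind arithmetic coding can be illustrated with a toy floating-point coder over a fixed three-symbol model. The model is hypothetical, and production coders use integer arithmetic with renormalization to sidestep the precision limits of this sketch:

```python
# Fixed (static) model: symbol -> sub-interval [low, high) of [0, 1).
model = {"a": (0.0, 0.6), "b": (0.6, 0.9), "!": (0.9, 1.0)}

def encode(message):
    """Narrow [low, high) by each symbol's sub-interval; any number in
    the final interval identifies the whole message."""
    low, high = 0.0, 1.0
    for sym in message:
        s_low, s_high = model[sym]
        width = high - low
        low, high = low + width * s_low, low + width * s_high
    return (low + high) / 2

def decode(value, length):
    out = []
    for _ in range(length):
        for sym, (s_low, s_high) in model.items():
            if s_low <= value < s_high:
                out.append(sym)
                # Rescale value back into [0, 1) for the next symbol.
                value = (value - s_low) / (s_high - s_low)
                break
    return "".join(out)

msg = "aab!"
assert decode(encode(msg), len(msg)) == msg
```

Because the output is one number for the whole message rather than one codeword per symbol, a symbol can effectively occupy a fraction of a bit, which is where the gain over Huffman coding comes from.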

Journal ArticleDOI
TL;DR: This work surveys the current techniques to cope with the problem of string matching that allows errors, and focuses on online searching and mostly on edit distance, explaining the problem and its relevance, its statistical behavior, its history and current developments, and the central ideas of the algorithms.
Abstract: We survey the current techniques to cope with the problem of string matching that allows errors. This is becoming a more and more relevant issue for many fast growing areas such as information retrieval and computational biology. We focus on online searching and mostly on edit distance, explaining the problem and its relevance, its statistical behavior, its history and current developments, and the central ideas of the algorithms and their complexities. We present a number of experiments to compare the performance of the different algorithms and show which are the best choices. We conclude with some directions for future work and open problems.

2,723 citations
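The edit distance at the center of the survey is classically computed with dynamic programming; a compact sketch of the standard textbook algorithm (not any one algorithm from the survey):

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: the minimum number of single-character
    insertions, deletions and substitutions turning a into b."""
    # prev[j] = distance between a[:i-1] and b[:j] (the previous row).
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(
                prev[j] + 1,               # delete ca
                curr[j - 1] + 1,           # insert cb
                prev[j - 1] + (ca != cb),  # substitute (free on a match)
            ))
        prev = curr
    return prev[-1]

assert edit_distance("survey", "surgery") == 2
```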

Book
12 Jun 1992
TL;DR: For programmers and students interested in parsing text and automated indexing, it is the first collection in book form of the basic data structures and algorithms that are critical to the storage and retrieval of documents.
Abstract: An edited volume containing data structures and algorithms for information retrieval, including a disk with examples written in C. For programmers and students interested in parsing text and automated indexing, it is the first collection in book form of the basic data structures and algorithms that are critical to the storage and retrieval of documents.

2,359 citations
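A core structure behind such storage and retrieval of documents is the inverted index. A minimal sketch in Python (the book's own examples are in C; the documents below are made up):

```python
from collections import defaultdict

def build_index(docs):
    """Map each term to the set of ids of the documents containing it."""
    index = defaultdict(set)
    for doc_id, text in enumerate(docs):
        for term in text.lower().split():
            index[term].add(doc_id)
    return index

def search(index, query):
    """Conjunctive query: ids of documents containing every query term."""
    terms = query.lower().split()
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result

docs = ["data structures for text retrieval",
        "algorithms for automated indexing",
        "text indexing and retrieval algorithms"]
index = build_index(docs)
assert search(index, "retrieval text") == {0, 2}
```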

Book
22 Oct 1999
TL;DR: This book is unique in providing an overview of the four major approaches to program analysis: data flow analysis, constraint-based analysis, abstract interpretation, and type and effect systems.
Abstract: Program analysis utilizes static techniques for computing reliable information about the dynamic behavior of programs. Applications include compilers (for code improvement), software validation (for detecting errors) and transformations between data representations (for solving problems such as Y2K). This book is unique in providing an overview of the four major approaches to program analysis: data flow analysis, constraint-based analysis, abstract interpretation, and type and effect systems. The presentation illustrates the extensive similarities between the approaches, helping readers to choose the best one to utilize.

1,955 citations
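As a small taste of the first of those approaches, here is a live-variable data flow analysis over a three-node control-flow graph, iterated to a fixpoint with a worklist. The program and CFG are invented for illustration; the book's framework generalizes this kind of fixpoint iteration:

```python
# Tiny CFG for:  1: x = 2;  2: y = x + 1;  3: print(y);  with a back
# edge 3 -> 2 to make the fixpoint iteration non-trivial.
succ = {1: [2], 2: [3], 3: [2]}
gen  = {1: set(), 2: {"x"}, 3: {"y"}}   # variables read at each node
kill = {1: {"x"}, 2: {"y"}, 3: set()}   # variables written at each node

def liveness():
    """Backward may-analysis: live_in[n] = gen[n] | (live_out[n] - kill[n]),
    iterated to a fixpoint with a worklist."""
    live_in = {n: set() for n in succ}
    worklist = list(succ)
    while worklist:
        n = worklist.pop()
        live_out = set().union(*(live_in[s] for s in succ[n])) if succ[n] else set()
        new_in = gen[n] | (live_out - kill[n])
        if new_in != live_in[n]:
            live_in[n] = new_in
            # A change at n can affect its predecessors; revisit them.
            worklist.extend(p for p in succ if n in succ[p])
    return live_in

result = liveness()
assert "x" in result[2]    # x is live on entry to node 2
assert result[1] == set()  # nothing is live before x is defined
```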