scispace - formally typeset
D

David A. Hull

Researcher at Xerox

Publications -  26
Citations -  2342

David A. Hull is an academic researcher from Xerox. The author has contributed to research in topics: Track (disk drive) & Natural language. The author has an hindex of 18, co-authored 26 publications receiving 2322 citations.

Papers
More filters
Proceedings ArticleDOI

A comparison of classifiers and document representations for the routing problem

TL;DR: This paper compares learning techniques based on statistical classification to traditional methods of relevance feedback for the document routing problem and indicates that features based on latent semantic indexing are more effective for techniques such as linear discriminant analysis and logistic regression, which have no way to protect against overfitting.
Journal ArticleDOI

Stemming algorithms: a case study for detailed evaluation

TL;DR: A case study of stemming algorithms is described which describes a number of novel approaches to evaluation and demonstrates their value.
Proceedings ArticleDOI

Querying across languages: a dictionary-based approach to multilingual information retrieval

TL;DR: This paper found that correct identification and translation of multi-word terminology is the single most important source of error in the system, although amblguit y in translation also contributes to poor performance.
Proceedings Article

The TREC-9 Filtering Track Final Report.

TL;DR: This report describes the TREC–9 filtering track, presents some evaluation results, and provides a general commentary on lessons learned from this year's track.
Proceedings ArticleDOI

Method combination for document filtering

TL;DR: It is found that simple averaging strategies do indeed improve performance, but that direet averaging of probability estimates is not the correet approach, and the probabiJit y estimates must be renormalized using logistic regression on the known relevance judgments.