scispace - formally typeset
Journal ArticleDOI

An expert system for quality control and duplicate detection in bibliographic databases

M. J. Ridley
- Vol. 26, Iss: 1, pp 1-18
TLDR
The expert system used to determine whether sets of records that appeared to be for the same monograph were in fact duplicates was outlined and problems and further developments in automated examination of bibliographic records are discussed.
Abstract
The QUALCAT project at the University of Bradford attempted to apply automated quality control to databases of bibliographic records. Sets of records, putative duplicates, that appeared to be for the same monograph were grouped together and an expert system used to determine whether they were in fact duplicates, and if so which were the best records. This paper outlines the expert system used and discusses problems and further developments in automated examination of bibliographic records.

read more

Citations
More filters
Proceedings ArticleDOI

Word sense disambiguation and information retrieval

TL;DR: From these results it has become clear that more basic research is needed to investigate the relationship between sense ambiguity, disambiguation, and IR.
Journal Article

Server-Initated Document Dissemination for the WWW.

TL;DR: Results of log analysis and trace driven simulations are presented that quantify the performance gains achievable through the use of a data dissemination mechanism that allows information to propagate from its producers to servers that are closer to its consumers.

Citation context analysis for information retrieval

Anna Ritchie
TL;DR: The main hypothesis that citation terms enhance a full-text representation of scientific papers is proven and the construction of a new, realistic test collection of scientific research papers is documented, with references and associated citations automatically annotated.
Journal Article

Genesis: An Approach to Data Dissemination in Advanced Traveler Information Systems.

TL;DR: Results of log analysis and trace-driven simulations are presented that quantify the performance gains achievable through the use of a data dissemination mechanism that allows information to propagate from its producers to servers that are closer to its consumers.

Duplicate Removal in Information Dissemination

TL;DR: This paper explains why duplicates arise, the problem is quantified, and why it impairs information dissemination is discussed, and a Duplicate Removal Module (DRM) is proposed for an information dissemination system.
References
More filters
Journal ArticleDOI

The universal standard bibliographic code (USBC): its use for clearing, merging and controlling large databases

TL;DR: The history of the Universal Standard Bibliographic Code (USBC) is traced from its original concept as a machine generated control number to its present status as a means of merging catalogues, eliminating duplication and providing quality control in machine‐based bibliographic databases.