Journal ArticleDOI
An expert system for quality control and duplicate detection in bibliographic databases
M. J. Ridley
- Vol. 26, Iss: 1, pp 1-18
TLDR
The expert system used to determine whether sets of records that appeared to be for the same monograph were in fact duplicates was outlined and problems and further developments in automated examination of bibliographic records are discussed.Abstract:
The QUALCAT project at the University of Bradford attempted to apply automated quality control to databases of bibliographic records. Sets of records, putative duplicates, that appeared to be for the same monograph were grouped together and an expert system used to determine whether they were in fact duplicates, and if so which were the best records. This paper outlines the expert system used and discusses problems and further developments in automated examination of bibliographic records.read more
Citations
More filters
Proceedings ArticleDOI
Word sense disambiguation and information retrieval
TL;DR: From these results it has become clear that more basic research is needed to investigate the relationship between sense ambiguity, disambiguation, and IR.
Journal Article
Server-Initated Document Dissemination for the WWW.
Azer Bestavros,Carlos Cunha +1 more
TL;DR: Results of log analysis and trace driven simulations are presented that quantify the performance gains achievable through the use of a data dissemination mechanism that allows information to propagate from its producers to servers that are closer to its consumers.
Citation context analysis for information retrieval
TL;DR: The main hypothesis that citation terms enhance a full-text representation of scientific papers is proven and the construction of a new, realistic test collection of scientific research papers is documented, with references and associated citations automatically annotated.
Journal Article
Genesis: An Approach to Data Dissemination in Advanced Traveler Information Systems.
TL;DR: Results of log analysis and trace-driven simulations are presented that quantify the performance gains achievable through the use of a data dissemination mechanism that allows information to propagate from its producers to servers that are closer to its consumers.
Duplicate Removal in Information Dissemination
Tak W. Yan,Hector Garcia-Molina +1 more
TL;DR: This paper explains why duplicates arise, the problem is quantified, and why it impairs information dissemination is discussed, and a Duplicate Removal Module (DRM) is proposed for an information dissemination system.
References
More filters
Journal ArticleDOI
The universal standard bibliographic code (USBC): its use for clearing, merging and controlling large databases
TL;DR: The history of the Universal Standard Bibliographic Code (USBC) is traced from its original concept as a machine generated control number to its present status as a means of merging catalogues, eliminating duplication and providing quality control in machine‐based bibliographic databases.