scispace - formally typeset
Book ChapterDOI

The DBLP Computer Science Bibliography: Evolution, Research Issues, Perspectives

Reads0
Chats0
TLDR
The most time-consuming task for the maintainers of DBLP may be viewed as a special instance of the authority control problem: how to normalize different spellings of person names.
Abstract
Publications are essential for scientific communication. Access to publications is provided by conventional libraries, digital libraries operated by learned societies or commercial publishers, and a huge number of web sites maintained by the scientists themselves or their institutions. Comprehensive meta-indices for this increasing number of information sources are missing for most areas of science. The DBLP Computer Science Bibliography of the University of Trier has grown from a very specialized small collection of bibliographic information to a major part of the infrastructure used by thousands of computer scientists. This short paper first reports the history of DBLP and sketches the very simple software behind the service. The most time-consuming task for the maintainers of DBLP may be viewed as a special instance of the authority control problem: how to normalize different spellings of person names. The third section of the paper discusses some details of this problem which might be an interesting research issue for the information retrieval community.

read more

Citations
More filters
Journal ArticleDOI

Evolutionary Network Analysis: A Survey

TL;DR: This survey provides an overview of the vast literature on graph evolution analysis and the numerous applications that arise in different contexts.
Proceedings ArticleDOI

Iterative record linkage for cleaning and integration

TL;DR: Results are presented that illustrate the power and feasibility of making use of join information when comparing records and the need to make multiple passes over the data to correctly find all duplicates.
Proceedings ArticleDOI

Measuring the mixing time of social graphs

TL;DR: The findings show that the mixing time of social graphs is much larger than anticipated, and being used in literature, and this implies that either the current security systems based on fast mixing have weaker utility guarantees or have to be less efficient, with less security guarantees, in order to compensate for the slower mixing.
Journal ArticleDOI

Defrosting the Digital Library: Bibliographic Tools for the Next Generation Web

TL;DR: A range of new applications such as Zotero, Mendeley, Mekentosj Papers, MyNCBI, CiteULike, Connotea, and HubMed that exploit the Web to make these digital libraries more personal, sociable, integrated, and accessible places are examined.
References
More filters
Book

Managing Gigabytes: Compressing and Indexing Documents and Images

TL;DR: A guide to the MG system and its applications, as well as a comparison to the NZDL reference index, are provided.
Journal ArticleDOI

XMill: an efficient compressor for XML data

TL;DR: A tool for compressing XML data, with applications in data exchange and archiving, which usually achieves about twice the compression ratio of gzip at roughly the same speed.
Book

Managing gigabytes

Ian H. Witten
Journal ArticleDOI

Fast and flexible word searching on compressed text

TL;DR: A fast compression technique for natural language texts that allows a large number of variations over the basic word and phrase search capability, such as sets of characters, arbitrary regular expressions, and approximate matching.
Related Papers (5)