Book ChapterDOI
The DBLP Computer Science Bibliography: Evolution, Research Issues, Perspectives
Michael Ley
- pp 1-10
Reads0
Chats0
TLDR
The most time-consuming task for the maintainers of DBLP may be viewed as a special instance of the authority control problem: how to normalize different spellings of person names.Abstract:
Publications are essential for scientific communication. Access to publications is provided by conventional libraries, digital libraries operated by learned societies or commercial publishers, and a huge number of web sites maintained by the scientists themselves or their institutions. Comprehensive meta-indices for this increasing number of information sources are missing for most areas of science. The DBLP Computer Science Bibliography of the University of Trier has grown from a very specialized small collection of bibliographic information to a major part of the infrastructure used by thousands of computer scientists. This short paper first reports the history of DBLP and sketches the very simple software behind the service. The most time-consuming task for the maintainers of DBLP may be viewed as a special instance of the authority control problem: how to normalize different spellings of person names. The third section of the paper discusses some details of this problem which might be an interesting research issue for the information retrieval community.read more
Citations
More filters
Journal ArticleDOI
Evolutionary Network Analysis: A Survey
TL;DR: This survey provides an overview of the vast literature on graph evolution analysis and the numerous applications that arise in different contexts.
Proceedings Article
The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics
Steven Bird,Robert Dale,Bonnie J. Dorr,Bryan R. Gibson,Mark Thomas Joseph,Min-Yen Kan,Dongwon Lee,Brett Powley,Dragomir R. Radev,Yee Fan Tan +9 more
TL;DR: This article presented a post-print of a paper from Sixth International Conference on Language Resources and Evaluation 2008 http://www.lrec-conf.org/lrec2008-2008/
Proceedings ArticleDOI
Iterative record linkage for cleaning and integration
TL;DR: Results are presented that illustrate the power and feasibility of making use of join information when comparing records and the need to make multiple passes over the data to correctly find all duplicates.
Proceedings ArticleDOI
Measuring the mixing time of social graphs
TL;DR: The findings show that the mixing time of social graphs is much larger than anticipated, and being used in literature, and this implies that either the current security systems based on fast mixing have weaker utility guarantees or have to be less efficient, with less security guarantees, in order to compensate for the slower mixing.
Journal ArticleDOI
Defrosting the Digital Library: Bibliographic Tools for the Next Generation Web
TL;DR: A range of new applications such as Zotero, Mendeley, Mekentosj Papers, MyNCBI, CiteULike, Connotea, and HubMed that exploit the Web to make these digital libraries more personal, sociable, integrated, and accessible places are examined.
References
More filters
Book
Managing Gigabytes: Compressing and Indexing Documents and Images
TL;DR: A guide to the MG system and its applications, as well as a comparison to the NZDL reference index, are provided.
Journal ArticleDOI
XMill: an efficient compressor for XML data
Hartmut Liefke,Dan Suciu +1 more
TL;DR: A tool for compressing XML data, with applications in data exchange and archiving, which usually achieves about twice the compression ratio of gzip at roughly the same speed.
Journal ArticleDOI
Fast and flexible word searching on compressed text
TL;DR: A fast compression technique for natural language texts that allows a large number of variations over the basic word and phrase search capability, such as sets of characters, arbitrary regular expressions, and approximate matching.