Conference

ACM international conference on Digital libraries

About: ACM international conference on Digital libraries is an academic conference. The conference publishes mainly in the areas of Digital library and Metadata. Over its lifetime, the conference has published 496 publications, which have received 14,778 citations.

Topics: Digital library, Metadata, The Internet, Web query classification, Web search query

Papers

Proceedings Article

Snowball: extracting relations from large plain-text collections

Eugene Agichtein¹, Luis Gravano¹•Institutions (1)

Columbia University¹

01 Jun 2000

TL;DR: This paper develops a scalable evaluation methodology and metrics for the task, and presents a thorough experimental evaluation of Snowball and comparable techniques over a collection of more than 300,000 newspaper documents.

Abstract: Text documents often contain valuable structured data that is hidden in regular English sentences. This data is best exploited if available as a relational table that we could use for answering precise queries or running data mining tasks. We explore a technique for extracting such tables from document collections that requires only a handful of training examples from users. These examples are used to generate extraction patterns that in turn result in new tuples being extracted from the document collection. We build on this idea and present our Snowball system. Snowball introduces novel strategies for generating patterns and extracting tuples from plain-text documents. At each iteration of the extraction process, Snowball evaluates the quality of these patterns and tuples without human intervention, and keeps only the most reliable ones for the next iteration. In this paper we also develop a scalable evaluation methodology and metrics for our task, and present a thorough experimental evaluation of Snowball and comparable techniques over a collection of more than 300,000 newspaper documents.
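
The loop described in the abstract above (seed examples generate patterns, patterns extract new tuples, and only reliable patterns survive) can be illustrated with a toy sketch. This is not the Snowball implementation. The organization/location relation, the exact-string patterns, the capitalized-word entity matcher, the confidence formula, and all names (`bootstrap`, `SENTENCES`, `SEEDS`, `min_confidence`) are assumptions made for illustration only.

```python
import re

# Hypothetical toy corpus and seed tuple, invented for this example.
SENTENCES = [
    "Microsoft, based in Redmond, announced earnings.",
    "Apple, based in Cupertino, released a phone.",
    "Microsoft met Redmond officials.",
    "Microsoft met Google yesterday.",
    "Apple met Samsung in court.",
]
SEEDS = {("Microsoft", "Redmond")}


def bootstrap(sentences, seeds, iterations=2, min_confidence=0.6):
    """Toy bootstrapping: known tuples -> patterns -> new tuples."""
    tuples = set(seeds)
    for _ in range(iterations):
        # Step 1: the text between a known pair becomes a candidate pattern.
        patterns = set()
        for left, right in tuples:
            for sentence in sentences:
                found = re.search(
                    re.escape(left) + r"(.{1,30}?)" + re.escape(right), sentence
                )
                if found:
                    patterns.add(found.group(1))

        # Step 2: score each pattern against what is already known.
        known = dict(tuples)
        for pattern in patterns:
            regex = re.compile(r"([A-Z]\w+)" + re.escape(pattern) + r"([A-Z]\w+)")
            matches = [pair for s in sentences for pair in regex.findall(s)]
            positive = sum(1 for l, r in matches if known.get(l) == r)
            negative = sum(1 for l, r in matches if l in known and known[l] != r)
            total = positive + negative
            confidence = positive / total if total else 0.0
            # Step 3: keep only tuples produced by sufficiently reliable patterns.
            if confidence >= min_confidence:
                tuples.update(matches)
    return tuples


RESULT = bootstrap(SENTENCES, SEEDS)
```

In this toy data the pattern ", based in " agrees with the seed and yields the new pair (Apple, Cupertino), while " met " contradicts the seed half the time and is discarded, so no tuple is taken from it.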

1,399 citations

Proceedings Article

Content-based book recommending using learning for text categorization

Raymond J. Mooney¹, Loriene Roy¹•Institutions (1)

University of Texas at Austin¹

01 Jun 2000

TL;DR: This work describes a content-based book recommending system that utilizes information extraction and a machine-learning algorithm for text categorization, and reports initial experimental results demonstrating that this approach can produce accurate recommendations.

Abstract: Recommender systems improve access to relevant products and information by making personalized suggestions based on previous examples of a user's likes and dislikes. Most existing recommender systems use collaborative filtering methods that base recommendations on other users' preferences. By contrast, content-based methods use information about an item itself to make suggestions. This approach has the advantage of being able to recommend previously unrated items to users with unique interests and to provide explanations for its recommendations. We describe a content-based book recommending system that utilizes information extraction and a machine-learning algorithm for text categorization. Initial experimental results demonstrate that this approach can produce accurate recommendations.

1,330 citations

Proceedings Article

CiteSeer: an automatic citation indexing system

C. Lee Giles¹, Kurt Bollacker¹, Steve Lawrence¹•Institutions (1)

Princeton University¹

11 May 1998

TL;DR: CiteSeer has many advantages over traditional citation indexes, including the ability to create more up-to-date databases which are not limited to a preselected set of journals or restricted by journal publication delays, completely autonomous operation with a corresponding reduction in cost, and powerful interactive browsing of the literature using the context of citations.

Abstract: We present CiteSeer: an autonomous citation indexing system which indexes academic literature in electronic format (e.g. Postscript files on the Web). CiteSeer understands how to parse citations, identify citations to the same paper in different formats, and identify the context of citations in the body of articles. CiteSeer provides most of the advantages of traditional (manually constructed) citation indexes (e.g. the ISI citation indexes), including: literature retrieval by following citation links (e.g. by providing a list of papers that cite a given paper), the evaluation and ranking of papers, authors, journals, etc. based on the number of citations, and the identification of research trends. CiteSeer has many advantages over traditional citation indexes, including the ability to create more up-to-date databases which are not limited to a preselected set of journals or restricted by journal publication delays, completely autonomous operation with a corresponding reduction in cost, and powerful interactive browsing of the literature using the context of citations. Given a particular paper of interest, CiteSeer can display the context of how the paper is cited in subsequent publications. This context may contain a brief summary of the paper, another author’s response to the paper, or subsequent work which builds upon the original article. CiteSeer allows the location of papers by keyword search or by citation links. Papers related to a given paper can be located using common citation information or word vector similarity. CiteSeer will soon be available for public use.

990 citations

Proceedings Article

KEA: practical automatic keyphrase extraction

Ian H. Witten¹, Gordon W. Paynter¹, Eibe Frank¹, Carl Gutwin², Craig G. Nevill-Manning³•Institutions (3)

University of Waikato¹, University of Saskatchewan², Rutgers University³

01 Aug 1999

TL;DR: Kea identifies candidate keyphrases using lexical methods, calculates feature values for each candidate, and uses a machine-learning algorithm to predict which candidates are good keyphrases.

Abstract: Keyphrases provide semantic metadata that summarize and characterize documents. This paper describes Kea, an algorithm for automatically extracting keyphrases from text. Kea identifies candidate keyphrases using lexical methods, calculates feature values for each candidate, and uses a machine-learning algorithm to predict which candidates are good keyphrases. The machine learning scheme first builds a prediction model using training documents with known keyphrases, and then uses the model to find keyphrases in new documents. We use a large test corpus to evaluate Kea’s effectiveness in terms of how many author-assigned keyphrases are correctly identified. The system is simple, robust, and publicly available.
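
The first two stages named in the abstract above (identifying candidates lexically, then computing feature values per candidate) might look roughly like the sketch below. It is not the Kea code. The abstract does not say which lexical rules or features are used, so the stopword rule, the two features (a smoothed TF-IDF score and the relative position of first occurrence), and all names (`candidate_positions`, `features`, `STOPWORDS`, `DOC`, `CORPUS`) are assumptions chosen for illustration. The learning stage, which would be trained on documents with known keyphrases, is omitted.

```python
import math
import re

# Hypothetical stopword list; a real system would use a much larger one.
STOPWORDS = {"the", "a", "an", "of", "and", "for", "in", "to", "is", "from"}


def candidate_positions(text, max_len=3):
    """Map each candidate phrase to the word offsets where it occurs.

    Candidates are word n-grams (up to max_len words) that neither start
    nor end with a stopword. Sentence boundaries are ignored for brevity.
    """
    words = re.findall(r"[a-z]+", text.lower())
    positions = {}
    for i in range(len(words)):
        for n in range(1, max_len + 1):
            gram = words[i:i + n]
            if len(gram) < n:  # ran past the end of the document
                break
            if gram[0] in STOPWORDS or gram[-1] in STOPWORDS:
                continue
            positions.setdefault(" ".join(gram), []).append(i)
    return positions, len(words)


def features(text, corpus):
    """Return {phrase: (tfidf, first_occurrence)} for one document."""
    positions, n_words = candidate_positions(text)
    corpus_phrases = [set(candidate_positions(doc)[0]) for doc in corpus]
    feats = {}
    for phrase, offsets in positions.items():
        doc_freq = sum(phrase in phrases for phrases in corpus_phrases)
        # Smoothed TF-IDF: frequent here, rare elsewhere scores higher.
        tfidf = (len(offsets) / n_words) * math.log(
            (1 + len(corpus)) / (1 + doc_freq)
        )
        # Relative position of the first occurrence, in [0, 1).
        feats[phrase] = (tfidf, offsets[0] / n_words)
    return feats


# Hypothetical example inputs.
DOC = "Keyphrase extraction from text. Keyphrase extraction is simple."
CORPUS = ["Text mining is fun."]
FEATS = features(DOC, CORPUS)
```

A classifier trained on documents with author-assigned keyphrases would then take these feature pairs as input and rank the candidates.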

912 citations

Proceedings Article

Annotation: from paper books to the digital library

Catherine C. Marshall¹•Institutions (1)

Xerox¹

01 Jul 1997

TL;DR: This paper examines the practice of annotation in a particular situation, the markings students make in university-level textbooks, focusing on the form and function of these annotations and their status within a community of fellow textbook readers.

Abstract: Readers annotate paper books as a routine part of their engagement with the materials; it is a useful practice, manifested through a wide variety of markings made in service of very different purposes. This paper examines the practice of annotation in a particular situation: the markings students make in university-level textbooks. The study focuses on the form and function of these annotations, and their status within a community of fellow textbook readers. Using this study as a basis, I discuss issues and implications for the design of annotation tools for a digital library setting.

479 citations

Performance Metrics

Papers: 496

Citations: 14,778

No. of papers from the Conference in previous years
Year	Papers
2021	12
2020	8
2019	13
2018	14
2015	1
2014	1