Applying Graph-based Keyword Extraction to Document Retrieval

Open AccessProceedings Article

Applying Graph-based Keyword Extraction to Document Retrieval

Youngsam Kim, +5 more

- pp 864-868

Chats0

TLDR

A keyword extraction process, based on the PageRank algorithm, to reduce noise of input data for measuring semantic similarity and experimental results showed significantly improved document retrieval performance with this extraction process in place.

Abstract:

This paper proposes a keyword extraction process, based on the PageRank algorithm, to reduce noise of input data for measuring semantic similarity. This paper will introduce several features related to implementation and discuss their effects. It will also discuss experimental results which showed significantly improved document retrieval performance with this extraction process in place.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

DivGraphPointer: A Graph Pointer Network for Extracting Diverse Keyphrases

Zhiqing Sun, +4 more

TL;DR: An end-to-end method called DivGraphPointer is presented for extracting a set of diversified keyphrases from a document that combines the advantages of traditional graph-based ranking methods and recent neural network-based approaches.

...read moreread less

Journal ArticleDOI

Fast and Constrained Absent Keyphrase Generation by Prompt-Based Learning

Huanqin Wu, +3 more

- 28 Jun 2022 -

Proceedings of the ... AAAI Conference o...

TL;DR: The result shows that the proposed constrained absent keyphrase generation method can generate more consistent keyphrases, which can improve document retrieval performance, and with a non-autoregressive decoding manner, can speed up the absentKeyphrase generation by 8.67× compared with the autoregressive method.

...read moreread less

Proceedings ArticleDOI

Hyperbolic Relevance Matching for Neural Keyphrase Extraction

Mingyang Song, +2 more

TL;DR: A newhyperbolic matching model (HyperMatch) is designed to explore keyphrase extraction in hyperbolic space and outperforms the recent state-of-the-art baselines on six benchmark datasets.

...read moreread less

Proceedings ArticleDOI

DivGraphPointer: A Graph Pointer Network for Extracting Diverse Keyphrases

Zhiqing Sun, +4 more

- 19 May 2019 -

arXiv: Computation and Language

TL;DR: DivGraphPointer as discussed by the authors combines the advantages of traditional graph-based ranking methods and recent neural network-based approaches to extract a set of diversified keyphrases from a document.

...read moreread less

Proceedings ArticleDOI

Extraction of keyphrases from single document based on hierarchical concepts

Miroslav Smatana, +1 more

TL;DR: This paper provides modification of approaches for extraction of keyphrases from single textual document based on the hierarchical concepts created upon the text of particular document using FCA-based algorithm known as generalized one-sided concept lattice.

...read moreread less

References

PDF

Open Access

More filters

Journal ArticleDOI

The anatomy of a large-scale hypertextual Web search engine

Sergey Brin, +1 more

TL;DR: This paper provides an in-depth description of Google, a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext and looks at the problem of how to effectively deal with uncontrolled hypertext collections where anyone can publish anything they want.

...read moreread less

Journal Article

The Anatomy of a Large-Scale Hypertextual Web Search Engine.

Sergey Brin, +1 more

- 01 Jan 1998 -

Computer Networks

TL;DR: Google as discussed by the authors is a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext and is designed to crawl and index the Web efficiently and produce much more satisfying search results than existing systems.

...read moreread less

Book

Introduction to Information Retrieval

Christopher D. Manning, +2 more

TL;DR: In this article, the authors present an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections.

...read moreread less

Journal ArticleDOI

Term Weighting Approaches in Automatic Text Retrieval

Gerard Salton, +1 more

- 01 Aug 1988 -

Information Processing and Management

TL;DR: This paper summarizes the insights gained in automatic term weighting, and provides baseline single term indexing models with which other more elaborate content analysis procedures can be compared.

...read moreread less

Journal ArticleDOI

A vector space model for automatic indexing

Gerard Salton, +2 more

- 01 Nov 1975 -

Communications of The ACM

TL;DR: An approach based on space density computations is used to choose an optimum indexing vocabulary for a collection of documents, demonstating the usefulness of the model.

...read moreread less

Applying Graph-based Keyword Extraction to Document Retrieval

Citations

DivGraphPointer: A Graph Pointer Network for Extracting Diverse Keyphrases

Fast and Constrained Absent Keyphrase Generation by Prompt-Based Learning

Hyperbolic Relevance Matching for Neural Keyphrase Extraction

DivGraphPointer: A Graph Pointer Network for Extracting Diverse Keyphrases

Extraction of keyphrases from single document based on hierarchical concepts

References

The anatomy of a large-scale hypertextual Web search engine

The Anatomy of a Large-Scale Hypertextual Web Search Engine.

Introduction to Information Retrieval

Term Weighting Approaches in Automatic Text Retrieval

A vector space model for automatic indexing

Related Papers (5)

TextRank: Bringing Order into Text

A visual attention-based keyword extraction for document classification

Improved automatic keyword extraction given more linguistic knowledge

Content-Based Document Image Retrieval Based on Document Modeling

OntDR: An Ontology-based Augmented Method for Document Retrieval