Document creation with Information retrieval system support

doi:10.1109/CINTI.2013.6705181

Proceedings ArticleDOI

Document creation with Information retrieval system support

Timotej Betina, +1 more

- pp 147-151

Chats0

TLDR

The Suggested retrieval method consists of text extraction and segmentation from different text document formats and source code documents into logical and semantic segments that are subsequently used to calculate similarity with the new document using an optimal matching algorithm.

Abstract:

This paper proposes a method to directly facilitate the author's needs during the creation of text documents or source code. The Information retrieval system is integrated with a text editor in order to find similar documents to the created document during its writing phase. The Suggested retrieval method consists of text extraction and segmentation from different text document formats and source code documents into logical and semantic segments. These segments are subsequently used to calculate similarity with the new document using an optimal matching algorithm.

Citations

PDF

Open Access

More filters

Patent

Text recommendation method

Yu Fucai, +3 more

TL;DR: In this paper, the authors proposed a text recommendation method based on a matching factor for semantic similarity among the synonyms, which is added on the base of the traditional angle cosine algorithm, the influence of the synonym of the text on the similarity is considered, and the levels of similarity between texts and between the text and the user model are more accurately calculated.

...read moreread less

Journal ArticleDOI

A Novel Approach for Document Retrieval System with User Preferences

Sandeep Kaur, +1 more

- 18 Sep 2014 -

International Journal of Computer Applic...

TL;DR: The probabilistic ranking based Kuhn munkres algorithm uses the graphical model such as Bayesian statistics with Bayesian's theorem to find the probability of documents for more relevant results.

...read moreread less

References

PDF

Open Access

More filters

Journal Article

TextTiling: segmenting text into multi-paragraph subtopic passages

Marti A. Hearst

- 01 Mar 1997 -

Computational Linguistics

TL;DR: The algorithm is fully implemented and is shown to produce segmentation that corresponds well to human judgments of the subtopic boundaries of 12 texts, which should be useful for many text analysis tasks, including information retrieval and summarization.

...read moreread less

Proceedings ArticleDOI

Effective retrieval of structured documents

Ross Wilkinson

TL;DR: This work considers what information is needed to retrieve effectively and shows that knowledge of the structure of documents can lead to improved retrieval performance.

...read moreread less

Proceedings ArticleDOI

AIDAS: incremental logical structure discovery in PDF documents

Anjo Anjewierden

TL;DR: AIDAS is part of a research project in which the aim is to turn technical manuals into a database of indexed training material and the approach AIDAS uses to extract the logical document structure from PDF documents is described.

...read moreread less

Proceedings ArticleDOI

Enhancing document structure analysis using visual analytics

Andreas Stoffel, +3 more

TL;DR: A new approach for analyzing the logical structure of text documents is presented, combining state-of-the-art machine learning with novel interactive visualization techniques, allowing a quick adaptation of the structure analysis process to unknown document classes and new tasks without requiring a predefined training set.

...read moreread less

Journal ArticleDOI

Beyond topical similarity: a structural similarity measure for retrieving highly similar documents

Xiaojun Wan

- 25 Mar 2008 -

Knowledge and Information Systems

TL;DR: The proposed notion of document structural similarity is expected to further evaluate document similarity by comparing document subtopic structures, and the good ability of the proposed overall measure with all three factors to further find highly similar documents from those topically similar documents is much better than that of the popular measures and other baseline structural similarity measures.

...read moreread less

IEEE Intelligent Systems & Their Applica...

Document creation with Information retrieval system support

Citations

Text recommendation method

A Novel Approach for Document Retrieval System with User Preferences

References

TextTiling: segmenting text into multi-paragraph subtopic passages

Effective retrieval of structured documents

AIDAS: incremental logical structure discovery in PDF documents

Enhancing document structure analysis using visual analytics

Beyond topical similarity: a structural similarity measure for retrieving highly similar documents

Related Papers (5)

Biomedical Text Mining Applied to Document Retrieval and Semantic Indexing

Multi-modal Information Integration for Document Retrieval

User-defined template for identifying document type and extracting information from documents

Metadata for Integrating Chinese Text and Speech Documents in a Multimedia Retrieval System

Lightweight document matching for help-desk applications