scispace - formally typeset
Proceedings ArticleDOI

Document creation with Information retrieval system support

Reads0
Chats0
TLDR
The Suggested retrieval method consists of text extraction and segmentation from different text document formats and source code documents into logical and semantic segments that are subsequently used to calculate similarity with the new document using an optimal matching algorithm.
Abstract
This paper proposes a method to directly facilitate the author's needs during the creation of text documents or source code. The Information retrieval system is integrated with a text editor in order to find similar documents to the created document during its writing phase. The Suggested retrieval method consists of text extraction and segmentation from different text document formats and source code documents into logical and semantic segments. These segments are subsequently used to calculate similarity with the new document using an optimal matching algorithm.

read more

Citations
More filters
Patent

Text recommendation method

TL;DR: In this paper, the authors proposed a text recommendation method based on a matching factor for semantic similarity among the synonyms, which is added on the base of the traditional angle cosine algorithm, the influence of the synonym of the text on the similarity is considered, and the levels of similarity between texts and between the text and the user model are more accurately calculated.
Journal ArticleDOI

A Novel Approach for Document Retrieval System with User Preferences

TL;DR: The probabilistic ranking based Kuhn munkres algorithm uses the graphical model such as Bayesian statistics with Bayesian's theorem to find the probability of documents for more relevant results.
References
More filters
Journal Article

TextTiling: segmenting text into multi-paragraph subtopic passages

TL;DR: The algorithm is fully implemented and is shown to produce segmentation that corresponds well to human judgments of the subtopic boundaries of 12 texts, which should be useful for many text analysis tasks, including information retrieval and summarization.
Proceedings ArticleDOI

Effective retrieval of structured documents

TL;DR: This work considers what information is needed to retrieve effectively and shows that knowledge of the structure of documents can lead to improved retrieval performance.
Proceedings ArticleDOI

AIDAS: incremental logical structure discovery in PDF documents

TL;DR: AIDAS is part of a research project in which the aim is to turn technical manuals into a database of indexed training material and the approach AIDAS uses to extract the logical document structure from PDF documents is described.
Proceedings ArticleDOI

Enhancing document structure analysis using visual analytics

TL;DR: A new approach for analyzing the logical structure of text documents is presented, combining state-of-the-art machine learning with novel interactive visualization techniques, allowing a quick adaptation of the structure analysis process to unknown document classes and new tasks without requiring a predefined training set.
Journal ArticleDOI

Beyond topical similarity: a structural similarity measure for retrieving highly similar documents

TL;DR: The proposed notion of document structural similarity is expected to further evaluate document similarity by comparing document subtopic structures, and the good ability of the proposed overall measure with all three factors to further find highly similar documents from those topically similar documents is much better than that of the popular measures and other baseline structural similarity measures.
Related Papers (5)