Proceedings ArticleDOI
Document creation with Information retrieval system support
Timotej Betina,Ivan Polasek +1 more
- pp 147-151
Reads0
Chats0
TLDR
The Suggested retrieval method consists of text extraction and segmentation from different text document formats and source code documents into logical and semantic segments that are subsequently used to calculate similarity with the new document using an optimal matching algorithm.Abstract:
This paper proposes a method to directly facilitate the author's needs during the creation of text documents or source code. The Information retrieval system is integrated with a text editor in order to find similar documents to the created document during its writing phase. The Suggested retrieval method consists of text extraction and segmentation from different text document formats and source code documents into logical and semantic segments. These segments are subsequently used to calculate similarity with the new document using an optimal matching algorithm.read more
Citations
More filters
Patent
Text recommendation method
TL;DR: In this paper, the authors proposed a text recommendation method based on a matching factor for semantic similarity among the synonyms, which is added on the base of the traditional angle cosine algorithm, the influence of the synonym of the text on the similarity is considered, and the levels of similarity between texts and between the text and the user model are more accurately calculated.
Journal ArticleDOI
A Novel Approach for Document Retrieval System with User Preferences
Sandeep Kaur,Nidhi Bhatla +1 more
TL;DR: The probabilistic ranking based Kuhn munkres algorithm uses the graphical model such as Bayesian statistics with Bayesian's theorem to find the probability of documents for more relevant results.
References
More filters
Journal Article
TextTiling: segmenting text into multi-paragraph subtopic passages
TL;DR: The algorithm is fully implemented and is shown to produce segmentation that corresponds well to human judgments of the subtopic boundaries of 12 texts, which should be useful for many text analysis tasks, including information retrieval and summarization.
Proceedings ArticleDOI
Effective retrieval of structured documents
TL;DR: This work considers what information is needed to retrieve effectively and shows that knowledge of the structure of documents can lead to improved retrieval performance.
Proceedings ArticleDOI
AIDAS: incremental logical structure discovery in PDF documents
TL;DR: AIDAS is part of a research project in which the aim is to turn technical manuals into a database of indexed training material and the approach AIDAS uses to extract the logical document structure from PDF documents is described.
Proceedings ArticleDOI
Enhancing document structure analysis using visual analytics
TL;DR: A new approach for analyzing the logical structure of text documents is presented, combining state-of-the-art machine learning with novel interactive visualization techniques, allowing a quick adaptation of the structure analysis process to unknown document classes and new tasks without requiring a predefined training set.
Journal ArticleDOI
Beyond topical similarity: a structural similarity measure for retrieving highly similar documents
TL;DR: The proposed notion of document structural similarity is expected to further evaluate document similarity by comparing document subtopic structures, and the good ability of the proposed overall measure with all three factors to further find highly similar documents from those topically similar documents is much better than that of the popular measures and other baseline structural similarity measures.