scispace - formally typeset
Open AccessJournal ArticleDOI

Graph based Representation and Analysis of Text Document: A Survey of Techniques

Sheetal S. Sonawane, +1 more
- 18 Jun 2014 - 
- Vol. 96, Iss: 19, pp 1-8
Reads0
Chats0
TLDR
The survey results shows that Graph based representation is appropriate way of representing text document and improved result of analysis over traditional model for different text applications.
Abstract
A common and standard approach to model text document is bag-of-words. This model is suitable for capturing word frequency, however structural and semantic information is ignored. Graph representation is mathematical constructs and can model relationship and structural information effectively. A text can appropriately represented as Graph using vertex as feature term and edge relation can be significant relation between the feature terms. Text representation using Graph model provides computations related to various operations like term weight, ranking which is helpful in many applications in information retrieval. This paper presents a systematic survey of existing work on Graph based representation of text and also focused on Graph based analysis of text document for different operations in information retrieval. In this process taxonomy of Graph based representation and analysis of text document is derived and result of different methods of Graph based text representation and analysis are discussed. The survey results shows that Graph based representation is appropriate way of representing text document and improved result of analysis over traditional model for different text applications.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Deep Learning for Intelligent Wireless Networks: A Comprehensive Survey

TL;DR: A comprehensive survey of the applications of DL algorithms for different network layers, including physical layer modulation/coding, data link layer access control/resource allocation, and routing layer path search, and traffic balancing is performed.
Posted Content

Language as an Evolving Word Web

TL;DR: The authors treated language as a self-organizing network of interacting words and showed that the distribution of the number of connections of words in such a network is of a peculiar form which includes two pronounced power-law regions.
Journal Article

An Overview of Graph-Based Keyword Extraction Methods and Approaches

TL;DR: The systematic review of methods was gathered which resulted in a comprehensive review of existing approaches, which provides guidelines for future research plans and encourages the development of new graph-based approaches for keyword extraction.
Journal ArticleDOI

A graph based keyword extraction model using collective node weight

TL;DR: This paper proposes a novel unsupervised graph based keyword extraction method called Keyword Extraction using Collective Node Weight (KECNW) which determines the importance of a keyword by collectively taking various influencing parameters.
Proceedings ArticleDOI

InfraNodus: Generating Insight Using Text Network Analysis

TL;DR: The tool (InfraNodus) can be used by researchers and writers to organize and to better understand their notes, to measure the level of bias in discourse, and to identify the parts of the discourse where there is a potential for insight and new ideas.
References
More filters
Proceedings Article

TextRank: Bringing Order into Text

Rada Mihalcea, +1 more
TL;DR: TextRank, a graph-based ranking model for text processing, is introduced and it is shown how this model can be successfully used in natural language applications.
Journal ArticleDOI

The large-scale structure of semantic networks: statistical analyses and a model of semantic growth.

TL;DR: A simple model for semantic growth is described, in which each new word or concept is connected to an existing network by differentiating the connectivity pattern of an existing node, which generates appropriate small-world statistics and power-law connectivity distributions.
Journal Article

LexRank: Graph-based Centrality as Salience in Text Summarization

TL;DR: A new approach, LexRank, for computing sentence importance based on the concept of eigenvector centrality in a graph representation of sentences is considered and the LexRank with threshold method outperforms the other degree-based techniques including continuous LexRank.
Journal ArticleDOI

Topology of the conceptual network of language

TL;DR: This work maps out the conceptual network of the English language, with the connections being defined by the entries in a Thesaurus dictionary, and finds that this network presents a small-world structure, and appears to exhibit an asymptotic scale-free feature with algebraic connectivity distribution.
Journal ArticleDOI

Global organization of the Wordnet lexicon

TL;DR: A quantitative study of the graph structure of Wordnet to understand the global organization of the lexicon and shows that Wordnet has global properties common to many self-organized systems, and polysemy organizes the semantic graph in a compact and categorical representation, in a way that may explain the ubiquity of polyse my across languages.