Search or ask a question

Showing papers by "Gaurav Harit published in 2007"

PDF

Open Access

Proceedings Article•DOI•

Word image based latent semantic indexing for conceptual querying in document image databases

[...]

Subhashis Banerjee¹, Gaurav Harit¹, Santanu Chaudhury¹•Institutions (1)

Indian Institute of Technology Delhi¹

23 Sep 2007

TL;DR: It is shown through extensive experiments on a large database that use of LSA for document images provides improvements in retrieval precision as is the case with electronic text documents.

...read moreread less

Abstract: In this paper we present an application of latent semantic analysis (LSA) for indexing and retrieval of document images with text The query is specified as a set of word images and the documents which best match with the query representation in the the latent semantic space are retrieved We show through extensive experiments on a large database that use of LSA for document images provides improvements in retrieval precision as is the case with electronic text documents

...read moreread less

7 citations

Proceedings Article•DOI•

An Integrated Scheme for Compression and Interactive Access to Document Images

[...]

Gaurav Harit¹, Ritu Garg¹, Santanu Chaudhury¹•Institutions (1)

Indian Institute of Technology Delhi¹

05 Mar 2007

TL;DR: An integrated scheme for document image compression is presented which preserves the layout structure, and still allows the display of textual portions to adapt to the user preferences and screen area, and derives an SVG representation of the complete document image.

...read moreread less

Abstract: We present an integrated scheme for document image compression which preserves the layout structure, and still allows the display of textual portions to adapt to the user preferences and screen area. We encode the layout structure of the document images in an XML representation. The textual components and picture components are compressed separately into different representations. We derive an SVG (scalable vector graphics) representation of the complete document image. Compression is achieved since the word-images are encoded using specifications for geometric primitives that compose a word. A document rendered from its SVG representation can be adapted for display and interactive access through common browsers on desktop as well as mobile devices. We demonstrate the effectiveness of the proposed scheme for document access

...read moreread less

5 citations

Proceedings Article•DOI•

Pàtrà: A Novel Document Architecture for Integrating Handwriting with Audio-Visual Information

[...]

Gaurav Harit¹, V. Mankar¹, Santanu Chaudhury¹•Institutions (1)

Indian Institute of Technology Delhi¹

23 Sep 2007

TL;DR: An email application in which the users are provided with an authoring and rendering environment to compose, view, and reply to messages in the form of Patra, an integrated document architecture which incorporates handwritten illustrations captured and rendered in a temporal fashion synchronized with audio, video, text, and image data.

...read moreread less

Abstract: In this paper we present Patra - an integrated document architecture which incorporates handwritten illustrations captured and rendered in a temporal fashion synchronized with audio, video, text, and image data. The architecture of Patra permits non-linear growth in the form of multiple hierarchically organized play streams. Semantic metadata is also an integral part of Patra which serves a useful purpose of organizing such documents in a collection. We have developed an email application in which the users are provided with an authoring and rendering environment to compose, view, and reply to messages in the form of Patra.

...read moreread less

4 citations

Journal Article•DOI•

Video Shot Characterization Using Principles of Perceptual Prominence and Perceptual Grouping in Spatio–Temporal Domain

[...]

Gaurav Harit¹, Santanu Chaudhury¹•Institutions (1)

Indian Institute of Technology Delhi¹

01 Dec 2007-IEEE Transactions on Circuits and Systems for Video Technology

TL;DR: A computational model for analyzing a video shot based on a novel principle of perceptual prominence that captures the key aspects of mise-en-scene required for characterizing a video scene.

...read moreread less

Abstract: We present a novel approach for applying perceptual grouping principles to the spatio-temporal domain of video. Our perceptual grouping scheme, applied on blobs, makes use of a specified spatio-temporal coherence model. The grouping scheme identifies the blob cliques or perceptual clusters in the scene. We propose a computational model for analyzing a video shot based on a novel principle of perceptual prominence. The principle of perceptual prominence captures the key aspects of mise-en-scene required for characterizing a video scene.

...read moreread less

1 citations

Proceedings Article•DOI•

Using Object Models as Domain Knowledge in Perceptual Organization: An Approach for Object Category Identification in Video Sequences

[...]

Gaurav Harit¹, Rajesh Bharatia¹, Santanu Chaudhury¹•Institutions (1)

Indian Institutes of Technology¹

05 Mar 2007

TL;DR: A framework which integrates object-model knowledge with the perceptual organization process and demonstrates the advantages of the add-on grouping evidences as contributed by the object models for a more robust perceptual organization in the spatio-temporal domain is presented.

...read moreread less

Abstract: In this paper we present a framework which integrates object-model knowledge with the perceptual organization process. We demonstrate the advantages of the add-on grouping evidences as contributed by the object models for a more robust perceptual organization in the spatio-temporal domain. Our system performs detection of foreground objects along with recognition in video

...read moreread less