Author

Rakesh Mohan

Bio: Rakesh Mohan is an academic researcher from IBM. The author has contributed to research on the topics of the Internet and user interfaces. The author has an h-index of 32 and has co-authored 73 publications receiving 4681 citations.


Papers
Journal ArticleDOI
TL;DR: This work presents a system that adapts multimedia Web documents to optimally match the capabilities of the client device requesting them, using a representation scheme called the InfoPyramid that provides a multimodal, multiresolution representation hierarchy for multimedia.
Abstract: Content delivery over the Internet needs to address both the multimedia nature of the content and the capabilities of the diverse client platforms the content is being delivered to. We present a system that adapts multimedia Web documents to optimally match the capabilities of the client device requesting them. This system has two key components: 1) a representation scheme called the InfoPyramid that provides a multimodal, multiresolution representation hierarchy for multimedia, and 2) a customizer that selects the best content representation to meet the client capabilities while delivering the most value. We model the selection process as a resource allocation problem in a generalized rate distortion framework. In this framework, we address the issue of both multiple media types in a Web document and multiple resource types at the client. We extend this framework to allow prioritization of the content items in a Web document. We illustrate our content adaptation technique with a Web server that adapts multimedia news stories to clients as diverse as workstations, PDAs and cellular phones.

652 citations

Patent
23 Apr 1999
Abstract: A method of adapting multimedia content to a client device, wherein the multimedia content includes one or more items and the client device has capabilities and resources associated therewith, is provided. The method includes transcoding the multimedia content into a plurality of transcoded content versions, wherein the plurality of transcoded content versions have different modalities and resolutions associated therewith. Next, the transcoded content versions that are not compatible with client device capabilities are filtered out. Then, at least a portion of the resources associated with the client device are allocated among the one or more items of the multimedia content. Lastly, one or more of the transcoded versions of the multimedia content are selected to generate a customized content based on allocation of the client device resources.
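
The filter, allocate, and select steps described in this abstract can be sketched in a few lines. This is only an illustration under stated assumptions, not the patented method: the `Version` fields, the single `budget_kb` resource, the numeric `value` score, and the exhaustive search are all hypothetical choices made for the example, and transcoding is assumed to have already produced the candidate versions.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class Version:
    """One transcoded version of a content item (all fields are hypothetical)."""
    item: str       # which content item this version represents
    modality: str   # e.g. "video", "image", "text"
    size_kb: int    # resource cost on the client, in an assumed unit
    value: float    # assumed score for how much content value is preserved


def customize(versions, supported_modalities, budget_kb):
    """Pick at most one version per item, maximizing total value within the budget."""
    # Filter out versions the client device cannot render.
    compatible = [v for v in versions if v.modality in supported_modalities]

    # Group candidates by item; None stands for "leave this item out".
    by_item = {}
    for v in compatible:
        by_item.setdefault(v.item, [None]).append(v)

    # Allocate the budget by trying every combination of one choice per item.
    # Exhaustive search is only practical for a handful of items.
    best, best_value = [], 0.0
    for combo in product(*by_item.values()):
        chosen = [v for v in combo if v is not None]
        size = sum(v.size_kb for v in chosen)
        value = sum(v.value for v in chosen)
        if size <= budget_kb and value > best_value:
            best, best_value = chosen, value
    return best
```

For a document with many items, the exhaustive search would need to be replaced by a scalable allocation strategy; the sketch only shows how the three steps fit together.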

387 citations

Patent
Lawrence D. Bergman1, Michelle Y Kim1, Chung-Sheng Li1, Rakesh Mohan1, John R. Smith1 
03 Dec 1999
TL;DR: A framework is provided for describing multimedia content, together with a system in which a plurality of multimedia storage devices employing the content description methods of the present invention can interoperate.
Abstract: A framework is provided for describing multimedia content and a system in which a plurality of multimedia storage devices employing the content description methods of the present invention can interoperate. In accordance with one form of the present invention, the content description framework is a description scheme (DS) for describing streams or aggregations of multimedia objects, which may comprise audio, images, video, text, time series, and various other modalities. This description scheme can accommodate an essentially limitless number of descriptors in terms of features, semantics or metadata, and facilitate content-based search, index, and retrieval, among other capabilities, for both streamed and aggregated multimedia objects.

295 citations

Patent
Rudolf Maarten Bolle1, Jonathan H. Connell1, Norman Haas1, Rakesh Mohan1, Gabriel Taubin1 
24 Feb 1995
TL;DR: An image processing system recognizes objects within a scene lit by a controlled illumination source. The system can recognize objects independent of size and number and can be trained to recognize objects that it was not originally programmed to recognize.
Abstract: The present system and apparatus uses image processing to recognize objects within a scene. The system includes an illumination source for illuminating the scene. By controlling the illumination source, an image processing system can take a first digitized image of the scene with the object illuminated at a higher level and a second digitized image with the object illuminated at a lower level. Using an algorithm, the image of the object(s) is segmented from a background image of the scene by a comparison of the two digitized images taken. A processed image of the object(s), which can be used to characterize features, is then compared to stored reference images. The object is recognized when a match occurs. The system can recognize objects independent of size and number and can be trained to recognize objects that it was not originally programmed to recognize.
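
The comparison of the two digitized images can be pictured with a small sketch. The abstract does not specify the comparison rule, so the one used here is an assumption: a pixel is treated as part of the object when its brightness changes by more than a hypothetical `threshold` between the two illumination levels. Images are plain nested lists of grayscale values to keep the example self-contained.

```python
def segment_by_illumination(bright, dim, threshold=50):
    """Return a boolean mask marking pixels assumed to belong to the object.

    bright, dim: grayscale images (lists of rows) of the same scene taken at
    the higher and lower illumination level. threshold is a hypothetical
    tuning parameter, not a value taken from the source.
    """
    same_shape = len(bright) == len(dim) and all(
        len(b_row) == len(d_row) for b_row, d_row in zip(bright, dim)
    )
    if not same_shape:
        raise ValueError("images must have the same shape")

    # Pixels that respond strongly to the illumination change are kept;
    # pixels that barely change are treated as background.
    return [
        [(b - d) > threshold for b, d in zip(b_row, d_row)]
        for b_row, d_row in zip(bright, dim)
    ]
```

The resulting mask would then select the object pixels from which features are computed before matching against the stored reference images.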

250 citations

Patent
29 Jan 1999
TL;DR: A system is described for modifying Web content files for display on pervasive computing devices, which have smaller displays and various performance limitations compared with desktop computing devices. A requested HTML file is analyzed for a link to a content modification file that contains information about how to modify elements within the HTML file so as to render it displayable via the pervasive computing device.
Abstract: Systems, methods and computer program products are provided for modifying Web content files, such as HTML files, for display via pervasive computing devices that have smaller displays and various performance limitations compared with desktop computing devices. Upon receiving a request from a pervasive computing device for an HTML file, the HTML file is analyzed for a link to a content modification file that contains information about how to modify elements within the HTML file so as to render the HTML file displayable via the pervasive computing device.

190 citations


Cited by
01 Apr 1997
TL;DR: The objective of this paper is to give a comprehensive introduction to applied cryptography with an engineer or computer scientist in mind, with emphasis on the knowledge needed to create practical systems that support integrity, confidentiality, or authenticity.
Abstract: The objective of this paper is to give a comprehensive introduction to applied cryptography with an engineer or computer scientist in mind. The emphasis is on the knowledge needed to create practical systems that support integrity, confidentiality, or authenticity. Topics covered include an introduction to the concepts in cryptography, attacks against cryptographic systems, key use and handling, random bit generation, encryption modes, and message authentication codes. Recommendations on algorithms and further reading are given at the end of the paper. This paper should enable the reader to build, understand and evaluate system descriptions and designs based on the cryptographic components described in the paper.

2,188 citations

Patent
12 Nov 2013
TL;DR: Cell phones and other portable devices are equipped with a variety of technologies by which existing functionality can be improved and new functionality can be provided, including visual search capabilities and determining appropriate actions responsive to different image inputs.
Abstract: Cell phones and other portable devices are equipped with a variety of technologies by which existing functionality can be improved, and new functionality can be provided. Some relate to visual search capabilities, and determining appropriate actions responsive to different image inputs. Others relate to processing of image data. Still others concern metadata generation, processing, and representation. Yet others relate to coping with fixed focus limitations of cell phone cameras, e.g., in reading digital watermark data. Still others concern user interface improvements. A great number of other features and arrangements are also detailed.

2,033 citations

Journal ArticleDOI
TL;DR: This paper addresses the problem of retrieving images from large image databases with a method based on local grayvalue invariants which are computed at automatically detected interest points and allows for efficient retrieval from a database of more than 1,000 images.
Abstract: This paper addresses the problem of retrieving images from large image databases. The method is based on local grayvalue invariants which are computed at automatically detected interest points. A voting algorithm and semilocal constraints make retrieval possible. Indexing allows for efficient retrieval from a database of more than 1,000 images. Experimental results show correct retrieval in the case of partial visibility, similarity transformations, extraneous features, and small perspective deformations.
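
The voting step mentioned in this abstract can be illustrated with a deliberately simplified sketch. It is not the paper's algorithm: descriptors are assumed to be short numeric tuples, matching uses plain Euclidean distance with a hypothetical `max_distance` cutoff, and the semilocal constraints and indexing structure are left out entirely.

```python
from collections import Counter
from math import dist


def retrieve(query_descriptors, database, max_distance=0.5):
    """Rank database images by how many query descriptors they match.

    database maps an image id to the list of descriptors computed at that
    image's interest points. Each query descriptor casts one vote for every
    image that contains a descriptor within max_distance of it.
    """
    votes = Counter()
    for q in query_descriptors:
        for image_id, descriptors in database.items():
            if any(dist(q, d) <= max_distance for d in descriptors):
                votes[image_id] += 1
    # Highest vote count first; images with no votes are omitted.
    return votes.most_common()
```

Because each query descriptor votes independently, an image can still rank first when only part of it is visible in the query, which is the property the abstract highlights. The linear scan here is only workable for tiny databases.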

1,756 citations

Patent
30 Mar 2001
TL;DR: A client-server system is described in which at least one local client computer provides a user interface to interact with at least one remote server computer, which implements data processing in response to the local client computer.
Abstract: Client-server systems and methods are described for transferring data via a network, including a wireless network, between a server (61) and one or more clients (41) or browsers that are spatially distributed (i.e., situated at different locations). At least one local client computer provides a user interface to interact with at least one remote server computer, which implements data processing in response to the local client computer. The user interface may be a browser or a thin client.

1,427 citations

Patent
18 Jun 2010
TL;DR: An interactive television program guide system is provided that gives users an opportunity to select programs for recording on a remote media server and to designate gift recipients for whom programs may be recorded.
Abstract: An interactive television program guide system is provided. An interactive television program guide provides users with an opportunity to select programs for recording on a remote media server. Programs may also be recorded on a local media server. The program guide provides users with VCR-like control over programs that are played back from the media servers and over real-time cached copies of the programs. The program guide also provides users with an opportunity to designate gift recipients for whom programs may be recorded.

1,316 citations