scispace - formally typeset
S

Srikanth Shoroff

Researcher at Microsoft

Publications -  27
Citations -  1259

Srikanth Shoroff is an academic researcher from Microsoft. The author has contributed to research in topics: File system & Stub file. The author has an hindex of 15, co-authored 27 publications receiving 1259 citations.

Papers
More filters
Patent

Method and system for detecting duplicate documents in web crawls

TL;DR: A web crawler application takes advantage of a document store's ability to provide a content identifier (CID) having a value that is a unique function of the physical storage location of a data object or document, such as a web page.
Patent

Enforcing access control on resources at a location other than the source location

TL;DR: In this article, access security can be enforced at a search engine associated with an indexing system that compiles references to documents at any number of network locations, and the search engine discloses to the requesting user only those documents that the user is authorized to read.
Patent

Method and system for incremental web crawling

TL;DR: In this paper, a Web crawler creates a history table containing a list of URLs for each folder and document found in the first full crawl, also including a local commit time (LCT) for each document and a deleted documents count (DDC).
Patent

Monitoring document changes in a file system of documents with the document change information stored in a persistent log

TL;DR: In this article, a method and system for improved monitoring of document changes in a search engine by an indexing program is presented, which utilizes the logged change information to efficiently maintain the indexes, and to rapidly update the indexes after a shutdown and subsequent restart.
Patent

Monitoring document changes with persistent update sequence numbers

TL;DR: In this article, a method and system for improved monitoring of document changes in a search engine, such as by an indexing program, is presented, which utilizes the logged change information to efficiently maintain the indexes or the like, and rapidly update the indexes after a shutdown and subsequent restart.