scispace - formally typeset
Book ChapterDOI

Web Recency Maintenance Protocol

TLDR
A new protocol namely Web Recency Maintenance Protocol (WRMP) that employs "push" mechanism to maintain the currency of the World Wide Web at Search Engine (SE) site and gives an additional advantage to the SE by enabling it to prioritize the Web sites based on the change frequency.
Abstract
In this paper, we propose a new protocol namely Web Recency Maintenance Protocol (WRMP) that employs "push" mechanism to maintain the currency of the World Wide Web (WWW) at Search Engine (SE) site. As of this writing SEs are adopting "pull" technology, by employing search bots to collect the data from WWW. Pull technology can no longer cater to the growing user base and the information explosion on the web. In view of the gigantic structure and dynamic nature of the WWW, it is impractical to maintain the currency using this technology. This practical limitation makes SEs yield stale results. WRMP ensures the real time update by initiating an update agent as and when the server is modified or updated. The availability of high processing speed enables the SE to maintain the currency of the whole WWW in real time. This protocol cuts down the unwarranted roaming of the search bots and takes the real purpose of the web server establishment i.e. reach the public over Internet as the motivating factor. This protocol gives an additional advantage to the SE by enabling it to prioritize the Web sites based on the change frequency. We have given a formal specification for the protocol and an M/M/1 model based analysis.

read more

Citations
More filters
Proceedings Article

Intelligent Search Engine: Simulation to Implementation.

TL;DR: This paper presents architecture and design specifications for new generation search engines highlighting the need for intelligence in search engines and gives a knowledge framework to capture intuition and characterize relevancy on a per user basis by quantifying user’s knowledge quotient and user's domain expertise.
Book ChapterDOI

User Relevancy Improvisation Protocol

TL;DR: A new protocol namely User Relevancy Improvisation Protocol (URIP) that employs ”push” mechanism to obtain the most current and concise summary of the web sites to eliminate spider activity and yield the minimal and most relevant result set.
Proceedings Article

Architecture, Design, Implementation and Performance Evaluation of Sarvagna: New Generation Search Engine.

TL;DR: A generic architecture for new generation search engine is given and how a prototype search engine SARVAGNA1, based on that architecture is developed is discussed.

Analysis of the Web, Processor Soeed and Bandwidth Growth: Impact on Search Engine Design.

TL;DR: This paper explores the web growth, processor speedup, and bandwidth growth over the time and proves that the web in its entirety is not an uncontrolled system that explodes indefinitely.
References
More filters
Proceedings ArticleDOI

On power-law relationships of the Internet topology

TL;DR: These power-laws hold for three snapshots of the Internet, between November 1997 and December 1998, despite a 45% growth of its size during that period, and can be used to generate and select realistic topologies for simulation purposes.
Journal ArticleDOI

Graph structure in the Web

TL;DR: The study of the web as a graph yields valuable insight into web algorithms for crawling, searching and community discovery, and the sociological phenomena which characterize its evolution.
Journal ArticleDOI

Searching the World Wide Web

TL;DR: The coverage and recency of the major World Wide Web search engines was analyzed, yielding some surprising results, including a lower bound on the size of the indexable Web of 320 million pages.
Journal ArticleDOI

Searching the Web

TL;DR: An overview of current Web search engine design is offered, introducing a generic search engine architecture and the results of several performance analyses conducted to compare different designs.
Journal ArticleDOI

Information retrieval on the web

TL;DR: Overall trends cited by the sources are consistent and point to exponential growth in the past and in the coming decade, and the development of new techniques targeted to resolve some of the problems associated with Web-based information retrieval are discussed.