Proceedings Article

A Multi-Agent Simulation Framework for Spiders Traversing the Semantic Web

TLDR
This paper introduces BioSpider, an agent-based simulation framework for developing and testing autonomous, intelligent, semantically-focused Web spiders, and assumes a direct analogy of the problem at hand with a multi-variate ecosystem, where each member is self-maintaining.
Abstract
Although search engines traditionally use spiders for traversing and indexing the web, there has not yet been any methodological attempt to model, deploy and test learning spiders. The flourishing of the Semantic Web provides understandable information that may improve the accuracy of search engines. In this paper, we introduce BioSpider, an agent-based simulation framework for developing and testing autonomous, intelligent, semantically-focused web spiders. BioSpider assumes a direct analogy of the problem at hand with a multi-variate ecosystem, where each member is self-maintaining. The population of the ecosystem comprises cooperative spiders incorporating communication, mobility and learning skills, striving to improve efficiency. Genetic algorithms and classifier rules have been employed for spider adaptation and learning. A set of experiments has been performed in order to qualitatively test the efficacy and applicability of the proposed approach.

Citations
Journal Article

Intelligent Social Media Indexing and Sharing Using an Adaptive Indexing Search Engine

TL;DR: The adaptive search engine presented allows communities to create and update social media indexes efficiently, and is able to instill and propagate deep knowledge into social media concerning the advanced search and usage of media resources.
Journal Article

Agent Based Framework for Semantic Web Content Mining

TL;DR: This work focuses on proving an agent-based framework for mining semantic web contents using clustering techniques, which will help provide the user with query-relevant clusters of web contents that better satisfy user requirements and make optimal use of web surfing time.

Semantic image similarity based on deep knowledge for effective image retrieval

Yuanxi Li
TL;DR: By exploiting the context of Web images, knowledge base and ontology-based similarities, through the analysis of user behavior of image similarity evaluation, a set of formulas is established which allows efficient and accurate semantic similarity measurement of images.

Reducing distributed URLs crawling time: a comparison of GUIDs and IDs

TL;DR: This research project compares the crawling speed achieved with dynamic globally unique identifiers (GUIDs) against that achieved with traditional static identifiers (IDs) and shows that URL crawling time can be reduced by up to 7% by using GUIDs instead of IDs.
References
Journal Article

The anatomy of a large-scale hypertextual Web search engine

TL;DR: This paper provides an in-depth description of Google, a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext and looks at the problem of how to effectively deal with uncontrolled hypertext collections where anyone can publish anything they want.
Journal Article

The Anatomy of a Large-Scale Hypertextual Web Search Engine.

Sergey Brin, +1 more
01 Jan 1998
TL;DR: Google is a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext and is designed to crawl and index the Web efficiently and produce much more satisfying search results than existing systems.
Proceedings Article

An adaptive model for optimizing performance of an incremental web crawler

TL;DR: This paper outlines the design of a web crawler implemented for IBM Almaden's WebFountain project and describes an optimization model for controlling the crawl strategy and shows that there are compromise objectives which lead to good strategies that are robust against a number of criteria.
Proceedings Article

Evaluating topic-driven web crawlers

TL;DR: This work proposes three different methods to evaluate crawling strategies and applies the proposed metrics to compare three topic-driven crawling algorithms based on similarity ranking, link analysis, and adaptive agents.
Journal Article

Neighborhood models of plant population dynamics. 2. Multi-species models of annuals

TL;DR: Models developed for the dynamics of multi-species communities of annual plants that lack seed dormancy demonstrate that dispersal may markedly influence the outcome of competition among plant species, even in a physically homogeneous environment, due to an effect of dispersal on the spatial distribution of individuals.