Open Access Proceedings ArticleDOI

Efficient peer-to-peer keyword searching

TL;DR: A distributed search engine based on a distributed hash table is designed and analyzed, and the simulation results predict that the search engine can answer an average query in under one second, using under one kilobyte of bandwidth.
Abstract
The recent file storage applications built on top of peer-to-peer distributed hash tables lack search capabilities. We believe that search is an important part of any document publication system. To that end, we have designed and analyzed a distributed search engine based on a distributed hash table. Our simulation results predict that our search engine can answer an average query in under one second, using under one kilobyte of bandwidth.
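The abstract's idea can be sketched concretely: a minimal, illustrative model (all names and the node-hashing scheme below are assumptions, not details from the paper) partitions a global inverted index over DHT nodes by keyword, so a multi-keyword query fetches each keyword's posting list from its responsible node and intersects them.

```python
import hashlib

NUM_NODES = 8  # toy network size; a real DHT uses consistent hashing

def responsible_node(term: str) -> int:
    """Map a keyword to the node that stores its posting list."""
    digest = hashlib.sha1(term.encode()).digest()
    return digest[0] % NUM_NODES

# per-node inverted index: node id -> {term -> set of document ids}
nodes = {n: {} for n in range(NUM_NODES)}

def publish(doc_id: str, terms: list[str]) -> None:
    """Insert each (term, doc) posting at the term's responsible node."""
    for term in terms:
        nodes[responsible_node(term)].setdefault(term, set()).add(doc_id)

def query(terms: list[str]) -> set[str]:
    """Fetch each term's posting list and intersect them."""
    postings = [nodes[responsible_node(t)].get(t, set()) for t in terms]
    return set.intersection(*postings) if postings else set()

publish("doc1", ["peer", "search", "dht"])
publish("doc2", ["peer", "storage"])
print(query(["peer", "search"]))  # {'doc1'}
print(query(["peer"]))            # {'doc1', 'doc2'}
```

The bandwidth cost the paper measures comes largely from shipping posting lists between nodes during the intersection step, which is why compact set representations (see the Bloom-filter reference below in spirit) matter in such designs.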



Citations
Proceedings ArticleDOI

Probabilistic Data Structures in Adversarial Environments

TL;DR: This work provides a provable security treatment of probabilistic data structures in adversarial environments, with a syntax that captures a wide variety of in-use structures and security notions that support the development of error bounds in the presence of powerful attacks.
Proceedings ArticleDOI

The dynamic cuckoo filter

TL;DR: The dynamic cuckoo filter (DCF) is proposed to support a reliable delete operation and elastic capacity for dynamic set representation and membership testing, and comprehensive experiment results demonstrate the efficiency of the DCF design compared to existing schemes.
Proceedings ArticleDOI

Optimized Inverted List Assignment in Distributed Search Engine Architectures

TL;DR: This work analyzes search engine query traces in order to optimize the assignment of index data to the nodes in the system, such that terms frequently occurring together in queries are also often collocated on the same node.
Proceedings ArticleDOI

Keyword fusion to support efficient keyword-based search in peer-to-peer file sharing

TL;DR: This paper proposes a set of mechanisms, called keyword fusion, to provide scalable keyword-based file search in distributed hash table (DHT)-based P2P systems; it adaptively unburdens peers overloaded with excessive storage consumption due to common keywords and reduces network bandwidth consumption.
Proceedings ArticleDOI

Bringing efficient advanced queries to distributed hash tables

TL;DR: This paper introduces new efficient, scalable, and completely distributed methods that strive to keep resource consumption by queries and index information as low as possible and describes how to improve the handling of multiple subqueries combined through Boolean set operators.
References
Journal ArticleDOI

The anatomy of a large-scale hypertextual Web search engine

TL;DR: This paper provides an in-depth description of Google, a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext and looks at the problem of how to effectively deal with uncontrolled hypertext collections where anyone can publish anything they want.
Proceedings Article

The PageRank Citation Ranking : Bringing Order to the Web

TL;DR: This paper describes PageRank, a method for rating Web pages objectively and mechanically, effectively measuring the human interest and attention devoted to them, and shows how to efficiently compute PageRank for large numbers of pages.
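The rating method this entry summarizes can be sketched as a short power iteration. This is a minimal illustrative version, not the paper's implementation; the damping factor and the toy graph are assumptions.

```python
def pagerank(links: dict[str, list[str]], d: float = 0.85, iters: int = 50) -> dict[str, float]:
    """Power-iteration PageRank: repeatedly redistribute rank along links."""
    pages = list(links)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}          # start uniform
    for _ in range(iters):
        new = {p: (1 - d) / n for p in pages}   # teleportation share
        for p, outs in links.items():
            if not outs:                         # dangling page: spread uniformly
                for q in pages:
                    new[q] += d * rank[p] / n
            else:                                # split rank among out-links
                for q in outs:
                    new[q] += d * rank[p] / len(outs)
        rank = new
    return rank

graph = {"a": ["b", "c"], "b": ["c"], "c": ["a"]}
r = pagerank(graph)
print(max(r, key=r.get))  # 'c' accumulates the most rank in this toy graph
```

Note that total rank stays 1 across iterations: the damped share d of all rank is passed along links (or spread uniformly for dangling pages) and the remaining 1 - d is split evenly, which is what makes the iteration a proper stochastic process.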
Journal Article

The Anatomy of a Large-Scale Hypertextual Web Search Engine.

Sergey Brin, +1 more
01 Jan 1998
TL;DR: Google, as discussed by the authors, is a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext and is designed to crawl and index the Web efficiently and produce much more satisfying search results than existing systems.
Proceedings ArticleDOI

Chord: A scalable peer-to-peer lookup service for internet applications

TL;DR: Results from theoretical analysis, simulations, and experiments show that Chord is scalable, with communication cost and the state maintained by each node scaling logarithmically with the number of Chord nodes.
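The logarithmic scaling this entry reports can be illustrated with a small simulation. This is a hedged sketch of Chord-style routing, not the full protocol (no joins, failures, or stabilization); the node ids and ring size below are made up. Each node's i-th finger is the successor of (n + 2^i) mod 2^m, and forwarding to the closest finger preceding the key roughly halves the remaining ring distance per hop.

```python
M = 6                # identifier bits -> ring of size 2^6 = 64
RING = 1 << M
node_ids = sorted([1, 8, 14, 21, 32, 38, 42, 48, 51, 56])

def in_interval(x: int, a: int, b: int) -> bool:
    """True if x lies in the half-open interval (a, b] on the ring."""
    if a < b:
        return a < x <= b
    return x > a or x <= b       # interval wraps past zero

def successor(key: int) -> int:
    """First node whose id is >= key (wrapping around the ring)."""
    key %= RING
    for n in node_ids:
        if n >= key:
            return n
    return node_ids[0]

def fingers(n: int) -> list[int]:
    """i-th finger of node n: successor of (n + 2^i) mod 2^m."""
    return [successor((n + (1 << i)) % RING) for i in range(M)]

def lookup(start: int, key: int) -> tuple[int, int]:
    """Route greedily toward key; return (responsible node, hop count)."""
    cur, hops = start, 0
    while not in_interval(key, cur, successor((cur + 1) % RING)):
        nxt = cur
        for f in fingers(cur):   # fingers grow in ring distance, so the
            if f != cur and in_interval(f, cur, (key - 1) % RING):
                nxt = f          # last qualifying one is closest to key
        if nxt == cur:           # no finger makes progress
            break
        cur, hops = nxt, hops + 1
    return successor((cur + 1) % RING), hops

owner, hops = lookup(1, 42)
print(owner, hops)   # node 42 owns key 42, reached here in one hop
```

With N nodes the hop count is O(log N) in expectation, which is where the logarithmic communication cost and per-node state in the entry's claim come from.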
Journal ArticleDOI

Space/time trade-offs in hash coding with allowable errors

TL;DR: Analysis of the paradigm problem demonstrates that allowing a small number of test messages to be falsely identified as members of the given set will permit a much smaller hash area to be used without increasing reject time.
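The trade-off this entry describes is the classic Bloom filter: accept a small false-positive rate on membership tests in exchange for a much smaller "hash area". A minimal sketch, with illustrative parameters (the class name and sizing below are assumptions, not from the paper):

```python
import hashlib

class BloomFilter:
    """k hash positions per item over an m-bit array; no false negatives,
    but unrelated items may collide on all k bits (false positives)."""

    def __init__(self, m_bits: int, k_hashes: int):
        self.m, self.k = m_bits, k_hashes
        self.bits = bytearray((m_bits + 7) // 8)

    def _positions(self, item: str):
        # derive k positions by salting the item with the hash index
        for i in range(self.k):
            h = hashlib.sha1(f"{i}:{item}".encode()).digest()
            yield int.from_bytes(h[:4], "big") % self.m

    def add(self, item: str) -> None:
        for p in self._positions(item):
            self.bits[p // 8] |= 1 << (p % 8)

    def __contains__(self, item: str) -> bool:
        return all(self.bits[p // 8] & (1 << (p % 8)) for p in self._positions(item))

bf = BloomFilter(1024, 3)
bf.add("chord")
print("chord" in bf)      # True: added items are always found
print("kademlia" in bf)   # almost certainly False (tiny false-positive chance)
```

Sizing m and k against the expected number of items controls the false-positive ("falsely identified") rate, which is exactly the allowable-error knob the paper analyzes.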