scispace - formally typeset
Search or ask a question

Showing papers by "Anthony Tomasic published in 1999"


Journal ArticleDOI
TL;DR: This article describes GlOSS, Glossary of Servers Server, with two versions: bGloss, which provides a Boolean query retrieval model, and vGlOSS, which providing a vector-space retrieval model and extensively describes the methodology for measuring the retrieval effectiveness of these systems.
Abstract: The dramatic growth of the Internet has created a new problem for users: location of the relevant sources of documents. This article presents a framework for (and experimentally analyzes a solution to) this problem, which we call the text-source discovery problem. Our approach consists of two phases. First, each text source exports its contents to a centralized service. Second, users present queries to the service, which returns an ordered list of promising text sources. This article describes GlOSS, Glossary of Servers Server, with two versions: bGlOSS, which provides a Boolean query retrieval model, and vGlOSS, which provides a vector-space retrieval model. We also present hGlOSS, which provides a decentralized version of the system. We extensively describe the methodology for measuring the retrieval effectiveness of these systems and provide experimental evidence, based on actual data, that all three systems are highly effective in determining promising text sources for a given query.

371 citations


Journal Article
TL;DR: In this article, the authors present an open hybrid architecture, which combines digital library technology, information integration mechanisms and workflow-based systems, for the integration and visualization of scientic repositories into an easily accessed interoperable networked environment.
Abstract: Scientic repositories found in institutions and organizations consist of data and programs. Data consists principally of numeric data, images, and text documents. Programs consist principally of software methods for visualizing and processing data and simulators of natural processes. Data represents both measured physical behavior and the results of simulations. The integration and visualization of scientic repositories into an easily accessed interoperable networked environment is needed in many disciplines for both scientic and management purposes. To satisfy these needs we present an open hybrid architecture, which combines digital library technology, information integration mechanisms and workflow-based systems. Our experience is based on the THETIS 1 [15] project, a distributed collection of scientic repositories focused on supporting Coastal Zone Management of the Mediterranean Region in Europe. It will demonstrate its ability to respond to users such as scientists and public administration authorities that use scientic information for decision making.

9 citations


Journal Article
TL;DR: La solution apportee avec DISCO consiste a combiner un modele de cout generique avec des informations de cout exportees par les adaptateurs pour permettre au mediateur d'estimer le cout des requetes heterogenes.
Abstract: DISCO est un systeme de mediation developpe a l'INRIA pour acceder a des sources de donnees heterogenes reparties sur Internet. Dans DISCO, l'utilisateur pose des requetes au mediateur. Le mediateur traite les requetes en accedant aux donnees via les adaptateurs, et retourne la reponse a l'utilisateur. Pour etre efficace le mediateur optimise les requetes en se basant sur l'estimation de leur cout. L'estimation du cout est difficile car les sources heterogenes n'exportent pas d'information de cout. La solution apportee avec DISCO consiste a combiner un modele de cout generique avec des informations de cout exportees par les adaptateurs pour permettre au mediateur d'estimer le cout des requetes heterogenes. Dans cet article, nous validons le modele de cout de DISCO par une experimentation sur des sources de donnees reelles accessibles sur le Web. Cette validation montre l'efficacite de notre modele de cout generique ainsi que l'efficacite de fonctions de cout plus specialisees.

3 citations