scispace - formally typeset
A

Anthony Tomasic

Researcher at Carnegie Mellon University

Publications -  101
Citations -  4544

Anthony Tomasic is an academic researcher from Carnegie Mellon University. The author has contributed to research in topics: Query optimization & Distributed database. The author has an hindex of 34, co-authored 100 publications receiving 4297 citations. Previous affiliations of Anthony Tomasic include University of California & Stanford University.

Papers
More filters
Proceedings ArticleDOI

Learning to detect phishing emails

TL;DR: This method is applicable, with slight modification, to detection of phishing websites, or the emails used to direct victims to these sites, and correctly identify over 96% of the phishing emails while only mis-classifying on the order of 0.1%" of the legitimate emails.
Journal ArticleDOI

GlOSS: text-source discovery over the Internet

TL;DR: This article describes GlOSS, Glossary of Servers Server, with two versions: bGloss, which provides a Boolean query retrieval model, and vGlOSS, which providing a vector-space retrieval model and extensively describes the methodology for measuring the retrieval effectiveness of these systems.
Proceedings ArticleDOI

Scaling heterogeneous databases and the design of Disco

TL;DR: The Distributed Information Search COmponent (Disco) as discussed by the authors is a distributed mediator architecture for heterogeneous distributed databases that allows for the translation of queries between query languages and schemas.
Journal ArticleDOI

Scaling access to heterogeneous data sources with DISCO

TL;DR: The distributed mediator architecture of Disco is described; the data model and its modeling of data source connections; the interface to underlying data sources and the query rewriting process; and query processing semantics are described.
Proceedings ArticleDOI

The effectiveness of GIOSS for the text database discovery problem

TL;DR: The first part of this paper presents a practical solution based on estimating the result size of a query and a database and evaluates the effectiveness of GlOSS based on a trace of real user queries.