A
Anthony Tomasic
Researcher at Carnegie Mellon University
Publications - 101
Citations - 4544
Anthony Tomasic is an academic researcher from Carnegie Mellon University. The author has contributed to research in topics: Query optimization & Distributed database. The author has an hindex of 34, co-authored 100 publications receiving 4297 citations. Previous affiliations of Anthony Tomasic include University of California & Stanford University.
Papers
More filters
Proceedings ArticleDOI
Learning to detect phishing emails
TL;DR: This method is applicable, with slight modification, to detection of phishing websites, or the emails used to direct victims to these sites, and correctly identify over 96% of the phishing emails while only mis-classifying on the order of 0.1%" of the legitimate emails.
Journal ArticleDOI
GlOSS: text-source discovery over the Internet
TL;DR: This article describes GlOSS, Glossary of Servers Server, with two versions: bGloss, which provides a Boolean query retrieval model, and vGlOSS, which providing a vector-space retrieval model and extensively describes the methodology for measuring the retrieval effectiveness of these systems.
Proceedings ArticleDOI
Scaling heterogeneous databases and the design of Disco
TL;DR: The Distributed Information Search COmponent (Disco) as discussed by the authors is a distributed mediator architecture for heterogeneous distributed databases that allows for the translation of queries between query languages and schemas.
Journal ArticleDOI
Scaling access to heterogeneous data sources with DISCO
TL;DR: The distributed mediator architecture of Disco is described; the data model and its modeling of data source connections; the interface to underlying data sources and the query rewriting process; and query processing semantics are described.
Proceedings ArticleDOI
The effectiveness of GIOSS for the text database discovery problem
TL;DR: The first part of this paper presents a practical solution based on estimating the result size of a query and a database and evaluates the effectiveness of GlOSS based on a trace of real user queries.