Search or ask a question

Showing papers by "Anthony Tomasic published in 2008"

PDF

Open Access

Proceedings Article•

RADAR: a personal assistant that learns to reduce email overload

[...]

Michael Freed¹, Jaime G. Carbonell², Geoff Gordon², Jordan Hayes, Brad A. Myers², Daniel P. Siewiorek², Stephen F. Smith², Aaron Steinfeld², Anthony Tomasic² - Show less +5 more•Institutions (2)

SRI International¹, Carnegie Mellon University²

13 Jul 2008

TL;DR: Tests show a clear advantage for learning-enabled RADAR over all other test conditions, and as machine learning plays a central role in many system components, this paper compares versions of RADAR with and without learning.

...read moreread less

Abstract: Email client software is widely used for personal task management, a purpose for which it was not designed and is poorly suited. Past attempts to remedy the problem have focused on adding task management features to the client UI. RADAR uses an alternative approach modeled on a trusted human assistant who reads mail, identifies task-relevant message content, and helps manage and execute tasks. This paper describes the integration of diverse AI technologies and presents results from human evaluation studies comparing RADAR user performance to unaided COTS tool users and users partnered with a human assistant. As machine learning plays a central role in many system components, we also compare versions of RADAR with and without learning. Our tests show a clear advantage for learning-enabled RADAR over all other test conditions.

...read moreread less

71 citations

Journal Article•DOI•

Scalable query result caching for web applications

[...]

Charles Garrod¹, Amit Manjhi², Anastasia Ailamaki¹, Bruce M. Maggs¹, Todd C. Mowry¹, Christopher Olston³, Anthony Tomasic¹ - Show less +3 more•Institutions (3)

Carnegie Mellon University¹, Google², Yahoo!³

01 Aug 2008

TL;DR: This paper introduces Ferdinand, the first proxy-based cooperative query result cache with fully distributed consistency management, and implements a fully functioning Ferdinand prototype and evaluates its performance compared to several alternative query-caching approaches, showing that high cache hit rate and consistency management are both critical for Ferdinand's performance gains over existing systems.

...read moreread less

Abstract: The backend database system is often the performance bottleneck when running web applications. A common approach to scale the database component is query result caching, but it faces the challenge of maintaining a high cache hit rate while efficiently ensuring cache consistency as the database is updated. In this paper we introduce Ferdinand, the first proxy-based cooperative query result cache with fully distributed consistency management. To maintain a high cache hit rate, Ferdinand uses both a local query result cache on each proxy server and a distributed cache. Consistency management is implemented with a highly scalable publish/subscribe system. We implement a fully functioning Ferdinand prototype and evaluate its performance compared to several alternative query-caching approaches, showing that our high cache hit rate and consistency management are both critical for Ferdinand's performance gains over existing systems.

...read moreread less

59 citations

Holistic Application Analysis for Update-Independence

[...]

Charles Garrod, Amit Manjhi, Bruce M. Maggs, Todd C. Mowry, Anthony Tomasic - Show less +1 more

01 Jan 2008

TL;DR: It is argued that typical database application design enables a more holistic analysis that maintains the relationship between the database and application data and shows that this holistic analysis outperforms traditional nonholistic methods both statically and when used as part of a dynamic, distributed environment for executing Web applications using database caches.

...read moreread less

Abstract: Current database performance optimizations stop at the border between the database application and the database system, focusing either on improving the performance of just the database system or the application’s execution in isolation of the other. We argue that typical database application design enables a more holistic analysis that maintains the relationship between the database and application data. We describe techniques to maintain this relationship and introduce several optimizations to improve the efficiency of Web application execution in a distributed environment. We show that our holistic analysis outperforms traditional nonholistic methods both statically and when used as part of a dynamic, distributed environment for executing Web applications using database caches.

...read moreread less

3 citations

Learning to Extract Gene-Protein Names from Weakly-Labeled Text

[...]

Richard C. Wang, Anthony Tomasic, Robert E. Frederking, Isaac Simmons, William W. Cohen - Show less +1 more

01 Jan 2008

TL;DR: This paper reduces or eliminate the effort required to create training data by automatically converting other sources of data into annotated training data and shows that the methods are effective and that the trained NER system outperforms all of the baseline results.

...read moreread less

Abstract: : Training a named entity recognizer (NER) has always been a difficult task due to the effort required to generate a significant amount of annotated training data. In this paper, we reduce or eliminate the effort required to create training data by automatically converting other sources of data into annotated training data. The performance of this approach is tested on a gene-protein name extractor by using the mouse and fly data obtained from the BioCreAtIvE challenge. Results show that our methods are effective and that our trained NER system outperforms all of our baseline results.

...read moreread less

3 citations