
Showing papers by "Adam Tauman Kalai" published in 1999


Proceedings ArticleDOI
06 Jul 1999
TL;DR: It is shown that for any nontrivial learning problem and any learning algorithm that is insensitive to example ordering, the k-fold estimate is strictly more accurate than a single hold-out estimate on 1/k of the data, for 2 ≤ k ≤ n (k = n is leave-one-out), based on its variance and all higher moments.
Abstract: The empirical error on a test set, the hold-out estimate, is often a more reliable estimate of generalization error than the observed error on the training set, the training estimate. K-fold cross validation is used in practice with the hope of being more accurate than the hold-out estimate without reducing the number of training examples. We argue that the k-fold estimate does in fact achieve this goal. Specifically, we show that for any nontrivial learning problem and learning algorithm that is insensitive to example ordering, the k-fold estimate is strictly more accurate than a single hold-out estimate on 1/k of the data, for 2 ≤ k ≤ n (k = n is leave-one-out), based on its variance and all higher moments. Previous bounds were termed sanity-check bounds because they compared the k-fold estimate to the training estimate and, further, restricted the VC dimension and required a notion of hypothesis stability [2]. In order to avoid these dependencies, we consider a k-fold hypothesis that is a randomized combination, or average, of the k individual hypotheses. We introduce progressive validation as another possible improvement on the hold-out estimate. This estimate of the generalization error is, in many ways, as good as that of a single hold-out, but it uses, on average, half as many examples for testing. The procedure also involves a hold-out set, but after an example has been tested, it is added to the training set and the learning algorithm is rerun.
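The progressive validation procedure lends itself to a short sketch. The following is a minimal illustration, assuming a generic learner `train(examples)` that returns a hypothesis `h` with `h(x)` giving a predicted label (these names are ours for illustration, not the paper's):

```python
# A minimal sketch of progressive validation as described above
# (illustrative, not the authors' code).

def progressive_validation(train, examples, n_test):
    """Estimate generalization error: each hold-out example is tested once,
    then folded into the training set before the learner is rerun."""
    training = list(examples[:-n_test])   # initial training set
    holdout = examples[-n_test:]          # hold-out sequence
    errors = 0
    for x, y in holdout:
        h = train(training)               # rerun the learning algorithm
        errors += int(h(x) != y)          # test on the next held-out example
        training.append((x, y))           # then add it to the training set
    return errors / n_test                # progressive validation estimate
```

Unlike a plain hold-out, later test examples here are predicted by hypotheses trained on nearly all the data, which is why the estimate can match a hold-out's quality while "spending" fewer examples purely on testing.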

303 citations


Proceedings Article
01 Jan 1999
TL;DR: In this article, the authors propose a virtual sensor for optimal 3D reconstruction that collects a small number of rays at many different viewpoints; the resulting 2D manifold of rays is arranged into two multiple-perspective images for stereo reconstruction.
Abstract: The notion of a virtual sensor for optimal 3D reconstruction is introduced. Instead of planar perspective images that collect many rays at a fixed viewpoint, omnivergent cameras collect a small number of rays at many different viewpoints. The resulting 2D manifold of rays is arranged into two multiple-perspective images for stereo reconstruction. We call such images omnivergent images, and the process of reconstructing the scene from such images, omnivergent stereo. This procedure is shown to produce 3D scene models with minimal reconstruction error, because for any point in the 3D scene, two rays with maximum vergence angle can be found in the omnivergent images. Furthermore, omnivergent images are shown to have horizontal epipolar lines, enabling the application of traditional stereo matching algorithms without modification. Three types of omnivergent virtual sensors are presented: spherical omnivergent cameras, center-strip cameras and dual-strip cameras.
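The minimal-error claim hinges on the vergence angle between the two rays observing each scene point. As a hedged illustration (our own, not from the paper), the vergence angle at a point `p` seen from viewpoints `c1` and `c2` can be computed as follows:

```python
import numpy as np

# Illustrative only: vergence angle at scene point p between the rays from
# viewpoints c1 and c2. Omnivergent imaging selects, for each p, a pair of
# rays maximizing this angle, which best conditions the triangulation.

def vergence_angle(p, c1, c2):
    d1 = (c1 - p) / np.linalg.norm(c1 - p)  # unit direction from p to viewpoint 1
    d2 = (c2 - p) / np.linalg.norm(c2 - p)  # unit direction from p to viewpoint 2
    cos_theta = np.clip(np.dot(d1, d2), -1.0, 1.0)
    return np.arccos(cos_theta)              # radians; larger is better for stereo
```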

81 citations


Proceedings ArticleDOI
17 Oct 1999
TL;DR: An online algorithm for paging is constructed that achieves an O(r+log k) competitive ratio when compared to an offline strategy that is allowed the additional ability to "rent" pages at a cost of 1/r.
Abstract: We construct an online algorithm for paging that achieves an O(r+log k) competitive ratio when compared to an offline strategy that is allowed the additional ability to "rent" pages at a cost of 1/r. In contrast, the competitive ratio of the Marking algorithm for this scenario is O(r log k). Our algorithm can be thought of in the standard setting as having a "fine-grained" competitive ratio, achieving an O(1) ratio when the request sequence consists of a small number of working sets, gracefully decaying to O(log k) as this number increases. Our result is a generalization of the result by Y. Bartal et al. (1997) that one can achieve an O(r+log n) ratio for the unfair n-state uniform-space Metrical Task System problem. That result was a key component of the polylog(n) competitive randomized algorithm given in that paper for the general Metrical Task System problem. One motivation of this work is that it may be a first step toward achieving a polylog(k) randomized competitive ratio for the much more difficult k-server problem.
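For reference, here is a minimal sketch of the Marking algorithm mentioned above as the O(r log k) baseline (a standard textbook formulation of randomized marking, not the paper's new O(r + log k) algorithm):

```python
import random

def marking(requests, k):
    """Randomized Marking algorithm for paging with cache size k.
    On a fault, if every cached page is marked, a new phase begins and all
    marks are cleared; the victim is a uniformly random unmarked page."""
    cache, marked, faults = set(), set(), 0
    for p in requests:
        if p not in cache:
            faults += 1
            if len(cache) == k:               # cache full: must evict
                if not cache - marked:        # all pages marked: new phase
                    marked.clear()
                cache.remove(random.choice(list(cache - marked)))
            cache.add(p)
        marked.add(p)                         # every requested page is marked
    return faults
```

Against the standard offline optimum, this phase-based strategy is O(log k)-competitive; the abstract's point is that in the rental model it degrades to O(r log k), which the paper's algorithm improves to O(r + log k).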

57 citations


Proceedings ArticleDOI
15 Mar 1999
TL;DR: It is demonstrated that, in some situations, on-line techniques can significantly outperform static mixtures and are especially effective when the nature of the test data is unknown or changes over time.
Abstract: Multiple language models are combined for many tasks in language modeling, such as domain and topic adaptation. In this work, we compare on-line algorithms from machine learning to existing algorithms for combining language models. On-line algorithms developed for this problem have parameters that are updated dynamically to adapt to a data set during evaluation. On-line analysis provides guarantees that these algorithms will perform nearly as well as the best model chosen in hindsight from a large class of models, e.g., the set of all static mixtures. We describe several on-line algorithms and present results comparing these techniques with existing language modeling combination methods on the task of domain adaptation. We demonstrate that, in some situations, on-line techniques can significantly outperform static mixtures (by over 10% in terms of perplexity) and are especially effective when the nature of the test data is unknown or changes over time.
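As a hedged illustration of the kind of on-line update involved, the following sketches a standard multiplicative-weights (Bayesian posterior) mixture of language models. This variant competes with the best single model in hindsight, whereas the paper's guarantees extend to richer comparison classes such as the set of all static mixtures; the function names are ours:

```python
import math

def online_mixture_perplexity(models, text):
    """models: list of callables p(word, history) -> probability of `word`.
    Dynamically reweights the models after each word and returns the
    perplexity of the adaptively weighted mixture."""
    w = [1.0 / len(models)] * len(models)          # uniform prior weights
    log_loss, history = 0.0, []
    for word in text:
        probs = [m(word, history) for m in models]
        mix = sum(wi * pi for wi, pi in zip(w, probs))
        log_loss -= math.log2(mix)                 # mixture's log-loss on this word
        w = [wi * pi for wi, pi in zip(w, probs)]  # reweight by predictive fit
        total = sum(w)
        w = [wi / total for wi in w]               # renormalize
        history.append(word)
    return 2 ** (log_loss / len(text))             # perplexity of the mixture
```

The weights adapt during evaluation itself, which is what lets such methods track test data whose character is unknown in advance or drifts over time.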

22 citations