Showing papers by "Ilya Sutskever" published in 2009


Proceedings Article
07 Dec 2009
TL;DR: The Bayesian Clustered Tensor Factorization (BCTF) model is introduced, which embeds a factorized representation of relations in a nonparametric Bayesian clustering framework; inference is fully Bayesian yet scales well to large data sets.
Abstract: We consider the problem of learning probabilistic models for complex relational structures between various types of objects. A model can help us "understand" a dataset of relational facts in at least two ways, by finding interpretable structure in the data, and by supporting predictions, or inferences about whether particular unobserved relations are likely to be true. Often there is a tradeoff between these two aims: cluster-based models yield more easily interpretable representations, while factorization-based approaches have given better predictive performance on large data sets. We introduce the Bayesian Clustered Tensor Factorization (BCTF) model, which embeds a factorized representation of relations in a nonparametric Bayesian clustering framework. Inference is fully Bayesian but scales well to large data sets. The model simultaneously discovers interpretable clusters and yields predictive performance that matches or beats previous probabilistic models for relational data.

275 citations
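
A minimal NumPy sketch of the kind of model the abstract describes: entity factor vectors drawn near cluster means (the clustering side) combined with a bilinear per-relation score (the factorization side). The fixed cluster count, dimensions, and logistic link are illustrative assumptions; the paper's nonparametric prior and fully Bayesian inference are not shown here.

```python
import numpy as np

rng = np.random.default_rng(0)

n_entities, n_relations, d = 50, 5, 8
n_clusters = 4  # fixed here; the paper uses a nonparametric clustering prior

# Each entity gets a cluster; its factor vector sits near the cluster mean,
# so entities in a cluster behave similarly across relations (interpretability).
cluster_of = rng.integers(n_clusters, size=n_entities)
cluster_means = rng.normal(size=(n_clusters, d))
entity_vecs = cluster_means[cluster_of] + 0.1 * rng.normal(size=(n_entities, d))

# Each relation is a d x d matrix; a candidate fact (i, r, j) is scored by a
# bilinear form and squashed to a probability (the logistic link is an assumption).
relation_mats = rng.normal(size=(n_relations, d, d))

def fact_prob(i, r, j):
    """Predicted probability that relation r holds between entities i and j."""
    score = entity_vecs[i] @ relation_mats[r] @ entity_vecs[j]
    return 1.0 / (1.0 + np.exp(-score))

print(fact_prob(0, 2, 7))
```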


Patent
John Platt, Ilya Sutskever
19 Jun 2009
TL;DR: A method of creating translingual text representations takes in documents in a first language and in a second language and creates a matrix from the words in those documents, recording which words are present in which language.
Abstract: A method of creating translingual text representations takes in documents in a first language and in a second language and creates a matrix using the words in the documents to represent which words are present in which language. An algorithm is applied to each matrix such that like documents are placed close to each other and unlike documents are moved far from each other.

13 citations
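
The abstract outlines two steps: build a word-presence matrix over documents in both languages, then apply an algorithm that places like documents close together and unlike documents far apart. A hedged sketch of that pipeline follows; the toy corpus, the merging of aligned translation pairs into single rows, and the use of truncated SVD as the second step are all assumptions for illustration, not the patent's actual algorithm.

```python
import numpy as np

# Toy aligned pairs: each English document with an assumed German translation.
pairs = [
    ("the cat sat on the mat", "die katze sass auf der matte"),
    ("dogs chase cats", "hunde jagen katzen"),
    ("stocks fell sharply today", "aktien fielen heute stark"),
]

# One vocabulary across both languages; each row marks which words are present
# in an aligned pair, so translated words co-occur in the same rows.
vocab = sorted({w for en, de in pairs for w in (en + " " + de).split()})
index = {w: i for i, w in enumerate(vocab)}
X = np.zeros((len(pairs), len(vocab)))
for row, (en, de) in enumerate(pairs):
    for w in (en + " " + de).split():
        X[row, index[w]] = 1.0

# The patent abstract only says an algorithm places like documents close and
# unlike documents far apart; truncated SVD (as in LSA) is a generic stand-in.
U, S, Vt = np.linalg.svd(X, full_matrices=False)
k = 2

def project(bag_of_words):
    """Map a new document's word-presence vector into the shared k-dim space."""
    return bag_of_words @ Vt[:k].T

new_doc = np.zeros(len(vocab))
for w in "die katze jagt hunde".split():
    if w in index:  # unseen words ("jagt") are simply ignored
        new_doc[index[w]] = 1.0
print(project(new_doc))  # low-dimensional representation of a German-only doc
```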


Proceedings Article
14 Jun 2009
TL;DR: A unified analysis of both the Forgetron and the Randomized Budget Perceptron algorithms is proposed, based on the observation that the way they remove support vectors can be seen as a form of L2-regularization.

Abstract: The kernel Perceptron is an appealing online learning algorithm that has a drawback: whenever it makes an error it must increase its support set, which slows training and testing if the number of errors is large. The Forgetron and the Randomized Budget Perceptron algorithms overcome this problem by restricting the number of support vectors the Perceptron is allowed to have. These algorithms have regret bounds whose proofs are dissimilar. In this paper we propose a unified analysis of both of these algorithms by observing that the way in which they remove support vectors can be seen as a form of L2-regularization. By casting these algorithms as instances of online convex optimization problems and applying a variant of Zinkevich's theorem for noisy and incorrect gradients, we can bound the regret of these algorithms more easily than before. Our bounds are similar to the existing ones, but the proofs are less technical.

12 citations
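
For concreteness, here is a minimal sketch in the style of the Randomized Budget Perceptron the abstract refers to: every mistake adds a support vector, and once the budget is exceeded a uniformly random one is discarded. Discarding shrinks the kernel expansion, which is the step the paper reinterprets as L2-regularization. The RBF kernel, budget, and toy data are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def rbf(x, z, gamma=1.0):
    """RBF kernel between two examples (the kernel choice is an assumption)."""
    return np.exp(-gamma * np.sum((x - z) ** 2))

def budget_perceptron(stream, budget=20):
    """Kernel Perceptron with a support-vector budget, Randomized-Budget style:
    each mistake adds a support vector, and exceeding the budget evicts a
    uniformly random one -- the removal the paper views as L2-regularization."""
    support = []   # (x, y) pairs defining the kernel expansion
    mistakes = 0
    for x, y in stream:
        score = sum(yi * rbf(xi, x) for xi, yi in support)
        if y * score <= 0:                    # prediction error
            mistakes += 1
            support.append((x, y))
            if len(support) > budget:         # enforce the budget
                support.pop(rng.integers(len(support)))
    return support, mistakes

# Toy stream: two Gaussian blobs, labels +1 and -1, presented in random order.
X = np.vstack([rng.normal(2, 1, (100, 2)), rng.normal(-2, 1, (100, 2))])
y = np.array([1] * 100 + [-1] * 100)
order = rng.permutation(len(y))
support, mistakes = budget_perceptron(zip(X[order], y[order]))
print(f"{mistakes} mistakes, {len(support)} support vectors kept")
```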