scispace - formally typeset
R

Ruslan Salakhutdinov

Researcher at Carnegie Mellon University

Publications -  457
Citations -  142495

Ruslan Salakhutdinov is an academic researcher from Carnegie Mellon University. The author has contributed to research in topics: Computer science & Artificial neural network. The author has an hindex of 107, co-authored 410 publications receiving 115921 citations. Previous affiliations of Ruslan Salakhutdinov include Carnegie Learning & University of Toronto.

Papers
More filters
Posted Content

Deep Gamblers: Learning to Abstain with Portfolio Theory

TL;DR: Inspired by portfolio theory, a loss function for the selective classification problem based on the doubling rate of gambling is proposed, which allows us to train neural networks and characterize the disconfidence of prediction in an end-to-end fashion.
Posted Content

Adversarial Robustness Through Local Lipschitzness

TL;DR: The results show that having a small Lipschitz constant correlates with achieving high clean and robust accuracy, and therefore, the smoothness of the classifier is an important property to consider in the context of adversarial examples.
Posted Content

Neural Models for Reasoning over Multiple Mentions using Coreference

TL;DR: This article proposed a coreference annotations extracted from an external system to connect entity mentions belonging to the same cluster and incorporated this layer into a state-of-the-art reading comprehension model.
Posted Content

Block-Normalized Gradient Method: An Empirical Study for Training Deep Neural Network

TL;DR: The normalized gradient methods having constant step size with occasionally decay, such as SGD with momentum, have better performance in the deep convolution neural networks, while those with adaptive step sizes perform better in recurrent neural networks.
Proceedings ArticleDOI

Learning Representations from Imperfect Time Series Data via Tensor Rank Regularization

TL;DR: This paper proposed a tensor rank minimization method based on the observation that high-dimensional multimodal time series data often exhibit correlations across time and modalities which leads to low-rank tensor representations.