scispace - formally typeset
I

Illia Polosukhin

Researcher at Google

Publications -  17
Citations -  62085

Illia Polosukhin is an academic researcher from Google. The author has contributed to research in topics: Deep learning & Program synthesis. The author has an hindex of 11, co-authored 17 publications receiving 30290 citations.

Papers
More filters
Proceedings Article

Attention is All you Need

TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.
Posted Content

Attention Is All You Need

TL;DR: A new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely is proposed, which generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.
Journal ArticleDOI

Natural Questions: A Benchmark for Question Answering Research

TL;DR: The Natural Questions corpus, a question answering data set, is presented, introducing robust metrics for the purposes of evaluating question answering systems; demonstrating high human upper bounds on these metrics; and establishing baseline results using competitive methods drawn from related literature.
Proceedings ArticleDOI

Coarse-to-Fine Question Answering for Long Documents

TL;DR: A framework for question answering that can efficiently scale to longer documents while maintaining or even improving performance of state-of-the-art models is presented and sentence selection is treated as a latent variable trained jointly from the answer only using reinforcement learning.
Proceedings ArticleDOI

WikiReading: A Novel Large-scale Language Understanding Task over Wikipedia

TL;DR: This work presents WIKIREADING, a large-scale natural language understanding task and publicly-available dataset with 18 million instances, and compares various state-of-the-art DNNbased architectures for document classification, information extraction, and question answering.