O
Oscar Koller
Researcher at Microsoft
Publications - 40
Citations - 3491
Oscar Koller is an academic researcher from Microsoft. The author has contributed to research in topics: Sign language & Gesture recognition. The author has an hindex of 21, co-authored 37 publications receiving 2075 citations. Previous affiliations of Oscar Koller include INESC-ID & University of Surrey.
Papers
More filters
Proceedings Article
MS-ASL: A Large-Scale Data Set and Benchmark for Understanding American Sign Language.
TL;DR: In this paper, a large-scale ASL data set was proposed, which covers over 200 signers, signer independent sets, challenging and unconstrained recording conditions and a large class count of 1000 signs.
Posted Content
Quantitative Survey of the State of the Art in Sign Language Recognition.
TL;DR: This study compiles the state of the art in a concise way to help advance the field and reveal open questions, such as shifts in the field from intrusive to non-intrusive capturing, from local to global features and the lack of non-manual parameters included in medium and larger vocabulary recognition systems.
Posted Content
Multi-channel Transformers for Multi-articulatory Sign Language Translation
TL;DR: This paper tackles the multi-articulatory sign language translation task and proposes a novel multi-channel transformer architecture that overcome the reliance on gloss annotations which underpin other state-of-the-art approaches, thereby removing future need for expensive curated datasets.
Posted Content
MS-ASL: A Large-Scale Data Set and Benchmark for Understanding American Sign Language.
TL;DR: This work proposes the first real-life large-scale sign language data set comprising over 25,000 annotated videos, which it thoroughly evaluates with state-of-the-art methods from sign and related action recognition, outperforming the current state of theart by a large margin.
Book ChapterDOI
Modality Combination Techniques for Continuous Sign Language Recognition
TL;DR: Early combination of features, late fusion of decisions, as well as synchronous combination on the hidden Markov model state level, and asynchronous combination onThe gloss level are investigated for five modalities on two publicly available benchmark databases consisting of challenging real-life data and less complex lab-data.