Vedanuj Goswami
Researcher at Facebook
Publications - 27
Citations - 1115
Vedanuj Goswami is an academic researcher at Facebook. He has contributed to research in Computer science and Question answering, has an h-index of 8, and has co-authored 18 publications receiving 387 citations. Previous affiliations include the National Institute of Technology, Silchar and the Georgia Institute of Technology.
Papers
Proceedings ArticleDOI
12-in-1: Multi-Task Vision and Language Representation Learning
TL;DR: This paper investigates the relationships between vision-and-language tasks by developing a large-scale multi-task model trained jointly on 12 datasets from four broad categories of tasks: visual question answering, caption-based image retrieval, grounding referring expressions, and multimodal verification.
Journal ArticleDOI
No Language Left Behind: Scaling Human-Centered Machine Translation
NLLB Team,Marta R. Costa-jussà,James Cross,Onur Çelebi,Maha Elbayad,Kenneth Heafield,Kevin Heffernan,Elahe Kalbassi,Janice Si-Man Lam,Daniel Licht,Jean Maillard,Anna Sun,Skyler Wang,Guillaume Wenzek,Alison Youngblood,Bapi Akula,Loïc Barrault,Gabriel Mejia Gonzalez,Prangthip Hansanti,John Hoffman,Semarley Jarrett,Kaushik Ram Sadagopan,Dirk Rowe,Shannon Spruit,Chau Tran,Pierre Andrews,Necip Fazil Ayan,Shruti Bhosale,Sergey Edunov,Angela Fan,Cynthia Gao,Vedanuj Goswami,Francisco Guzmán,Philipp Koehn,Alexandre Mourachko,Christophe Ropers,Safiyyah Saleem,Holger Schwenk,Jeff Wang +38 more
TL;DR: The authors develop a conditional-compute model based on a Sparsely Gated Mixture of Experts, trained on data obtained with novel and effective data mining techniques tailored for low-resource languages, laying important groundwork towards realizing a universal translation system.
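The Sparsely Gated Mixture of Experts idea behind this conditional-compute model can be sketched minimally: a gating network routes each token to only its top-k experts, so the number of experts can grow while per-token compute stays roughly constant. The toy NumPy layer below is an illustrative assumption, not the NLLB implementation; all class and parameter names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

class SparselyGatedMoE:
    """Toy top-k sparsely gated mixture-of-experts layer (hypothetical sketch)."""

    def __init__(self, d_model, n_experts, k=2):
        self.k = k
        # Gating network: one linear map scoring each expert per token.
        self.gate = rng.standard_normal((d_model, n_experts))
        # Each expert is a simple linear map in this sketch.
        self.experts = [rng.standard_normal((d_model, d_model))
                        for _ in range(n_experts)]

    def __call__(self, x):
        # x: (n_tokens, d_model)
        logits = x @ self.gate                            # (n_tokens, n_experts)
        topk = np.argsort(logits, axis=-1)[:, -self.k:]   # ids of the k best experts per token
        out = np.zeros_like(x)
        for t in range(x.shape[0]):
            ids = topk[t]
            # Renormalize gate scores over only the selected experts.
            weights = softmax(logits[t, ids])
            for w, e in zip(weights, ids):
                out[t] += w * (x[t] @ self.experts[e])    # run only chosen experts
        return out

moe = SparselyGatedMoE(d_model=8, n_experts=4, k=2)
tokens = rng.standard_normal((3, 8))
y = moe(tokens)
print(y.shape)  # (3, 8)
```

With k fixed at 2, each token touches two expert matrices regardless of whether the layer holds 4 experts or 400, which is the conditional-compute property the summary refers to.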
Proceedings Article
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes
Douwe Kiela,Hamed Firooz,Aravind Mohan,Vedanuj Goswami,Amanpreet Singh,Pratik Ringshia,Davide Testuggine +6 more
TL;DR: This work proposes a new challenge set for multimodal classification, focusing on detecting hate speech in multimodal memes, constructed such that unimodal models struggle and only multimodal models can succeed.
Posted Content
12-in-1: Multi-Task Vision and Language Representation Learning
TL;DR: This work develops a large-scale multi-task model trained jointly on 12 datasets from four broad categories of tasks, including visual question answering, caption-based image retrieval, grounding referring expressions, and multimodal verification, and shows that fine-tuning task-specific models from it can lead to further improvements, achieving performance at or above the state of the art.
Posted Content
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes
Douwe Kiela,Hamed Firooz,Aravind Mohan,Vedanuj Goswami,Amanpreet Singh,Pratik Ringshia,Davide Testuggine +6 more
TL;DR: The authors propose a new challenge set for multimodal classification, focusing on detecting hate speech in multimodal memes, where difficult examples are added to the dataset to make it hard to rely on unimodal signals.