Institution

Allen Institute for Artificial Intelligence

Facility•

About: Allen Institute for Artificial Intelligence is a based out in . It is known for research contribution in the topics: Computer science & Question answering. The organization has 415 authors who have published 1062 publications receiving 72560 citations.

...read moreread less

Topics: Computer science, Question answering, Language model, Context (language use), Task (project management) ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

You Only Look Once: Unified, Real-Time Object Detection

[...]

Joseph Redmon¹, Santosh K. Divvala², Ross Girshick³, Ali Farhadi²•Institutions (3)

University of Washington¹, Allen Institute for Artificial Intelligence², Facebook³

27 Jun 2016

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.

...read moreread less

Abstract: We present YOLO, a new approach to object detection. Prior work on object detection repurposes classifiers to perform detection. Instead, we frame object detection as a regression problem to spatially separated bounding boxes and associated class probabilities. A single neural network predicts bounding boxes and class probabilities directly from full images in one evaluation. Since the whole detection pipeline is a single network, it can be optimized end-to-end directly on detection performance. Our unified architecture is extremely fast. Our base YOLO model processes images in real-time at 45 frames per second. A smaller version of the network, Fast YOLO, processes an astounding 155 frames per second while still achieving double the mAP of other real-time detectors. Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background. Finally, YOLO learns very general representations of objects. It outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.

...read moreread less

27,256 citations

Proceedings Article•DOI•

Deep contextualized word representations

[...]

Matthew E. Peters¹, Mark Neumann¹, Mohit Iyyer², Matt Gardner¹, Christopher Clark¹, Kenton Lee³, Luke Zettlemoyer⁴ - Show less +3 more•Institutions (4)

Allen Institute for Artificial Intelligence¹, University of Massachusetts Amherst², Google³, University of Washington⁴

15 Feb 2018

TL;DR: This paper introduced a new type of deep contextualized word representation that models both complex characteristics of word use (e.g., syntax and semantics), and how these uses vary across linguistic contexts (i.e., to model polysemy).

...read moreread less

Abstract: We introduce a new type of deep contextualized word representation that models both (1) complex characteristics of word use (e.g., syntax and semantics), and (2) how these uses vary across linguistic contexts (i.e., to model polysemy). Our word vectors are learned functions of the internal states of a deep bidirectional language model (biLM), which is pre-trained on a large text corpus. We show that these representations can be easily added to existing models and significantly improve the state of the art across six challenging NLP problems, including question answering, textual entailment and sentiment analysis. We also present an analysis showing that exposing the deep internals of the pre-trained network is crucial, allowing downstream models to mix different types of semi-supervision signals.

...read moreread less

7,412 citations

Book Chapter•DOI•

XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks

[...]

Mohammad Rastegari¹, Vicente Ordonez¹, Joseph Redmon², Ali Farhadi¹, Ali Farhadi² - Show less +1 more•Institutions (2)

Allen Institute for Artificial Intelligence¹, University of Washington²

08 Oct 2016

TL;DR: The Binary-Weight-Network version of AlexNet is compared with recent network binarization methods, BinaryConnect and BinaryNets, and outperform these methods by large margins on ImageNet, more than \(16\,\%\) in top-1 accuracy.

...read moreread less

Abstract: We propose two efficient approximations to standard convolutional neural networks: Binary-Weight-Networks and XNOR-Networks. In Binary-Weight-Networks, the filters are approximated with binary values resulting in 32\(\times \) memory saving. In XNOR-Networks, both the filters and the input to convolutional layers are binary. XNOR-Networks approximate convolutions using primarily binary operations. This results in 58\(\times \) faster convolutional operations (in terms of number of the high precision operations) and 32\(\times \) memory savings. XNOR-Nets offer the possibility of running state-of-the-art networks on CPUs (rather than GPUs) in real-time. Our binary networks are simple, accurate, efficient, and work on challenging visual tasks. We evaluate our approach on the ImageNet classification task. The classification accuracy with a Binary-Weight-Network version of AlexNet is the same as the full-precision AlexNet. We compare our method with recent network binarization methods, BinaryConnect and BinaryNets, and outperform these methods by large margins on ImageNet, more than \(16\,\%\) in top-1 accuracy. Our code is available at: http://allenai.org/plato/xnornet.

...read moreread less

3,288 citations

Posted Content•

XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks

[...]

Mohammad Rastegari¹, Vicente Ordonez¹, Joseph Redmon², Ali Farhadi², Ali Farhadi¹ - Show less +1 more•Institutions (2)

Allen Institute for Artificial Intelligence¹, University of Washington²

16 Mar 2016-arXiv: Computer Vision and Pattern Recognition

TL;DR: XNOR-Nets as discussed by the authors approximate convolutions using primarily binary operations, which results in 58x faster convolutional operations and 32x memory savings, and outperforms BinaryConnect and BinaryNets by large margins on ImageNet.

...read moreread less

Abstract: We propose two efficient approximations to standard convolutional neural networks: Binary-Weight-Networks and XNOR-Networks. In Binary-Weight-Networks, the filters are approximated with binary values resulting in 32x memory saving. In XNOR-Networks, both the filters and the input to convolutional layers are binary. XNOR-Networks approximate convolutions using primarily binary operations. This results in 58x faster convolutional operations and 32x memory savings. XNOR-Nets offer the possibility of running state-of-the-art networks on CPUs (rather than GPUs) in real-time. Our binary networks are simple, accurate, efficient, and work on challenging visual tasks. We evaluate our approach on the ImageNet classification task. The classification accuracy with a Binary-Weight-Network version of AlexNet is only 2.9% less than the full-precision AlexNet (in top-1 measure). We compare our method with recent network binarization methods, BinaryConnect and BinaryNets, and outperform these methods by large margins on ImageNet, more than 16% in top-1 accuracy.

...read moreread less

1,886 citations

Proceedings Article•DOI•

SciBERT: A Pretrained Language Model for Scientific Text

[...]

Iz Beltagy¹, Kyle Lo², Arman Cohan³•Institutions (3)

Allen Institute for Artificial Intelligence¹, University of Washington², Adobe Systems³

01 Nov 2019

TL;DR: SciBERT leverages unsupervised pretraining on a large multi-domain corpus of scientific publications to improve performance on downstream scientific NLP tasks and demonstrates statistically significant improvements over BERT.

...read moreread less

Abstract: Obtaining large-scale annotated data for NLP tasks in the scientific domain is challenging and expensive. We release SciBERT, a pretrained language model based on BERT (Devlin et. al., 2018) to address the lack of high-quality, large-scale labeled scientific data. SciBERT leverages unsupervised pretraining on a large multi-domain corpus of scientific publications to improve performance on downstream scientific NLP tasks. We evaluate on a suite of tasks including sequence tagging, sentence classification and dependency parsing, with datasets from a variety of scientific domains. We demonstrate statistically significant improvements over BERT and achieve new state-of-the-art results on several of these tasks. The code and pretrained models are available at https://github.com/allenai/scibert/.

...read moreread less

1,864 citations

Collapse

Authors

Showing all 422 results

Name	H-index	Papers	Citations
Christof Koch	141	712	105221
Abhinav Gupta	93	249	39876
Noah A. Smith	93	455	32507
Eduard Hovy	92	597	36994
Oren Etzioni	89	245	33044
Daniel S. Weld	87	317	31625
Luke Zettlemoyer	82	278	40896
Yejin Choi	70	277	19222
Hongkui Zeng	67	235	23795
Ali Farhadi	63	234	57227
Marti A. Hearst	61	223	26608
Yoav Goldberg	58	227	18523
Michael Hawrylycz	57	166	30987
Wen-tau Yih	56	150	16303
Stefano Ermon	54	346	11846

Network Information

Related Institutions (5)

Facebook

10.9K papers, 570.1K citations

94% related

Google

39.8K papers, 2.1M citations

93% related

Microsoft

86.9K papers, 4.1M citations

92% related

Adobe Systems

8K papers, 214.7K citations

89% related

Amazon.com

17.3K papers, 266.5K citations

88% related

Performance

Metrics

1,076

Papers

113,360

Citations

No. of papers from the Institution in previous years
Year	Papers
2023	5
2022	9
2021	265
2020	267
2019	213
2018	123