Institution

Amazon.com

Company•Seattle, Washington, United States•

About: Amazon.com is a company organization based out in Seattle, Washington, United States. It is known for research contribution in the topics: Computer science & Service (business). The organization has 13363 authors who have published 17317 publications receiving 266589 citations.

...read moreread less

Topics: Computer science, Service (business), Service provider, Context (language use), Virtual machine ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Multi-task learning and Weighted Cross-entropy for DNN-based Keyword Spotting

[...]

Sankaran Panchapagesan¹, Ming Sun¹, Aparna Khare², Spyros Matsoukas¹, Arindam Mandal¹, Bjorn Hoffmeister¹, Shiv Naga Prasad Vitaladevuni¹ - Show less +3 more•Institutions (2)

Amazon.com¹, Cisco Systems, Inc.²

08 Sep 2016

TL;DR: It is shown that the combination of 3 techniques LVCSR-initialization, multi-task training and weighted cross-entropy gives the best results, with significantly lower False Alarm Rate than the LV CSR- initialization technique alone, across a wide range of Miss Rates.

...read moreread less

Abstract: We propose improved Deep Neural Network (DNN) training loss functions for more accurate single keyword spotting on resource-constrained embedded devices. The loss function modifications consist of a combination of multi-task training and weighted cross entropy. In the multi-task architecture, the keyword DNN acoustic model is trained with two tasks in parallel the main task of predicting the keyword-specific phone states, and an auxiliary task of predicting LVCSR senones. We show that multi-task learning leads to comparable accuracy over a previously proposed transfer learning approach where the keyword DNN training is initialized by an LVCSR DNN of the same input and hidden layer sizes. The combination of LVCSRinitialization and Multi-task training gives improved keyword detection accuracy compared to either technique alone. We also propose modifying the loss function to give a higher weight on input frames corresponding to keyword phone targets, with a motivation to balance the keyword and background training data. We show that weighted cross-entropy results in additional accuracy improvements. Finally, we show that the combination of 3 techniques LVCSR-initialization, multi-task training and weighted cross-entropy gives the best results, with significantly lower False Alarm Rate than the LVCSR-initialization technique alone, across a wide range of Miss Rates.

...read moreread less

151 citations

Proceedings Article•

Ansor: Generating high-performance tensor programs for deep learning

[...]

Lianmin Zheng¹, Chengfan Jia², Minmin Sun², Zhao Wu², Cody Hao Yu³, Ameer Haj-Ali¹, Yida Wang³, Jun Yang², Danyang Zhuo⁴, Koushik Sen¹, Joseph E. Gonzalez¹, Ion Stoica¹ - Show less +8 more•Institutions (4)

University of California, Berkeley¹, Alibaba Group², Amazon.com³, Duke University⁴

11 Jun 2020

TL;DR: Ansor is presented, a tensor program generation framework for deep learning applications that can find high-performance programs that are outside the search space of existing state-of-the-art approaches.

...read moreread less

Abstract: High-performance tensor programs are crucial to guarantee efficient execution of deep neural networks. However, obtaining performant tensor programs for different operators on various hardware platforms is notoriously challenging. Currently, deep learning systems rely on vendor-provided kernel libraries or various search strategies to get performant tensor programs. These approaches either require significant engineering effort to develop platform-specific optimization code or fall short of finding high-performance programs due to restricted search space and ineffective exploration strategy. We present Ansor, a tensor program generation framework for deep learning applications. Compared with existing search strategies, Ansor explores many more optimization combinations by sampling programs from a hierarchical representation of the search space. Ansor then fine-tunes the sampled programs with evolutionary search and a learned cost model to identify the best programs. Ansor can find high-performance programs that are outside the search space of existing state-of-the-art approaches. In addition, Ansor utilizes a task scheduler to simultaneously optimize multiple subgraphs in deep neural networks. We show that Ansor improves the execution performance of deep neural networks relative to the state-of-the-art on the Intel CPU, ARM CPU, and NVIDIA GPU by up to $3.8\times$, $2.6\times$, and $1.7\times$, respectively.

...read moreread less

151 citations

Proceedings Article•DOI•

LT codes-based secure and reliable cloud storage service

[...]

Ning Cao¹, Shucheng Yu², Zhenyu Yang³, Wenjing Lou⁴, Y. Thomas Hou⁴ - Show less +1 more•Institutions (4)

Worcester Polytechnic Institute¹, University of Arkansas at Little Rock², Amazon.com³, Virginia Tech⁴

25 Mar 2012

TL;DR: This paper designs a secure cloud storage service which addresses the reliability issue with near-optimal overall performance, and proposes an exact repair solution so that no metadata needs to be generated on the fly for repaired data.

...read moreread less

Abstract: With the increasing adoption of cloud computing for data storage, assuring data service reliability, in terms of data correctness and availability, has been outstanding. While redundancy can be added into the data for reliability, the problem becomes challenging in the “pay-as-you-use” cloud paradigm where we always want to efficiently resolve it for both corruption detection and data repair. Prior distributed storage systems based on erasure codes or network coding techniques have either high decoding computational cost for data users, or too much burden of data repair and being online for data owners. In this paper, we design a secure cloud storage service which addresses the reliability issue with near-optimal overall performance. By allowing a third party to perform the public integrity verification, data owners are significantly released from the onerous work of periodically checking data integrity. To completely free the data owner from the burden of being online after data outsourcing, this paper proposes an exact repair solution so that no metadata needs to be generated on the fly for repaired data. The performance analysis and experimental results show that our designed service has comparable storage and communication cost, but much less computational cost during data retrieval than erasure codes-based storage solutions. It introduces less storage cost, much faster data retrieval, and comparable communication cost comparing to network coding-based distributed storage systems.

...read moreread less

151 citations

Proceedings Article•

Stochastic Activation Pruning for Robust Adversarial Defense

[...]

Guneet S. Dhillon¹, Kamyar Azizzadenesheli², Zachary C. Lipton³, Jeremy Bernstein⁴, Jean Kossaifi⁵, Aran Khanna¹, Animashree Anandkumar⁴ - Show less +3 more•Institutions (5)

Amazon.com¹, Stanford University², Carnegie Mellon University³, California Institute of Technology⁴, Imperial College London⁵

15 Feb 2018

TL;DR: Stochastic Activation Pruning (SAP) is proposed, a mixed strategy for adversarial defense that prunes a random subset of activations (preferentially pruning those with smaller magnitude) and scales up the survivors to compensate.

...read moreread less

Abstract: Neural networks are known to be vulnerable to adversarial examples. Carefully chosen perturbations to real images, while imperceptible to humans, induce misclassification and threaten the reliability of deep learning systems in the wild. To guard against adversarial examples, we take inspiration from game theory and cast the problem as a minimax zero-sum game between the adversary and the model. In general, for such games, the optimal strategy for both players requires a stochastic policy, also known as a mixed strategy. In this light, we propose Stochastic Activation Pruning (SAP), a mixed strategy for adversarial defense. SAP prunes a random subset of activations (preferentially pruning those with smaller magnitude) and scales up the survivors to compensate. We can apply SAP to pretrained networks, including adversarially trained models, without fine-tuning, providing robustness against adversarial examples. Experiments demonstrate that SAP confers robustness against attacks, increasing accuracy and preserving calibration.

...read moreread less

150 citations

Journal Article•DOI•

The potential impact of new Andean dams on Amazon fluvial ecosystems.

[...]

Bruce R. Forsberg¹, John M. Melack², Thomas Dunne², Ronaldo Borges Barthem, Michael Goulding³, Rodrigo Cauduro Dias de Paiva⁴, Mino Viana Sorribas⁴, Urbano L. Silva¹, Sabine Weisser⁵ - Show less +5 more•Institutions (5)

Amazon.com¹, University of California, Santa Barbara², Wildlife Conservation Society³, Universidade Federal do Rio Grande do Sul⁴, University of Konstanz⁵

23 Aug 2017-PLOS ONE

TL;DR: Together, these six dams are predicted to reduce the supply of sediments, phosphorus and nitrogen from the Andean region and to the entire Amazon basin by 64, 51 and 23%, respectively, which will have major impacts on channel geomorphology, floodplain fertility and aquatic productivity.

...read moreread less

Abstract: Increased energy demand has led to plans for building many new dams in the western Amazon, mostly in the Andean region. Historical data and mechanistic scenarios are used to examine potential impacts above and below six of the largest dams planned for the region, including reductions in downstream sediment and nutrient supplies, changes in downstream flood pulse, changes in upstream and downstream fish yields, reservoir siltation, greenhouse gas emissions and mercury contamination. Together, these six dams are predicted to reduce the supply of sediments, phosphorus and nitrogen from the Andean region by 69, 67 and 57% and to the entire Amazon basin by 64, 51 and 23%, respectively. These large reductions in sediment and nutrient supplies will have major impacts on channel geomorphology, floodplain fertility and aquatic productivity. These effects will be greatest near the dams and extend to the lowland floodplains. Attenuation of the downstream flood pulse is expected to alter the survival, phenology and growth of floodplain vegetation and reduce fish yields below the dams. Reservoir filling times due to siltation are predicted to vary from 106-6240 years, affecting the storage performance of some dams. Total CO2 equivalent carbon emission from 4 Andean dams was expected to average 10 Tg y-1 during the first 30 years of operation, resulting in a MegaWatt weighted Carbon Emission Factor of 0.139 tons C MWhr-1. Mercury contamination in fish and local human populations is expected to increase both above and below the dams creating significant health risks. Reservoir fish yields will compensate some downstream losses, but increased mercury contamination could offset these benefits.

...read moreread less

150 citations

Collapse

Authors

Showing all 13498 results

Name	H-index	Papers	Citations
Jiawei Han	168	1233	143427
Bernhard Schölkopf	148	1092	149492
Christos Faloutsos	127	789	77746
Alexander J. Smola	122	434	110222
Rama Chellappa	120	1031	62865
William F. Laurance	118	470	56464
Andrew McCallum	113	472	78240
Michael J. Black	112	429	51810
David Heckerman	109	483	62668
Larry S. Davis	107	693	49714
Chris M. Wood	102	795	43076
Pietro Perona	102	414	94870
Guido W. Imbens	97	352	64430
W. Bruce Croft	97	426	39918
Chunhua Shen	93	681	37468