Institution

Amazon.com

Company•Seattle, Washington, United States•

About: Amazon.com is a company organization based out in Seattle, Washington, United States. It is known for research contribution in the topics: Computer science & Service (business). The organization has 13363 authors who have published 17317 publications receiving 266589 citations.

...read moreread less

Topics: Computer science, Service (business), Service provider, Context (language use), Virtual machine ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Proceedings Article•

Robust random cut forest based anomaly detection on streams

[...]

Sudipto Guha¹, Nina Mishra², Gourav Roy², Okke Schrijvers³•Institutions (3)

University of Pennsylvania¹, Amazon.com², Stanford University³

19 Jun 2016

TL;DR: A robust random cut data structure that can be used as a sketch or synopsis of the input stream is investigated and it is shown how the sketch can be efficiently updated in a dynamic data stream.

...read moreread less

Abstract: In this paper we focus on the anomaly detection problem for dynamic data streams through the lens of random cut forests. We investigate a robust random cut data structure that can be used as a sketch or synopsis of the input stream. We provide a plausible definition of non-parametric anomalies based on the influence of an unseen point on the remainder of the data, i.e., the externality imposed by that point. We show how the sketch can be efficiently updated in a dynamic data stream. We demonstrate the viability of the algorithm on publicly available real data.

...read moreread less

190 citations

Proceedings Article•

Transfer Learning across Low-Resource, Related Languages for Neural Machine Translation

[...]

Toan Q. Nguyen¹, David Chiang²•Institutions (2)

Amazon.com¹, University of Notre Dame²

31 Aug 2017

TL;DR: The experiments show that transfer learning helps word-based translation only slightly, but when used on top of a much stronger BPE baseline, it yields larger improvements of up to 4.3 BLEU.

...read moreread less

Abstract: We present a simple method to improve neural translation of a low-resource language pair using parallel data from a related, also low-resource, language pair. The method is based on the transfer method of Zoph et al., but whereas their method ignores any source vocabulary overlap, ours exploits it. First, we split words using Byte Pair Encoding (BPE) to increase vocabulary overlap. Then, we train a model on the first language pair and transfer its parameters, including its source word embeddings, to another model and continue training on the second language pair. Our experiments show that transfer learning helps word-based translation only slightly, but when used on top of a much stronger BPE baseline, it yields larger improvements of up to 4.3 BLEU.

...read moreread less

189 citations

Patent•

Assessing content based on assessed trust in users

[...]

Christopher D. Vander Mey¹, Arijit Ghosh¹, Brian David Marsh¹•Institutions (1)

Amazon.com¹

30 Nov 2005

TL;DR: In this article, a piece of content is automatically assessed in a manner based on automatically assessed levels of trust in users who are associated with the content, such as a user who authored or otherwise supplied the content and/or users who evaluated the content.

...read moreread less

Abstract: Techniques are described for managing content by identifying content that has attributes of interest (e.g., content that is useful, humorous and/or that otherwise has a sufficiently high degree of quality) and by determining how to use such identified content. In some situations, a piece of content is automatically assessed in a manner based on automatically assessed levels of trust in users who are associated with the content, such as a user who authored or otherwise supplied the content and/or users who evaluated the content. For example, an automatically assessed level of trust for a user may be based on prior activities of the user and be used to predict future behavior of the user as a supplier of acceptable content and/or as an acceptable evaluator of supplied content, such as based on prior activities of the user that are not related to supplying and/or evaluating content.

...read moreread less

188 citations

Proceedings Article•DOI•

Deep Active Learning for Named Entity Recognition

[...]

Yanyao Shen¹, Hyokun Yun², Zachary C. Lipton³, Yakov Kronrod⁴, Animashree Anandkumar⁵ - Show less +1 more•Institutions (5)

Tsinghua University¹, Amazon.com², Carnegie Mellon University³, University of Pennsylvania⁴, California Institute of Technology⁵

01 Aug 2017

TL;DR: In this article, the authors combine deep learning with active learning and show that they can outperform classical methods even with a significantly smaller amount of training data than a large dataset or a large budget for manually labeling data.

...read moreread less

Abstract: Deep neural networks have advanced the state of the art in named entity recognition. However, under typical training procedures, advantages over classical methods emerge only with large datasets. As a result, deep learning is employed only when large public datasets or a large budget for manually labeling data is available. In this work, we show otherwise: by combining deep learning with active learning, we can outperform classical methods even with a significantly smaller amount of training data.

...read moreread less

188 citations

Journal Article•DOI•

Standardized Assessment of Biodiversity Trends in Tropical Forest Protected Areas: The End Is Not in Sight

[...]

Lydia Beaudrot¹, Jorge A. Ahumada¹, Timothy G. O'Brien², Patricia Alvarez-Loayza³, Kelly Boekee⁴, Ahimsa Campos-Arceiz⁵, David Eichberg, Santiago Espinosa⁶, Eric Fegraus¹, Christine Fletcher⁷, Krisna Gajapersad¹, Chris Hallam², Johanna Hurtado⁸, Patrick A. Jansen⁹, Patrick A. Jansen⁴, Ajay Kumar¹⁰, Eileen Larney, Marcela Guimarães Moreira Lima¹¹, Colin Mahony¹⁰, Emanuel H. Martin, Alex McWilliam², Badru Mugerwa¹², Mireille Ndoundou-Hockemba², Jean Claude Razafimahaimodison, Hugo Romero-Saltos¹³, Francesco Rovero, Julia Salvador¹⁴, Fernanda Santos¹¹, Douglas Sheil¹⁵, Wilson Roberto Spironello¹⁶, Michael R. Willig¹⁷, Nurul L. Winarni¹⁸, Alex Zvoleff¹, Sandy J. Andelman¹ - Show less +30 more•Institutions (18)

Conservation International¹, Wildlife Conservation Society², Duke University³, Wageningen University and Research Centre⁴, University of Nottingham Malaysia Campus⁵, Pontificia Universidad Católica del Ecuador⁶, Forest Research Institute Malaysia⁷, Organization for Tropical Studies⁸, Smithsonian Tropical Research Institute⁹, Hewlett-Packard¹⁰, Federal University of Pará¹¹, Mbarara University of Science and Technology¹², Universidad Yachay Tech¹³, University of Florida¹⁴, Norwegian University of Life Sciences¹⁵, Amazon.com¹⁶, University of Connecticut¹⁷, University of Indonesia¹⁸

19 Jan 2016-PLOS Biology

TL;DR: Evaluating occupancy trends for 511 populations of terrestrial mammals and birds, representing 244 species from 15 tropical forest protected areas on three continents, finds that occupancy declined in 22, increased in 17%, and exhibited no change in 22% of populations during the last 3–8 years, while 39% of population were detected too infrequently to assess occupancy changes.

...read moreread less

Abstract: Extinction rates in the Anthropocene are three orders of magnitude higher than background and disproportionately occur in the tropics, home of half the world’s species. Despite global efforts to combat tropical species extinctions, lack of high-quality, objective information on tropical biodiversity has hampered quantitative evaluation of conservation strategies. In particular, the scarcity of population-level monitoring in tropical forests has stymied assessment of biodiversity outcomes, such as the status and trends of animal populations in protected areas. Here, we evaluate occupancy trends for 511 populations of terrestrial mammals and birds, representing 244 species from 15 tropical forest protected areas on three continents. For the first time to our knowledge, we use annual surveys from tropical forests worldwide that employ a standardized camera trapping protocol, and we compute data analytics that correct for imperfect detection. We found that occupancy declined in 22%, increased in 17%, and exhibited no change in 22% of populations during the last 3–8 years, while 39% of populations were detected too infrequently to assess occupancy changes. Despite extensive variability in occupancy trends, these 15 tropical protected areas have not exhibited systematic declines in biodiversity (i.e., occupancy, richness, or evenness) at the community level. Our results differ from reports of widespread biodiversity declines based on aggregated secondary data and expert opinion and suggest less extreme deterioration in tropical forest protected areas. We simultaneously fill an important conservation data gap and demonstrate the value of large-scale monitoring infrastructure and powerful analytics, which can be scaled to incorporate additional sites, ecosystems, and monitoring methods. In an era of catastrophic biodiversity loss, robust indicators produced from standardized monitoring infrastructure are critical to accurately assess population outcomes and identify conservation strategies that can avert biodiversity collapse.

...read moreread less

188 citations

Collapse

Authors

Showing all 13498 results

Name	H-index	Papers	Citations
Jiawei Han	168	1233	143427
Bernhard Schölkopf	148	1092	149492
Christos Faloutsos	127	789	77746
Alexander J. Smola	122	434	110222
Rama Chellappa	120	1031	62865
William F. Laurance	118	470	56464
Andrew McCallum	113	472	78240
Michael J. Black	112	429	51810
David Heckerman	109	483	62668
Larry S. Davis	107	693	49714
Chris M. Wood	102	795	43076
Pietro Perona	102	414	94870
Guido W. Imbens	97	352	64430
W. Bruce Croft	97	426	39918
Chunhua Shen	93	681	37468