Institution

Amazon.com

Company · Seattle, Washington, United States
About: Amazon.com is a company based in Seattle, Washington, United States. It is known for research contributions in the topics of Computer science and Service (business). The organization has 13363 authors who have published 17317 publications receiving 266589 citations.


Papers
Patent
29 Mar 2006
TL;DR: In this article, user interaction with the web site is collected in a database for a plurality of users, and the database is mined to extract relationships between selected attributes and probabilities, which are captured in probability models.
Abstract: An apparatus and methods advantageously select content items for dynamically-generated web pages in an intelligent and virtually autonomous manner. This permits the operator of the web site to rapidly identify and respond to trends, thereby updating the web site relatively quickly and efficiently with little or none of the time-consuming and expensive manual labor otherwise required. User interaction with the web site is collected in a database for a plurality of users. For various content items, the database is mined to extract relationships between selected attributes and probabilities, and these relationships are captured in probability models. When a new web page is requested, attributes, which can include attributes associated with a user, are used as references into the applicable probability models of selected content items; the resulting probabilities are combined with value weightings to generate expected values, and content items are selected for use in the web page at least partially based on those expected values.
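As a rough sketch of the selection step described in this abstract, the following Python fragment scores candidate content items by expected value. All names, the attribute keys, and the lookup-table form of the probability models are hypothetical; the patent does not specify an implementation.

# Hypothetical illustration of expected-value content selection.
# A "probability model" here is just a lookup table from attribute
# tuples to an estimated probability of user interest.

def expected_value(prob_model, attributes, value_weight):
    """Probability of interest for the given attributes, weighted by
    the value of showing this content item."""
    p = prob_model.get(attributes, prob_model.get("default", 0.0))
    return p * value_weight

def select_content(candidates, attributes, num_slots):
    """Rank candidate content items by expected value and keep the
    best ones for the dynamically generated page."""
    scored = sorted(
        ((expected_value(c["model"], attributes, c["value"]), c["id"])
         for c in candidates),
        reverse=True,
    )
    return [item_id for _, item_id in scored[:num_slots]]

# Example: a returning user is shown the item with the higher expected value.
candidates = [
    {"id": "banner-1", "value": 1.0,
     "model": {("returning",): 0.12, "default": 0.05}},
    {"id": "promo-7", "value": 2.5,
     "model": {("returning",): 0.03, "default": 0.02}},
]
print(select_content(candidates, ("returning",), num_slots=1))  # ['banner-1']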

155 citations

Journal Article
TL;DR: rRNA depletion is found to be responsible for substantial, unappreciated coverage biases introduced during library preparation, which suggests that exon-level expression analysis may be inadvisable; the results also demonstrate the utility of IVT-seq for promoting a better understanding of the bias introduced by RNA-seq.
Abstract: Background: RNA-seq is a powerful technique for identifying and quantifying transcription and splicing events, both known and novel. However, given its recent development and the proliferation of library construction methods, understanding the bias it introduces is incomplete but critical to realizing its value. Results: We present a method, in vitro transcription sequencing (IVT-seq), for identifying and assessing the technical biases in RNA-seq library generation and sequencing at scale. We created a pool of over 1,000 in vitro transcribed RNAs from a full-length human cDNA library and sequenced them with polyA and total RNA-seq, the most common protocols. Because each cDNA is full length, and we show in vitro transcription is incredibly processive, each base in each transcript should be equivalently represented. However, with common RNA-seq applications and platforms, we find 50% of transcripts have more than two-fold and 10% have more than 10-fold differences in within-transcript sequence coverage. We also find greater than 6% of transcripts have regions of dramatically unpredictable sequencing coverage between samples, confounding accurate determination of their expression. We use a combination of experimental and computational approaches to show rRNA depletion is responsible for the most significant variability in coverage, and several sequence determinants also strongly influence representation. Conclusions: These results show the utility of IVT-seq for promoting better understanding of bias introduced by RNA-seq. We find rRNA depletion is responsible for substantial, unappreciated biases in coverage introduced during library preparation. These biases suggest exon-level expression analysis may be inadvisable, and we recommend caution when interpreting RNA-seq results.
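To make the within-transcript coverage statistic concrete, here is a minimal sketch of how a fold-difference in per-base coverage could be computed. The function, the percentile trimming, and the toy data are illustrative assumptions, not the paper's actual pipeline, which would derive per-base coverage from aligned reads.

import numpy as np

# Illustrative computation of within-transcript coverage fold-difference,
# the statistic behind the 2-fold / 10-fold summaries above.

def fold_difference(coverage, trim=0.05):
    """Ratio of high to low per-base coverage within one transcript,
    trimming extreme percentiles so single-base spikes don't dominate."""
    cov = np.asarray(coverage, dtype=float)
    lo = np.percentile(cov, 100 * trim)
    hi = np.percentile(cov, 100 * (1 - trim))
    return float("inf") if lo == 0 else hi / lo

uniform = np.full(1000, 200.0)           # evenly represented transcript
ramped = np.linspace(10.0, 400.0, 1000)  # strong coverage ramp along the transcript
print(fold_difference(uniform))  # ~1.0
print(fold_difference(ramped))   # >10, i.e. dramatically uneven coverage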

155 citations

Proceedings Article
22 Apr 2006
TL;DR: This paper describes a study of forty-eight academics and the techniques and tools they use to manage their digital and material archiving of papers, emails, documents, internet bookmarks, correspondence, and other artifacts, and presents two sets of results.
Abstract: The personal archive is not only about efficient storage and retrieval of information. This paper describes a study of forty-eight academics and the techniques and tools they use to manage their digital and material archiving of papers, emails, documents, internet bookmarks, correspondence, and other artifacts. We present two sets of results: we first discuss rationales behind subjects' archiving, which go beyond information retrieval to include creating a legacy, sharing resources, confronting fears and anxieties, and identity construction. We then show how these rationales were mapped into our subjects' physical, social and electronic spaces, and discuss implications for development of digital tools that allow for personal archiving.

155 citations

Proceedings Article
01 Jan 2019
TL;DR: This paper evaluates classification-based training on several standard retrieval datasets, such as CAR-196, CUB-200-2011, Stanford Online Product, and In-Shop, for image retrieval and clustering, and establishes that the approach is competitive across different feature dimensions and base feature networks.
Abstract: Deep metric learning aims to learn a function mapping image pixels to embedding feature vectors that model the similarity between images. Two major applications of metric learning are content-based image retrieval and face verification. For the retrieval tasks, the majority of current state-of-the-art (SOTA) approaches are triplet-based non-parametric training. For the face verification tasks, however, recent SOTA approaches have adopted classification-based parametric training. In this paper, we look into the effectiveness of classification-based approaches on image retrieval datasets. We evaluate on several standard retrieval datasets such as CAR-196, CUB-200-2011, Stanford Online Product, and In-Shop datasets for image retrieval and clustering, and establish that our classification-based approach is competitive across different feature dimensions and base feature networks. We further provide insights into the performance effects of subsampling classes for scalable classification-based training, and the effects of binarization, enabling efficient storage and computation for practical applications.
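As a rough sketch of the classification-based training idea (normalized embeddings trained with an ordinary softmax classifier, then reused directly for retrieval), the following PyTorch fragment may help. The layer sizes, the temperature, and the normalized-softmax head are assumptions for illustration, not the paper's exact recipe.

import torch
import torch.nn as nn
import torch.nn.functional as F

# Sketch: train an embedding with a class-label softmax, then reuse
# the (optionally binarized) embedding for retrieval.

class EmbeddingClassifier(nn.Module):
    def __init__(self, backbone_dim=512, embed_dim=128, num_classes=196):
        super().__init__()
        self.project = nn.Linear(backbone_dim, embed_dim)  # embedding head
        self.class_weights = nn.Parameter(torch.randn(num_classes, embed_dim))

    def forward(self, features, temperature=0.05):
        # Normalize embeddings and class weights so the logits are
        # cosine similarities scaled by a temperature.
        emb = F.normalize(self.project(features), dim=1)
        w = F.normalize(self.class_weights, dim=1)
        return emb, emb @ w.t() / temperature

model = EmbeddingClassifier()
features = torch.randn(8, 512)  # stand-in for base feature network outputs
emb, logits = model(features)
loss = F.cross_entropy(logits, torch.randint(0, 196, (8,)))
loss.backward()

# At retrieval time, gallery items are ranked by cosine similarity of emb;
# binarizing the embedding (sign of each dimension) trades some accuracy
# for cheaper storage and Hamming-distance search.
binary_codes = (emb > 0).to(torch.uint8)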

155 citations

Posted Content
TL;DR: It is shown that pre-norm residual connections (PreNorm) and smaller initializations enable warmup-free, validation-based training with large learning rates, and l2 normalization with a single scale parameter (ScaleNorm) is proposed for faster training and better performance.
Abstract: We evaluate three simple, normalization-centric changes to improve Transformer training. First, we show that pre-norm residual connections (PreNorm) and smaller initializations enable warmup-free, validation-based training with large learning rates. Second, we propose $\ell_2$ normalization with a single scale parameter (ScaleNorm) for faster training and better performance. Finally, we reaffirm the effectiveness of normalizing word embeddings to a fixed length (FixNorm). On five low-resource translation pairs from TED Talks-based corpora, these changes always converge, giving an average +1.1 BLEU over state-of-the-art bilingual baselines and a new 32.8 BLEU on IWSLT'15 English-Vietnamese. We observe sharper performance curves, more consistent gradient norms, and a linear relationship between activation scaling and decoder depth. Surprisingly, in the high-resource setting (WMT'14 English-German), ScaleNorm and FixNorm remain competitive but PreNorm degrades performance.
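A minimal PyTorch sketch of the two main ideas, ScaleNorm (l2 normalization with a single learned scale g) and a pre-norm residual connection, is given below; the epsilon and the sqrt(d_model) initialization for g are illustrative choices rather than guaranteed details of the paper.

import torch
import torch.nn as nn

class ScaleNorm(nn.Module):
    """ScaleNorm(x) = g * x / ||x||: l2 normalization over the feature
    dimension with a single learned scale g."""
    def __init__(self, scale, eps=1e-5):
        super().__init__()
        self.g = nn.Parameter(torch.tensor(float(scale)))
        self.eps = eps

    def forward(self, x):
        norm = x.norm(dim=-1, keepdim=True).clamp(min=self.eps)
        return self.g * x / norm

def prenorm_residual(x, sublayer, norm):
    # PreNorm: normalize before the sublayer, inside the residual branch,
    # i.e. x + sublayer(norm(x)) rather than norm(x + sublayer(x)).
    return x + sublayer(norm(x))

d_model = 512
norm = ScaleNorm(scale=d_model ** 0.5)  # g initialized to sqrt(d_model)
x = torch.randn(2, 10, d_model)
out = prenorm_residual(x, nn.Linear(d_model, d_model), norm)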

154 citations


Authors

Showing all 13498 results

Name | H-index | Papers | Citations
Jiawei Han | 168 | 1233 | 143427
Bernhard Schölkopf | 148 | 1092 | 149492
Christos Faloutsos | 127 | 789 | 77746
Alexander J. Smola | 122 | 434 | 110222
Rama Chellappa | 120 | 1031 | 62865
William F. Laurance | 118 | 470 | 56464
Andrew McCallum | 113 | 472 | 78240
Michael J. Black | 112 | 429 | 51810
David Heckerman | 109 | 483 | 62668
Larry S. Davis | 107 | 693 | 49714
Chris M. Wood | 102 | 795 | 43076
Pietro Perona | 102 | 414 | 94870
Guido W. Imbens | 97 | 352 | 64430
W. Bruce Croft | 97 | 426 | 39918
Chunhua Shen | 93 | 681 | 37468
Network Information
Related Institutions (5)
Microsoft
86.9K papers, 4.1M citations

89% related

Google
39.8K papers, 2.1M citations

88% related

Carnegie Mellon University
104.3K papers, 5.9M citations

87% related

ETH Zurich
122.4K papers, 5.1M citations

82% related

University of Maryland, College Park
155.9K papers, 7.2M citations

82% related

Performance Metrics
No. of papers from the Institution in previous years
Year | Papers
2023 | 4
2022 | 168
2021 | 2,015
2020 | 2,596
2019 | 2,002
2018 | 1,189