scispace - formally typeset
A

Ayose Falcón

Researcher at Intel

Publications -  37
Citations -  781

Ayose Falcón is an academic researcher from Intel. The author has contributed to research in topics: Branch predictor & Thread (computing). The author has an hindex of 13, co-authored 37 publications receiving 772 citations. Previous affiliations of Ayose Falcón include Hewlett-Packard & Polytechnic University of Catalonia.

Papers
More filters
Journal ArticleDOI

COTSon: infrastructure for full system simulation

TL;DR: COTSon opens up a new dimension in the speed/accuracy space, allowing simulation of a cluster of nodes several orders of magnitude faster with a minimal accuracy loss, and abandon the idea of "always-on" cycle-based simulation in favor of statistical sampling approaches that can trade accuracy for speed.
Journal ArticleDOI

How to simulate 1000 cores

TL;DR: A novel methodology to efficiently simulate shared-memory multiprocessors composed of hundreds of cores, which captures the intrinsic behavior of the SPLASH-2 suite, even when the number of shared- memory cores beyond the thousand-core limit is scaled up.
Patent

Weight-shifting mechanism for convolutional neural networks

TL;DR: A processor includes a processor core and a calculation circuit as discussed by the authors, which includes logic to determine a set of weights for use in a convolutional neural network (CNN) calculation and scale up the weights using a scale value.
Patent

Storage device and method for performing convolution operations

TL;DR: In this paper, a storage device and method for performing convolution operations is described, which comprises a plurality of processing units to execute convolution operation on input data and partial results.
Patent

Method and apparatus for distributed and cooperative computation in artificial neural networks

TL;DR: In this paper, an apparatus and method for distributed and cooperative computation in artificial neural networks is described, which comprises an input/output (I/O) interface, a plurality of processing units communicatively coupled to the I/O interface to receive data for input neurons and synaptic weights associated with each of the input neurons, each unit processing at least a portion of the data for the inputs and weights to generate partial results.