E
Elmoustapha Ould-Ahmed-Vall
Researcher at Intel
Publications - 299
Citations - 1664
Elmoustapha Ould-Ahmed-Vall is an academic researcher from Intel. The author has contributed to research in topics: Operand & Opcode. The author has an hindex of 19, co-authored 299 publications receiving 1656 citations. Previous affiliations of Elmoustapha Ould-Ahmed-Vall include Georgia Institute of Technology & AMIT.
Papers
More filters
Patent
Systems and methods to accelerate multiplication of sparse matrices
Baum Dan,Chen Koren,Elmoustapha Ould-Ahmed-Vall,Espig Michael,Christopher J. Hughes,Raanan Sade,Robert Valentine,Mark J. Charney,Alexander Heinecke +8 more
TL;DR: In this article, a processor is to fetch and decode an instruction having fields to specify locations of first, second, and third matrices, and an opcode indicating the processor was to multiply and accumulate matching non-zero (NZ) elements of the first and second matrices with corresponding elements on the third matrix.
Patent
Apparatus and method for fused multiply-multiply instructions
Elmoustapha Ould-Ahmed-Vall,Robert Valentine,Corbal Jesus,Mark J. Charney,Roger Espasa,Guillem Sole,Manel Fernandez,Brian J. Hickmann +7 more
TL;DR: In this paper, a processor device including a storage location configured to store a set of source packed-data operands, each of the operands having a plurality of packed data elements that are positive or negative according to an immediate bit value within one of the source operands.
Patent
Instructions having support for floating point and integer data types in the same register
Elmoustapha Ould-Ahmed-Vall,Barath Lakshmanan,Tatiana Shpeisman,Ray Joydeep,Ping T. Tang,Michael S. Strickland,Xiaoming Chen,Anbang Yao,Ben Ashbaugh,Linda L. Hurd,Liwei Ma +10 more
TL;DR: In this article, the authors present a general-purpose graphics compute unit to execute the single decoded instruction, where to execute a first instruction operation on a first set of operands of the multiple operands at a first precision and a simultaneously perform second instruction operations on a second set of operators at a second precision.
Patent
Apparatus and method for multiplying, summing, and accumulating sets of packed bytes
TL;DR: In this paper, a processor having a decoder to decode an instruction to generate a decoded instruction, a first source register to store a first plurality of packed signed bytes, a second source register for storing a second plurality of signed bytes; execution circuitry to execute the decoded instructions, the execution circuitry including: multiplier circuitry, adder circuitry, negation and extension circuitry, and accumulation circuitry to add each of the doublewords sums to a doubleword from a third source to generate final doubleword results.
Patent
On demand MSAA resolve during lens correction and/or other post-processing phases
Tomer Bar-On,Hugues Labbe,Adam T. Lake,Kai Xiao,Ankur N. Shah,Johannes Guenther,Appu Abhishek R,Ray Joydeep,Deepak S. Vembar,Elmoustapha Ould-Ahmed-Vall +9 more
TL;DR: In this paper, the authors propose a post-processing approach that identifies, at an image post-processor, unresolved surface data and, at the image postprocessor, control data associated with the unresolved surfaces data.