scispace - formally typeset
E

Elmoustapha Ould-Ahmed-Vall

Researcher at Intel

Publications -  299
Citations -  1664

Elmoustapha Ould-Ahmed-Vall is an academic researcher from Intel. The author has contributed to research in topics: Operand & Opcode. The author has an hindex of 19, co-authored 299 publications receiving 1656 citations. Previous affiliations of Elmoustapha Ould-Ahmed-Vall include Georgia Institute of Technology & AMIT.

Papers
More filters
Patent

Systems and methods to accelerate multiplication of sparse matrices

TL;DR: In this article, a processor is to fetch and decode an instruction having fields to specify locations of first, second, and third matrices, and an opcode indicating the processor was to multiply and accumulate matching non-zero (NZ) elements of the first and second matrices with corresponding elements on the third matrix.
Patent

Apparatus and method for fused multiply-multiply instructions

TL;DR: In this paper, a processor device including a storage location configured to store a set of source packed-data operands, each of the operands having a plurality of packed data elements that are positive or negative according to an immediate bit value within one of the source operands.
Patent

Instructions having support for floating point and integer data types in the same register

TL;DR: In this article, the authors present a general-purpose graphics compute unit to execute the single decoded instruction, where to execute a first instruction operation on a first set of operands of the multiple operands at a first precision and a simultaneously perform second instruction operations on a second set of operators at a second precision.
Patent

Apparatus and method for multiplying, summing, and accumulating sets of packed bytes

TL;DR: In this paper, a processor having a decoder to decode an instruction to generate a decoded instruction, a first source register to store a first plurality of packed signed bytes, a second source register for storing a second plurality of signed bytes; execution circuitry to execute the decoded instructions, the execution circuitry including: multiplier circuitry, adder circuitry, negation and extension circuitry, and accumulation circuitry to add each of the doublewords sums to a doubleword from a third source to generate final doubleword results.
Patent

On demand MSAA resolve during lens correction and/or other post-processing phases

TL;DR: In this paper, the authors propose a post-processing approach that identifies, at an image post-processor, unresolved surface data and, at the image postprocessor, control data associated with the unresolved surfaces data.