Elmoustapha Ould-Ahmed-Vall

Researcher at Intel

Publications - 299

Citations - 1664

Elmoustapha Ould-Ahmed-Vall is an academic researcher from Intel. The author has contributed to research in topics: Operand & Opcode. The author has an hindex of 19, co-authored 299 publications receiving 1656 citations. Previous affiliations of Elmoustapha Ould-Ahmed-Vall include Georgia Institute of Technology & AMIT.

Papers

PDF

Open Access

More filters

Patent

Systems and methods to accelerate multiplication of sparse matrices

Baum Dan, +8 more

TL;DR: In this article, a processor is to fetch and decode an instruction having fields to specify locations of first, second, and third matrices, and an opcode indicating the processor was to multiply and accumulate matching non-zero (NZ) elements of the first and second matrices with corresponding elements on the third matrix.

...read moreread less

Patent

Apparatus and method for fused multiply-multiply instructions

Elmoustapha Ould-Ahmed-Vall, +7 more

TL;DR: In this paper, a processor device including a storage location configured to store a set of source packed-data operands, each of the operands having a plurality of packed data elements that are positive or negative according to an immediate bit value within one of the source operands.

...read moreread less

Patent

Instructions having support for floating point and integer data types in the same register

Elmoustapha Ould-Ahmed-Vall, +10 more

TL;DR: In this article, the authors present a general-purpose graphics compute unit to execute the single decoded instruction, where to execute a first instruction operation on a first set of operands of the multiple operands at a first precision and a simultaneously perform second instruction operations on a second set of operators at a second precision.

...read moreread less

Patent

Apparatus and method for multiplying, summing, and accumulating sets of packed bytes

Madduri Venkateswara, +4 more

TL;DR: In this paper, a processor having a decoder to decode an instruction to generate a decoded instruction, a first source register to store a first plurality of packed signed bytes, a second source register for storing a second plurality of signed bytes; execution circuitry to execute the decoded instructions, the execution circuitry including: multiplier circuitry, adder circuitry, negation and extension circuitry, and accumulation circuitry to add each of the doublewords sums to a doubleword from a third source to generate final doubleword results.

...read moreread less

Patent

On demand MSAA resolve during lens correction and/or other post-processing phases

Tomer Bar-On, +9 more

TL;DR: In this paper, the authors propose a post-processing approach that identifies, at an image post-processor, unresolved surface data and, at the image postprocessor, control data associated with the unresolved surfaces data.

...read moreread less

Collapse