scispace - formally typeset
J

Javier Alejandro Varela

Researcher at Kaiserslautern University of Technology

Publications -  14
Citations -  57

Javier Alejandro Varela is an academic researcher from Kaiserslautern University of Technology. The author has contributed to research in topics: Software portability & Xeon Phi. The author has an hindex of 4, co-authored 12 publications receiving 45 citations.

Papers
More filters
Proceedings ArticleDOI

A quantitative cross-architecture study of morphological image processing on CPUs, GPUs, and FPGAs

TL;DR: It is demonstrated that even high-end GPUs cannot achieve the throughputs of modern CPUs and FPGAs by far, and an FPGA implementation is 8-10 times more energy efficient for this application, being comparable in speed to CPUs for large kernels.
Proceedings ArticleDOI

Reverse longstaff-schwartz american option pricing on hybrid CPU/FPGA systems

TL;DR: This paper presents a novel way to price high-dimensional American options using techniques of the embedded community and exploits the FPGA reconfiguration to deliver high-throughput.
Journal ArticleDOI

Nested MC-Based Risk Measurement of Complex Portfolios: Acceleration and Energy Efficiency

TL;DR: This work approaches the estimation of the widely-employed portfolio risk metrics value-at-risk (VaR) and conditional value- at- risk (c VaR) by means of nested Monte Carlo (MC) simulations by combining theory and software/hardware implementation.
Proceedings ArticleDOI

Near Real-Time Risk Simulation of Complex Portfolios on Heterogeneous Computing Systems with OpenCL

TL;DR: The combination of OpenCL, a new bit-accurate algorithmic optimization, and the extension of an existing numerical scheme using interpolation allows this approach to achieve over 1000x speedup compared to the state-of-the-art approach, making this approach even feasible for intraday risk analysis.
Proceedings ArticleDOI

Exploiting Decoupled OpenCL Work-Items with Data Dependencies on FPGAs: A Case Study

TL;DR: This work presents a new approach for FPGA implementations that decouples the parallel OpenCL work-items, avoiding the interference of data-dependent branches between them, and shows how to efficiently interleave computation with transfers to device global memory in each work-item.