W
Wayne Luk
Researcher at Imperial College London
Publications - 737
Citations - 13643
Wayne Luk is an academic researcher from Imperial College London. The author has contributed to research in topics: Field-programmable gate array & Reconfigurable computing. The author has an hindex of 54, co-authored 703 publications receiving 12517 citations. Previous affiliations of Wayne Luk include Fudan University & University of London.
Papers
More filters
Proceedings ArticleDOI
FLiMS: Fast Lightweight Merge Sorter
TL;DR: A highly-efficient and simple parallel hardware design for merging two sorted lists residing in banked (or multi-ported) memory using a modified version of the bitonic merge block, as found in a bitonic sorter, repurposed for performing parallel merge for streaming data.
Book ChapterDOI
Parametric encryption hardware design
TL;DR: New scalable hardware designs of modular multiplication, modular exponentiation and primality test are presented, based on an original Montgomery modular multiplier, which is the first Montgomery multiplier design with variable pipeline stages and variable serial replications.
Proceedings ArticleDOI
Reconfigurable Control Variate Monte-Carlo Designs for Pricing Exotic Options
TL;DR: An FPGA-accelerated control variate Monte-Carlo (CVMC) framework for pricing exotic options is proposed and an optimised implementation of arithmetic Asian option pricing under this framework in a Virtex-5 xc5vlx330t FPGa at 200MHz is 24 times faster than a multi-threaded software implementation.
Journal ArticleDOI
Power-Adaptive Computing System Design for Solar-Energy-Powered Embedded Systems
TL;DR: The present thorough design approach for self-powered embedded computing systems has a better utilization of ambient energy sources and can track the variable power supply better.
Proceedings ArticleDOI
Memory-Efficient Architecture for Accelerating Generative Networks on FPGA
Shuanglong Liu,Chenglong Zeng,Hongxiang Fan,Ho-Cheung Ng,Jiuxi Meng,Zhiqiang Que,Xinyu Niu,Wayne Luk +7 more
TL;DR: A novel parametrized deconvolutional architecture based on an FPGA-friendly method is proposed to accelerate the generator of GANs, by storing all intermediate data in on-chip memories and significantly reducing off-chip data transfers.