K
Koji Nakano
Researcher at Hiroshima University
Publications - 308
Citations - 3579
Koji Nakano is an academic researcher from Hiroshima University. The author has contributed to research in topics: Field-programmable gate array & Parallel algorithm. The author has an hindex of 30, co-authored 295 publications receiving 3342 citations. Previous affiliations of Koji Nakano include Nagoya Institute of Technology & Hitachi.
Papers
More filters
Proceedings ArticleDOI
A Simple Parallel Convex Hulls Algorithm for Sorted Points and the Performance Evaluation on the Multicore Processors
TL;DR: A simple parallel algorithm for computing the convex hull of a set of n sorted points in the plane and the results show that the implementation achieves a speed-up factor of approximately 7 using 8 processors.
Journal ArticleDOI
Implementations of the Hough Transform on the Embedded Multicore Processors
TL;DR: Two implementations of the Hough transform on the FPGA and the GPU are presented, an efficient usage of DSP slices and block RAMs for FPGAs, and the shared memory for GPUs.
Proceedings ArticleDOI
A Memory-Access-Efficient Implementation of the Approximate String Matching Algorithm on GPU
TL;DR: The main contribution of this work is to present a memory-access-efficient implementation for computing the ASM on a GPU that relies on warp shuffle operations, which are used to reduce the communication overhead between threads.
Proceedings ArticleDOI
Bitwise Parallel Bulk Computation on the GPU, with Application to the CKY Parsing for Context-Free Grammars
TL;DR: It is shown that the pairwise sums of a lot of integers can be computed faster using the BPBC technique, if the values of input integers are not large, and that the CKY parsing for context-free grammars can be implemented in the GPU efficiently using this technique.
Proceedings ArticleDOI
The super warp architecture with random address shift
Koji Nakano,Susumu Matsumae +1 more
TL;DR: The Random Super Discrete Memory Machine (RSDMM), an extended version of the DMM, which supports a super warp with multiple warps and the random address shift technique is applied, to present novel and practical parallel computing models in which the congestion is small for any memory access requests.