Home
/
Authors
/
Hoi-Jim Yoo

Author

Hoi-Jim Yoo

Bio: Hoi-Jim Yoo is an academic researcher from KAIST. The author has contributed to research in topics: Network on a chip & Graphics pipeline. The author has an hindex of 2, co-authored 2 publications receiving 55 citations.

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Solutions for Real Chip Implementation Issues of NoC and Their Application to Memory-Centric NoC

[...]

Donghyun Kim¹, Kwanho Kim¹, Joo-Young Kim¹, Seungjin Lee¹, Hoi-Jim Yoo¹ - Show less +1 more•Institutions (1)

KAIST¹

07 May 2007

TL;DR: This paper proposes memory centric NoC (MC-NoC) for homogeneous multi processor SoC (MPSoC) and Experimental result obtained by mapping edge detection tasks on the MC- noC in various configurations shows almost constant performance, which proves the effectiveness of the proposed architecture.

...read moreread less

Abstract: This paper describes real chip implementation issues of network-on-chip (NoC) and their solutions along with series of chip design examples. The solutions described in this paper cover both architectural aspects and circuit level techniques for practical chip implementation of NoC. As for architecture level solutions, topology selection, chip-aware protocol design, and on-chip serialization (OCS) for link area reduction are explained. For circuit level techniques, SERDES and synchronizer design, crossbar switch partial activation, and low-voltage link are presented as the foundations for power and area efficient NoC implementation. Regarding presented solutions for NoC implementation, this paper proposes memory centric NoC (MC-NoC) for homogeneous multi processor SoC (MPSoC). Flexibility and feasibility of task mapping on homogeneous SoC is the key feature of the MC-NoC. 8 dual port SRAMs connected to crossbar switches in hierarchical star topology network facilitate data communication between processors, regardless of task mapping into the MC-NoC. Experimental result obtained by mapping edge detection tasks on the MC-NoC in various configurations shows almost constant performance. This result proves the effectiveness of the proposed architecture. The MC-NoC based SoC is also implemented on TSMC 0.18 um process technology

...read moreread less

38 citations

Journal Article•DOI•

A 195 mW, 9.1 MVertices/s Fully Programmable 3-D Graphics Processor for Low-Power Mobile Devices

[...]

Jeong-Ho Woo¹, Ju-Ho Sohn¹, Hyejung Kim¹, Jongcheol Jeong², Euljoo Jeong², Suk-Joong Lee², Hoi-Jim Yoo¹ - Show less +3 more•Institutions (2)

KAIST¹, Samsung²

01 Nov 2007

TL;DR: The mobile unified shader provides programmable per-vertex operations and per-pixel operations in a single hardware and thus, it achieves 35% area and 28% power reduction compared with previous architecture.

...read moreread less

Abstract: A 195 mW, 9.1 Mvertices/s fully programmable 3-D graphics processor is designed and implemented for mobile devices. The mobile unified shader provides programmable per-vertex operations and per-pixel operations in a single hardware and thus, it achieves 35% area and 28% power reduction compared with previous architecture. The pixel-vertex multi-threading enhances the 3-D graphics performance by enabling to compute the per-vertex operations and the per-pixel operations at the same time. By adopting the pixel-vertex multi-threading, 94% of the per-vertex operations are interleaved into the per-pixel operations and enhances 3-D graphics performance in real applications. The logarithmic lighting engine and specialized lighting instruction improve the vertex throughput including transform and OpenGL lighting up to 9.1 Mvertices/s, which is 2.5 times higher performance compared with previous works. The proposed 3-D graphics processor is implemented in 3.3 mmtimes3.0 mm using 0.13 mum CMOS process and it was successfully demonstrated on the system evaluation board.

...read moreread less

17 citations

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Outstanding Research Problems in NoC Design: System, Microarchitecture, and Circuit Perspectives

[...]

Radu Marculescu¹, Umit Y. Ogras¹, Li-Shiuan Peh², Natalie Enright Jerger³, Yatin Hoskote⁴ - Show less +1 more•Institutions (4)

Carnegie Mellon University¹, Princeton University², University of Wisconsin-Madison³, Intel⁴

01 Jan 2009-IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

TL;DR: This paper provides a general description of NoC architectures and applications and enumerates several related research problems organized under five main categories: Application characterization, communication paradigm, communication infrastructure, analysis, and solution evaluation.

...read moreread less

Abstract: To alleviate the complex communication problems that arise as the number of on-chip components increases, network-on-chip (NoC) architectures have been recently proposed to replace global interconnects. In this paper, we first provide a general description of NoC architectures and applications. Then, we enumerate several related research problems organized under five main categories: Application characterization, communication paradigm, communication infrastructure, analysis, and solution evaluation. Motivation, problem description, proposed approaches, and open issues are discussed for each problem from system, microarchitecture, and circuit perspectives. Finally, we address the interactions among these research problems and put the NoC design process into perspective.

...read moreread less

733 citations

Proceedings Article•DOI•

A fully-synthesizable single-cycle interconnection network for Shared-L1 processor clusters

[...]

Abbas Rahimi¹, Igor Loi², Mohammad Reza Kakoee², Luca Benini²•Institutions (2)

University of California, San Diego¹, University of Bologna²

14 Mar 2011

TL;DR: A parametric, fully combinational Mesh-of-Trees (MoT) interconnection network to support high-performance, single-cycle communication between processors and memories in L1-coupled processor clusters is designed.

...read moreread less

Abstract: Shared L1 memory is an interesting architectural option for building tightly-coupled multi-core processor clusters. We designed a parametric, fully combinational Mesh-of-Trees (MoT) interconnection network to support high-performance, single-cycle communication between processors and memories in L1-coupled processor clusters. Our interconnect IP is described in synthesizable RTL and it is coupled with a design automation strategy mixing advanced synthesis and physical optimization to achieve optimal delay, power, area (DPA) under a wide range of design constraints. We explore DPA for a large set of network configurations in 65nm technology. Post placementr when the number of both processors and memories is increased by a factor of 4, the delay increases almost logarithmically, to 84FO4, confirming scalability across a significant range of configurations. DPA tradeoff flexibility is also promising: in comparison to the maxperformance 16×32 configuration, there is potential to save power and area by 45% and 12 % respectively, at the expense of 30% performance degradation.

...read moreread less

129 citations

Journal Article•DOI•

The Chip Is the Network: Toward a Science of Network-on-Chip Design

[...]

Radu Marculescu¹, Paul Bogdan¹•Institutions (1)

Carnegie Mellon University¹

01 Jan 2007-Foundations and Trends in Electronic Design Automation

TL;DR: This survey addresses the concept of network in three different contexts representing the deterministic, probabilistic, and statistical physics-inspired design paradigms by considering the natural representation of networks as graphs.

...read moreread less

Abstract: The Chip Is the Network: Towards a Science of Network-on-Chip Design reviews the major design methodologies that have had a profound effect on designing future Network-on-Chip (NoC) architectures. More precisely, it addresses the problem of NoC design in the deterministic context, where the application and the architecture are modeled as graphs with worst-case type of information about the parameters of the components influencing the network traffic. Rather than simply enumerating the proposed approaches, it takes a formal approach and also discusses the main features of each proposed solution. It then goes one step further by considering the design of NoCs with partial information available (primarily under the Markovian assumption) about the application and the architecture. Similarly to the deterministic context, it discusses various probabilistic approaches to NoC design and points out their advantages and limitations. Last, but not least, it looks at emerging approaches inspired from statistical physics and information theory. The formal approach adopted means the network concept is addressed in the most general context, pointing out the main limitations of the proposed solutions, and suggesting a few open-ended problems. The Chip Is the Network: Towards a Science of Network-on-Chip Design is an invaluable reference for the NoC research community and, indeed anyone from CAD/VLSI academe or industry with an interest in this emerging paradigm.

...read moreread less

95 citations

Proceedings Article•DOI•

Homogeneous NoC-based FPGA: The Foundation for Virtual FPGA

[...]

Jie Yang¹, Like Yan¹, Lihan Ju¹, Yuan Wen¹, Shaobin Zhang¹, Tianzhou Chen¹ - Show less +2 more•Institutions (1)

Zhejiang University¹

29 Jun 2010

TL;DR: A homogeneous NoC-based FPGA architecture is proposed, in which reconfigurable and I/O resources are interconnected via NoC so that reconfigured modules can be placed anywhere once enough space available.

...read moreread less

Abstract: Reconfigurable computing based on FPGAs (Field Programmable Gate Arrays) has been a promising solution to improve the performance with high flexibility. However, the physical capacity limitation of FPGAs prevents its wide adoption in real world. In this paper, a homogeneous NoC-based FPGA architecture is proposed, in which reconfigurable and I/O resources are interconnected via NoC so that reconfigurable modules can be placed anywhere once enough space available. Meanwhile, a virtual FPGA is proposed with which over large circuit can be implemented on a limited capacity FPGA. The experiment verified that our approach can provide more flexible reconfiguration, and combing NOC on FPGA, the resource utilization increased within 44.7%-53.5% because of the fragment in CRs benefit from such kind of dynamic partial configuration.

...read moreread less

64 citations

Patent•

Real time on-chip texture decompression using shader processors

[...]

Konstantine Iourcha¹•Institutions (1)

Advanced Micro Devices¹

17 Jun 2011

TL;DR: In this paper, a table is utilized by the cache to locate individual blocks within the compressed texture and a decompressing shader processor receives compressed blocks and then performs on-the-fly decompression of the blocks.

...read moreread less

Abstract: A processing unit, method, and medium for decompressing or generating textures within a graphics processing unit (GPU). The textures are compressed with a variable-rate compression scheme such as JPEG. The compressed textures are retrieved from system memory and transferred to local cache memory on the GPU without first being decompressed. A table is utilized by the cache to locate individual blocks within the compressed texture. A decompressing shader processor receives compressed blocks and then performs on-the-fly decompression of the blocks. The decompressed blocks are then processed as usual by a texture consuming shader processor of the GPU.

...read moreread less

30 citations

1
2
3
4
…
5
6
7
8
9
10
11

Collapse