Showing papers by "Yao-Wen Chang published in 2007"

PDF

Open Access

Journal Article•DOI•

Placement of defect-tolerant digital microfluidic biochips using the T-tree formulation

[...]

Ping-Hung Yuh¹, Chia-Lin Yang¹, Yao-Wen Chang¹•Institutions (1)

01 Nov 2007-ACM Journal on Emerging Technologies in Computing Systems

TL;DR: This work forms the placement problem of digital microfluidic biochips with a tree-based topological representation, called T-tree, and considers the defect tolerant issue to avoid to use defective cells due to fabrication.

...read moreread less

Abstract: Droplet-based microfluidic biochips have recently gained much attention and are expected to revolutionize the biological laboratory procedures. As biochips are adopted for the complex procedures in molecular biology, its complexity is expected to increase due to the need of multiple and concurrent assays on a chip. In this article, we formulate the placement problem of digital microfluidic biochips with a tree-based topological representation, called T-tree. To the best knowledge of the authors, this is the first work that adopts a topological representation to solve the placement problem of digital microfluidic biochips. We also consider the defect tolerant issue to avoid to use defective cells due to fabrication. Experimental results demonstrate that our approach is more efficient and effective than the previous unified synthesis and placement framework.

...read moreread less

99 citations

Proceedings Article•DOI•

Recent Research and Emerging Challenges in Physical Design for Manufacturability/Reliability

[...]

Chung-Wei Lin¹, Ming-Chao Tsai², Kuang-Yao Lee², Tai-Chen Chen¹, Ting-Chi Wang², Yao-Wen Chang¹ - Show less +2 more•Institutions (2)

National Taiwan University¹, National Tsing Hua University²

23 Jan 2007

TL;DR: Major challenges arising from nanometer process technology are introduced, key existing techniques for handling the challenges are surveyed, and some future research directions in physical design for manufacturability and reliability are provided.

...read moreread less

Abstract: As IC process geometries scale down to the nanometer territory, the industry faces severe challenges of manufacturing limitations. To guarantee yield and reliability, physical design for manufacturability and reliability has played a pivotal role in resolution and thus yield enhancement for the imperfect manufacturing process. In this paper, we introduce major challenges arising from nanometer process technology, survey key existing techniques for handling the challenges, and provide some future research directions in physical design for manufacturability and reliability.

...read moreread less

80 citations

Proceedings Article•DOI•

BioRoute: a network-flow based routing algorithm for digital microfluidic biochips

[...]

Ping-Hung Yuh¹, Chia-Lin Yang¹, Yao-Wen Chang¹•Institutions (1)

National Taiwan University¹

05 Nov 2007

TL;DR: This paper introduces the first network-flow based routing algorithm that can concurrently route a set of non-interfering nets for the droplet routing problem on biochips and presents the first polynomial-time algorithm for simultaneous routing and scheduling using the global-routing paths with a negotiation-based routing scheme.

...read moreread less

Abstract: Due to the recent advances in microfluidics, digital microfluidic biochips are expected to revolutionize laboratory procedures. One critical problem for biochip synthesis is the droplet routing problem. Unlike traditional VLSI routing problems, in addition to routing path selection, the biochip routing problem needs to address the issue of scheduling droplets under the practical constraints imposed by the fluidic property and the timing restriction of the synthesis result. In this paper, we present the first network-flow based routing algorithm that can concurrently route a set of non-interfering nets for the droplet routing problem on biochips. We adopt a two-stage technique of global routing followed by detailed routing. In global routing, we first identify a set of non-interfering nets and then adopt the network-flow approach to generate optimal global-routing paths for the nets. In detailed routing, we present the first polynomial-time algorithm for simultaneous routing and scheduling using the global-routing paths with a negotiation-based routing scheme. The experimental results show the robustness and efficiency of our algorithm.

...read moreread less

70 citations

Journal Article•DOI•

A Network-Flow-Based RDL Routing Algorithmz for Flip-Chip Design

[...]

Jia-Wei Fang¹, I-Jye Lin¹, Yao-Wen Chang¹, Jyh-Herng Wang²•Institutions (2)

National Taiwan University¹, Apache Corporation²

01 Aug 2007-IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

TL;DR: This paper proposes the first router for the flip-chip package in the literature that adopts a two-stage technique of global routing followed by detailed routing, and uses the network flow algorithm to solve the assignment problem from the wire-bonding pads to the bump pads and then create the global path for each net.

...read moreread less

Abstract: The flip-chip package gives the highest chip density of any packaging method to support the pad-limited application-specific integrated circuit designs. In this paper, we propose the first router for the flip-chip package in the literature. The router can redistribute nets from wire-bonding pads to bump pads and then route each of them. The router adopts a two-stage technique of global routing followed by detailed routing. In global routing, we use the network flow algorithm to solve the assignment problem from the wire-bonding pads to the bump pads and then create the global path for each net. The detailed routing consists of three stages, namely: 1) cross-point assignment; 2) net ordering determination; and 3) track assignment, to complete the routing. Experimental results based on seven real designs from the industry demonstrate that the router can reduce the total wirelength by 10.2%, the critical wirelength by 13.4%, and the signal skews by 13.9%, as compared with a heuristic algorithm currently used in industry.

...read moreread less

67 citations

Proceedings Article•DOI•

An integer linear programming based routing algorithm for flip-chip design

[...]

Jia-Wei Fang¹, Chin-Hsiung Hsu¹, Yao-Wen Chang¹•Institutions (1)

National Taiwan University¹

04 Jun 2007

TL;DR: Experimental results based on five real industry designs show that the first routing algorithm in the literature for the pre-assignment flip-chip routing problem with a pre-defined netlist among pads and wire-width and signal-skew considerations can achieve 100% routability and the optimal global-routing wirelength.

...read moreread less

Abstract: The flip-chip package provides a high chip-density solution to the demand for more I/O pads of VLSI designs. In this paper, we present the first routing algorithm in the literature for the pre-assignment flip-chip routing problem with a pre-defined netlist among pads and wire-width and signal-skew considerations. Our algorithm is based on integer linear programming (ILP) and guarantees to find an optimal solution for the addressed problem. It adopts a two-stage technique of global routing followed by detailed routing. In global routing, it first uses two reduction techniques to prune redundant solutions and create a global-routing path for each net. Without loss of the solution optimality, our reduction techniques can further prune the ILP variables (constraints) by 85.5% (98.0%) on average over a recent reduction technique. The detailed routing applies X-based grid- less routing to complete the routing. Experimental results based on five real industry designs show that our router can achieve 100% routability and the optimal global-routing wirelength and satisfy all signal-skew constraints, under reasonable CPU times, while recent related work results in much inferior solution quality.

...read moreread less

60 citations

Proceedings Article•DOI•

An ILP algorithm for post-floorplanning voltage-island generation considering power-network planning

[...]

Wan-Ping Lee¹, Hung-Yi Liu¹, Yao-Wen Chang¹•Institutions (1)

National Taiwan University¹

05 Nov 2007

TL;DR: An economical graph-based representation that needs only a linear number of nodes to the block number to model the block adjacency in a floorplan for the voltage-island generation and can produce better voltage islands in terms of power-network routing resources.

...read moreread less

Abstract: Power optimization is a crucial concern for modern circuit designs. Multiple supply voltages (MSV's) provide an effective technique for the power optimization. This paper addresses the voltage-island generation problem for MSV designs at the post-floorplanning stage. We first present a general formulation of this problem that considers level-shifter planning and power-network routing resources. Without loss of solution quality, we propose an economical graph-based representation that needs only a linear number of nodes to the block number to model the block adjacency in a floorplan for the voltage-island generation. In contrast, previous works need a quadratic number of nodes. To tackle the addressed problem, we employ an ILP formulation which consists of (1) level-shifter aware wirelength estimation to capture the timing overhead, (2) voltage-island-clustering inequalities to avoid complicated constraint transformations, and (3) inequalities to capture the power-network routing-resource usage. Compared with previous works, our algorithm can produce better voltage islands in terms of power-network routing resources. Experimental results show that our algorithm can effectively reduce the power-network routing resource by up to 19.46% with a reasonable overhead of 4.03% more power consumption and using reasonable running time.

...read moreread less

58 citations

Proceedings Article•DOI•

Efficient obstacle-avoiding rectilinear steiner tree construction

[...]

Chung-Wei Lin¹, Szu-Yu Chen¹, Chi-Feng Li¹, Yao-Wen Chang¹, Chia-Lin Yang¹ - Show less +1 more•Institutions (1)

National Taiwan University¹

18 Mar 2007

TL;DR: This paper presents an efficient algorithm with some theoretical optimality guarantees for the OARSMT construction, based on the obstacle-avoiding spanning graph (OASG), and guarantees to find an optimal OAR SMT for any 2-pin net and many higher-pin nets.

...read moreread less

Abstract: Given a set of pins and a set of obstacles on a plane, an obstacle-avoiding rectilinear Steiner minimal tree (OARSMT) connects these pins, possibly through some additional points (called Steiner points), and avoids running through any obstacle to construct a tree with a minimal total wirelength. The OARSMT problem becomes more important than ever for modern nanometer IC designs which need to consider numerous routing obstacles incurred from power networks, prerouted nets, IP blocks, feature patterns for manufacturability improvement, antenna jumpers for reliability enhancement, etc. Consequently, the OARSMT problem has received dramatically increasing attention recently. Nevertheless, considering obstacles significantly increases the problem complexity, and thus most previous works suffer from either poor quality or expensive running time. Based on the obstacle-avoiding spanning graph (OASG), this paper presents an efficient algorithm with some theoretical optimality guarantees for the OARSMT construction. Unlike previous heuristics, our algorithm guarantees to find an optimal OARSMT for any 2-pin net and many higher-pin nets. Extensive experiments show that our algorithm results in significantly shorter wirelengths than all state-of-the-art works.

...read moreread less

51 citations

Proceedings Article•DOI•

ECO timing optimization using spare cells

[...]

Yen-Pin Chen¹, Jia-Wei Fang¹, Yao-Wen Chang¹•Institutions (1)

National Taiwan University¹

05 Nov 2007

TL;DR: This paper presents a dynamic programming algorithm considering the dynamic cost, called dynamic cost programming (DCP), for the ECO timing optimization with spare cells, and presents an effective pruning method by selecting spare cells only inside an essential bounding polygon to reduce the solution space.

...read moreread less

Abstract: We introduce in this paper a new problem of ECO timing optimization using spare-cell rewiring and present the first work for this problem. Spare-cell rewiring is a popular technique for incremental timing optimization and/or functional change after the placement stage. The spare-cell rewiring problem is very challenging because of its dynamic wiring cost nature for selecting a spare cell, while the existing related problems consider only static wiring cost. For the addressed problem, we present a framework of buffer insertion and gate sizing to handle it. In this framework, we present a dynamic programming algorithm considering the dynamic cost, called dynamic cost programming (DCP), for the ECO timing optimization with spare cells. Without loss of solution optimality, we further present an effective pruning method by selecting spare cells only inside an essential bounding polygon to reduce the solution space. The whole framework is integrated into a commercial design flow. Experimental results based on five industry benchmarks show that our method is very effective and efficient in fixing the timing violations of ECO paths.

...read moreread less

42 citations

Journal Article•DOI•

Temporal floorplanning using the three-dimensional transitive closure subGraph

[...]

Ping-Hung Yuh¹, Chia-Lin Yang¹, Yao-Wen Chang¹•Institutions (1)

National Taiwan University¹

01 Sep 2007-ACM Transactions on Design Automation of Electronic Systems

TL;DR: A novel graph-based topological floorplan representation, named 3D-subTCG (3-Dimensional Transitive Closure subGraph), is used to deal with the 3-dimensional (temporal) floorplanning/placement problem, arising from dynamically reconfigurable FPGAs.

...read moreread less

Abstract: Improving logic capacity by time-sharing, dynamically reconfigurable Field Gate Programmable Arrays (FPGAs) are employed to handle designs of high complexity and functionality. In this paper, we use a novel graph-based topological floorplan representation, named 3D-subTCG (3-Dimensional Transitive Closure subGraph), to deal with the 3-dimensional (temporal) floorplanning/placement problem, arising from dynamically reconfigurable FPGAs. The 3D-subTCG uses three transitive closure graphs to model the temporal and spatial relations between modules. We derive the feasibility conditions for the precedence constraints induced by the execution of the dynamically reconfigurable FPGAs. Because the geometric relationship is transparent to the 3D-subTCG and its induced operations (i.e., we can directly detect the relationship between any two tasks from the representation), we can easily detect any violation of the temporal precedence constraints on 3D-subTCG. We also derive important properties of the 3D-subTCG to reduce the solution space and shorten the running time for 3D (temporal) foorplanning/placement. Experimental results show that our 3D-subTCG-based algorithm is very effective and efficient.

...read moreread less

25 citations

Proceedings Article•DOI•

MP-trees: a packing-based macro placement algorithm for mixed-size designs

[...]

Tung-Chieh Chen¹, Ping-Hung Yuh¹, Yao-Wen Chang¹, Fwu-Juh Huang², Denny Liu² - Show less +1 more•Institutions (2)

National Taiwan University¹, MediaTek²

04 Jun 2007

TL;DR: Experiments based on the eight ISPD'06 placement contest benchmarks show that the MP-tree-based macro placer combined with Capo 10.2, NTUplace3, or mPL6 for standard-cell placement outperforms these state-of-the-art academic mixed-size placers alone by large margins in both robustness and quality.

...read moreread less

Abstract: In this paper, we present a new multi-packing tree (MP-tree) representation for macro placement to handle mixed-size designs. Based on binary trees, the MP-tree is very efficient, effective, and flexible for handling macro placement with various constraints. Given a global placement, our MP-tree-based macro placer optimizes macro positions, minimizes the macro displacement from the initial macro positions, and maximizes the area of the chip center for standard-cell placement and routing. Experiments based on the eight ISPD'06 placement contest benchmarks show that our macro placer combined with Capo 10.2, NTUplace3, or mPL6 for standard-cell placement outperforms these state-of-the-art academic mixed-size placers alone by large margins in both robustness and quality. In addition to wirelength, experimented on five real industrial designs show that our method significantly reduce the average HPWL by 35%, the average routed wirelength by 55%, and the routing overflows than the counterpart with Capo 10.2, implying that our macro placer leads to much higher routability.

...read moreread less

25 citations

Proceedings Article•DOI•

A provably good approximation algorithm for power optimization using multiple supply voltages

[...]

Hung-Yi Liu¹, Wan-Ping Lee¹, Yao-Wen Chang¹•Institutions (1)

National Taiwan University¹

04 Jun 2007

TL;DR: This paper addresses a voltage partitioning problem arising in MSV design during high-level synthesis and proposes an efficient alpha2-approximation algorithm for the problem, where a is the constant ratio of the maximum to the minimum voltages.

...read moreread less

Abstract: Multiple supply voltages (MSV's) provide an effective technique for power optimization. This paper addresses a voltage partitioning problem arising in MSV design during high-level synthesis. We point out a theoretical mistake in a recent publication and prove that the partitioning problem is NP-hard. Despite its NP-hardness, we propose an efficient alpha2-approximation algorithm for the problem, where a is the constant ratio of the maximum to the minimum voltages. Compared with the previous work that runs in O(dn2) time, the time complexity of our algorithm is only O(dkn), where d, k, and n are respectively the numbers of voltages employed in the final designs (i.e., voltage domains), available supply voltages in the technology library, and functional units. Note that both d and k can be considered as small constants for practical applications. Experimental results show that our algorithm can achieve 36-255X run-time speedups than the recent work, with the same power reduction.

...read moreread less

Journal Article•DOI•

Multilevel Full-Chip Gridless Routing With Applications to Optical-Proximity Correction

[...]

Tai-Chen Chen¹, Yao-Wen Chang¹•Institutions (1)

National Taiwan University¹

01 Jun 2007-IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

TL;DR: This paper introduces a gridless-routing model that can obtain design-rule-correct paths and avoid redundant wires, and presents the first multilevel full-chip gridless detailed router (called MGR), which integrates global routing, detailed routing, and congestion estimation together at each level of multileVEL routing.

...read moreread less

Abstract: To handle modern routing with nanometer effects, we need to consider designs with variable wire/via widths and spacings, for which gridless-routing approaches are desirable due to its great flexibility. In this paper, we introduce a gridless-routing model that can obtain design-rule-correct paths and avoid redundant wires. Besides, we propose an enhanced model for the gridless-routing model to reduce the solution space and the runtime. Based on the enhanced gridless-routing model, we present the first multilevel full-chip gridless detailed router (called MGR). The router integrates global routing, detailed routing, and congestion estimation together at each level of multilevel routing. It can handle designs with nonuniform wire/via widths and spacings and consider routability and optical-proximity correction. Experimental results show that MGR achieves the best routing solutions in smaller running times than previous works, based on a set of commonly used benchmarks (with uniform and nonuniform wire widths) and a set of real industrial benchmarks (with a versatile set of design rules)

...read moreread less

Journal Article•DOI•

Power/Ground Network and Floorplan Cosynthesis for Fast Design Convergence

[...]

Chen-Wei Liu¹, Yao-Wen Chang²•Institutions (2)

Synopsys¹, National Taiwan University²

01 Apr 2007-IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

TL;DR: The design methodology successfully fixes the IR-drop errors earlier at the floorplanning stage and, thus, enables the single-pass design convergence of the power/ground (P/G) network cosynthesis.

...read moreread less

Abstract: As technology advances, the metal width decreases while the global wire length increases. This trend makes the resistance of the power wire increase substantially. Furthermore, the threshold voltage scales nonlinearly, raising the ratio of the threshold voltage to the supply voltage and making the voltage (IR) drop in the power/ground (P/G) network a serious problem in modern IC design. Traditional P/G network-analysis methods are often very computationally expensive, and it is, thus, not feasible to cosynthesize P/G network with floorplan. To make the cosynthesis feasible, we need not only an efficient, effective, and flexible floorplanning algorithm but also a very efficient yet sufficiently accurate P/G network-analysis method. In this paper, we present a method for floorplan and P/G network cosynthesis based on an efficient P/G network-analysis scheme and the B*-tree floorplan representation. We integrate the cosynthesis into a commercial design flow to develop an effective power-integrity (IR drop)-driven design methodology. Experimental results based on a real-world circuit design and the MCNC benchmarks show that our design methodology successfully fixes the IR-drop errors earlier at the floorplanning stage and, thus, enables the single-pass design convergence

...read moreread less

Proceedings Article•DOI•

Efficient multi-layer obstacle-avoiding rectilinear Steiner tree construction

[...]

Chung-Wei Lin¹, Shih-Lun Huang¹, Kai-Chi Hsu¹, Meng-Xiang Li¹, Yao-Wen Chang¹ - Show less +1 more•Institutions (1)

National Taiwan University¹

05 Nov 2007

TL;DR: The first algorithm to solve the ML-OARSMT problem is presented, based on the multilayer obstacle-avoiding spanning graph (ML-OASG), which can guarantee an optimal solution for any 2-pin net and many higher-pin nets.

...read moreread less

Abstract: Given a set of pins and a set of obstacles on routing layers, a multi-layer obstacle-avoiding rectilinear Steiner minimal tree (ML-OARSMT) connects these pins by rectilinear edges within layers and vias between layers, and avoids running through any obstacle to construct a Steiner tree with a minimal total cost. The ML-OARSMT problem is very important for many VLSI designs with pins being located in multiple routing layers that contain numerous routing obstacles incurred from IP blocks, power networks, prerouted nets, etc. Therefore, it is desired to develop an effective algorithm for the ML-OARSMT problem. However, there is no existing work on this ML-OARSMT problem. In this paper, we first formulate the ML-OARSMT problem and identify key different properties of the problem from its single-layer counterpart. Based on the multilayer obstacle-avoiding spanning graph (ML-OASG), we present the first algorithm to solve the ML-OARSMT problem. Our algorithm can guarantee an optimal solution for any 2-pin net and many higher-pin nets. Experiments show that our algorithm results in 33% smaller total costs on average than a construction-by-correction heuristic which is widely used for Steiner-tree construction in the recent literature.

...read moreread less

Proceedings Article•DOI•

Statistical circuit optimization considering device andinterconnect process variations

[...]

I-Jye Lin¹, Tsui-Yee Ling¹, Yao-Wen Chang¹•Institutions (1)

National Taiwan University¹

17 Mar 2007

TL;DR: This work presents the first work to use statistical methods to optimize the circuit area under timing, thermal and power constraints by gate and interconnect sizing optimization by model the problem as a second-order conic program and solve it with the interior-point method.

...read moreread less

Abstract: Process Variation has become a crucial challenge on both interconnect delay and reliability of nanometer integrated circuit designs. Furthermore, the dramatic increase of power consumption and integration density has led to high operating temperature. Temperature, as well as electromigration (EM) and power, also significantly affects the delay and reliability of interconnects. Considering process variation, we present the first work to use statistical methods to optimize the circuit area under timing, thermal and power constraints by gate and interconnect sizing optimization. We model the problem as a second-order conic program and solve it with the interior-point method. Experimental results show that our statistical algorithm can find desired solutions that satisfy all delay, power, and thermal constraints. Our statistical algorithm on average improves the circuit areas by respective 51.12%, 39.21%, and 25.60% with 70%, 84.1%, and 99.9% yields after wire and gate sizing.

...read moreread less

Patent•

V-shaped multilevel full-chip gridless routing

[...]

Shyh-Chang Lin, Tai-Chen Chen, Yao-Wen Chang¹, Feng-Yuan Chang•Institutions (1)

Synopsys¹

05 Mar 2007

TL;DR: In this article, a router selects routes for nets interconnecting terminals of circuit devices within an area of an IC by organizing the IC area into an array of global routing cells (GRCs) and generates a congestion map providing a separate congestion factor for each GRC boundary.

...read moreread less

Abstract: A router selects routes for nets interconnecting terminals of circuit devices within an area of an IC. The router organizes the IC area into an array of global routing cells (GRCs) and generates a congestion map providing a separate congestion factor for each GRC boundary that is a probabilistic measure of an estimated percentage of a capacity of the GRC boundary that will be occupied by nets when all nets have been routed. The router then iteratively partitions the IC area into progressively smaller tiles until the tiles reach a predetermined minimum size. Between partitioning iterations, the router selects a route for each net passing between tiles when possible to do so without altering any previously routed net. The router thereafter iteratively merges the tiles into progressively larger tiles. Between merging iterations, the router selects a route for each previously unrouted net residing wholly within a single tile, altering routes of previously routed nets when necessary to accommodate the selected route. When selecting a route for any connection of a net, the router seeks to minimize a cost function of congestion factors of all GRC boundaries and then modifies the congestion map to reflect changes in routing probabilities occasioned by the route selection before choosing a route for any other connection.

...read moreread less

Journal Article•DOI•

Profiling of Nitride-Trap-Energy Distribution in SONOS Flash Memory by Using a Variable-Amplitude Low-Frequency Charge-Pumping Technique

[...]

Yi-Ying Liao¹, Sheng-fu Horng¹, Yao-Wen Chang, Tao-Cheng Lu, Kuang-Chao Chen, Tahui Wang, Chih-Yuan Lu - Show less +3 more•Institutions (1)

National Tsing Hua University¹

27 Aug 2007-IEEE Electron Device Letters

TL;DR: In this article, a variable-amplitude low-frequency charge-pumping technique is proposed to characterize the nitride-trap energy and spatial distributions in SONOS Flash memory cells.

...read moreread less

Abstract: A variable-amplitude low-frequency charge-pumping technique is proposed to characterize the nitride-trap energy and spatial distributions in SONOS Flash memory cells. A numerical model based on Shockley-Read-Hall-like electron tunneling capture is used to correlate a charge-pumping current with the nitride-trap energy and position. By changing the frequency and pulse amplitude in charge-pumping measurement, a nitride-trap density, as a function of the trap position and energy, can be extracted.

...read moreread less

Journal Article•DOI•

An Optimal Jumper-Insertion Algorithm for Antenna Avoidance/Fixing

[...]

Bor-Yiing Su¹, Yao-Wen Chang¹•Institutions (1)

National Taiwan University¹

01 Oct 2007-IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

TL;DR: This paper forms an O(V)-time optimal jumper-insertion algorithm that uses the minimum number of jumpers to avoid/fix the antenna violations in a routing tree with vertices and presents the first optimal algorithm for the tree-cutting problem.

...read moreread less

Abstract: As the process technology enters the nanometer era, reliability has become a major concern in the design and manufacturing of very large-scale integration circuits. In this paper, we focus on one reliability issue-jumper insertion in routing trees for avoiding/fixing antenna-effect violations at the routing/postlayout stages. We formulate the jumper insertion for antenna avoidance/fixing as a tree-cutting problem and present the first optimal algorithm for the tree-cutting problem. We show that the tree-cutting problem exhibits the properties of optimal substructures and greedy choices. With these properties, we present an O(V)-time optimal jumper-insertion algorithm that uses the minimum number of jumpers to avoid/fix the antenna violations in a routing tree with vertices. Experimental results show the superior effectiveness and efficiency of our algorithm.

...read moreread less

Journal Article•DOI•

Multilevel Full-Chip Routing With Testability and Yield Enhancement

[...]

K. Shu-Min Li¹, Yao-Wen Chang², Chung Len Lee³, Chauchin Su³, Jwu E. Chen⁴ - Show less +1 more•Institutions (4)

National Sun Yat-sen University¹, National Taiwan University², National Chiao Tung University³, National Central University⁴

01 Sep 2007-IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

TL;DR: A multilevel full-chip routing algorithm that improves testability and diagnosability, manufacturability, and signal integrity for yield enhancement, and a heuristic to reduce routing congestion to optimize the multiple-fault probability, chemical-mechanical polishing- and optical proximity correction-induced manufacturality, and crosstalk effects, for yield improvement.

...read moreread less

Abstract: We propose a multilevel full-chip routing algorithm that improves testability and diagnosability, manufacturability, and signal integrity for yield enhancement. Two major issues are addressed. 1) The oscillation ring test (ORT) and its diagnosis scheme for interconnects based on the popular IEEE Standard 1500 are integrated into the multilevel routing framework to achieve testability enhancement. We augment the traditional multilevel framework of coarsening and uncoarsening by introducing a preprocessing stage that analyzes the interconnect structure for better resource estimation before the coarsening stage, and a final stage after uncoarsening that improves testability to achieve 100% interconnect fault coverage and maximal diagnosability. 2) We present a heuristic to reduce routing congestion to optimize the multiple-fault probability, chemical-mechanical polishing- and optical proximity correction-induced manufacturability, and crosstalk effects, for yield improvement. Experimental results on the Microelectronics Center for North Carolina benchmark circuits show that the proposed ORT method achieves 100% fault coverage and the optimal diagnosis resolution for interconnects. Further, the multilevel routing algorithm effectively balances the routing density to achieve 100% routing completion.

...read moreread less

Proceedings Article•DOI•

Challenges and solutions in modern analog placement

[...]

Zhe-Wei Jiang¹, Hsin-Chen Chen¹, Tung-Chieh Chen¹, Yao-Wen Chang¹•Institutions (1)

National Taiwan University¹

25 Apr 2007

TL;DR: This paper introduces a placer that can explore multiple placements under position constraints so that a designer can analyze the trade-off among different objectives and provides some future research directions for the modern analog placement problem.

...read moreread less

Abstract: The VLSI placement problem is to place objects into a fixed die such that there are no overlaps among objects and some cost metric (e.g., wirelength, routability) is optimized. It is a major step in physical design that has been studied for decades. However, modern VLSI design challenges have reshaped the placement problem. A modern placer needs to handle large-scale designs with millions of objects, heterogeneous objects with very different sizes, and various complex placement constraints such as preplaced blocks and chip density. In this paper, we first introduce the major techniques employed in our placer for tackling the large-scale mixed-size designs and the aforementioned constraints, and then provide some future research directions for the modern placement problem.

...read moreread less

Book•

Full-chip nanometer routing techniques

[...]

Tsung-Yi Ho, Yao-Wen Chang, Sao-Jie Chen

01 Jan 2007

TL;DR: This book presents a novel multilevel full-chip router, namely mSIGMA for SIGnal-integrity and MAnufacturability optimization, that combines high performance and capacity with the integration of congestion, timing, SI prevention, and DFM algorithms as the best means of getting to design closure quickly.

...read moreread less

Abstract: At 90 nm, wires account for nearly 75% of the total delay in a circuit. Even more insidious, however, is that among nearly 40% of these nets, more than 50% of their total net capacitance are attributed to the cross-coupling capacitance between neighboring signals. At this point a new design and optimization paradigm based on real wires is required. Nanometer routers must prevent and correct these effects on-the-fly in order to reach timing closure. From a manufacturability standpoint, nanometer routers must explicitly deal with the ever increasing design complexity, and be capable of adapting to the constraint requirements of timing, signal integrity, process antenna effect, and new interconnect architecture such as X-architecture. In the nanometer era, we must look into new-generation routing technologies that combine high performance and capacity with the integration of congestion, timing, SI prevention, and DFM algorithms as the best means of getting to design closure quickly. In this book, we present a novel multilevel full-chip router, namely mSIGMA for SIGnal-integrity and MAnufacturability optimization. And these routing technologies will ensure faster time-to-market and time-to-profitability.

...read moreread less

Journal Article•DOI•

MB $^{\ast}$ -Tree: A Multilevel Floorplanner for Large-Scale Building-Module Design

[...]

Hsun-Cheng Lee¹, Yao-Wen Chang², Hannah Honghua Yang³•Institutions (3)

Synopsys¹, National Taiwan University², Intel³

01 Aug 2007-IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

TL;DR: The MB- tree preserves the geometric relations among modules during declustering, which makes the MB-tree an ideal data structure for the multilevel floorplanning/placement framework.

...read moreread less

Abstract: In this paper, we present an agglomeratively multilevel floorplanning/placement framework based on the B -tree representation called MB- tree to handle the floorplanning and packing for large-scale building modules. The MB-tree adopts a two-stage technique, i.e., clustering followed by declustering. The clustering stage iteratively groups a set of modules based on a cost metric guided by area utilization and module connectivity and at the same time establishes the geometric relations for the newly clustered modules by constructing a corresponding B -tree for them. The declustering stage iteratively ungroups a set of the previously clustered modules (i.e., perform tree expansion) and then refines the floorplanning/placement solution by using a simulated annealing scheme. In particular, the MB-tree preserves the geometric relations among modules during declustering, which makes the MB-tree an ideal data structure for the multilevel floorplanning/placement framework. Experimental results show that the MB-tree obtains significantly better silicon area and wirelength than previous works. Further, unlike previous works, the MB-tree scales very well as the circuit size increases.

...read moreread less

Proceedings Article•DOI•

Post-placement leakage optimization for partially dynamically reconfigurable FPGAs

[...]

Chi-Feng Li¹, Ping-Hung Yuh¹, Chia-Lin Yang¹, Yao-Wen Chang¹•Institutions (1)

National Taiwan University¹

27 Aug 2007

TL;DR: A post-placement leakage-aware scheduling algorithm is proposed that refines a placement generated by a performance-driven scheduler such that leakage waste is minimized and performance is not sacrificed.

...read moreread less

Abstract: As technology continues to shrink, leakage power becomes an important issue for modern FPGAs. In this paper, we address the leakage issue of partially dynamical reconfigurable FPGAs. We focus on eliminating leakage waste due to the delay between reconfiguration and task execution. We propose a post-placement leakage-aware scheduling algorithm that refines a placement generated by a performance-driven scheduler such that leakage waste is minimized and performance is not sacrificed. Experimental results on real and synthetic designs demonstrate the effectiveness and efficiency of our algorithm on leakage optimization.

...read moreread less

Book Chapter•DOI•

NTUplace3: An Analytical Placer for Large-Scale Mixed-Size Designs

[...]

Tung-Chieh Chen, Zhe-Wei Jiang, Tien-Chang Hsu, Hsin-Chen Chen¹, Yao-Wen Chang¹ - Show less +1 more•Institutions (1)

National Taiwan University¹

01 Jan 2007

Journal Article•DOI•

An Exact Jumper-Insertion Algorithm for Antenna Violation Avoidance/Fixing Considering Routing Obstacles

[...]

Bor-Yiing Su¹, Yao-Wen Chang, Jiang Hu•Institutions (1)

National Taiwan University¹

01 Apr 2007-IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

TL;DR: This paper forms the first optimal algorithm for the general tree-cutting problem and presents an O((V+D)lgD)-time optimal jumper-insertion algorithm that uses the least number of jumpers to avoid/fix the antenna violations on a Steiner/spanning tree with V vertices and D obstacles.

...read moreread less

Abstract: We study in this paper the problem of jumper insertion on general routing (Steiner/spanning) trees with obstacles for antenna avoidance/fixing at the routing and/or postlayout stages. We formulate the jumper insertion for antenna avoidance/fixing as a tree-cutting problem and present the first optimal algorithm for the general tree-cutting problem. We show that the tree-cutting problem exhibits the properties of optimal substructures and greedy choices. With these properties, we present an O((V+D)lgD)-time optimal jumper-insertion algorithm that uses the least number of jumpers to avoid/fix the antenna violations on a Steiner/spanning tree with V vertices and D obstacles. Experimental results show the superior effectiveness and efficiency of our algorithm

...read moreread less

Proceedings Article•DOI•

Characterization and Monte Carlo Analysis of Secondary Electrons Induced Program Disturb in a Buried Diffusion Bit-line SONOS Flash Memory

[...]

Chun-Jung Tang¹, C.W. Li¹, Tahui Wang¹, S.H. Gu, P.C. Chen, Yao-Wen Chang, T.C. Lu, W.P. Lu, Kuang-Chao Chen, Chih-Yuan Lu - Show less +6 more•Institutions (1)

National Chiao Tung University¹

01 Dec 2007

TL;DR: In this article, a program disturb in a buried diffusion bit-line SONOS array is observed as a bitline width is reduced, and a multi-step Monte Carlo simulation is performed to explore the disturb mechanism.

...read moreread less

Abstract: A new program disturb in a buried diffusion bit-line SONOS array is observed as a bit-line width is reduced. A multi-step Monte Carlo simulation is performed to explore the disturb mechanism. We find that the Vt shift of a disturbed cell is attributed to impact ionization-generated secondary electrons in a neighboring cell when it is in programming. The effects of substrate bias, bit-line dimension and pocket implant on the program disturb are characterized and evaluated by a Monte Carlo simulation.

...read moreread less

Proceedings Article•DOI•

A New Interference Phenomenon in Sub-60nm Nitride-Based Flash Memory

[...]

Yao-Wen Chang, G. Wu, P.C. Chen, C.H. Chen, I.C. Yang, C.Y. Chin, I.J. Huang, W.J. Tsai, T.C. Lu, W.P. Lu, K.C. Chen, C.Y. Lu - Show less +8 more

04 Sep 2007

TL;DR: It is the first time to disclose that the similar interference from adjacent wordlines as found in floating-gate flash memory also exists in nitride-based flash memory.

...read moreread less

Abstract: It is the first time to disclose that the similar interference from adjacent wordlines as found in floating-gate flash memory also exists in nitride-based flash memory. For sub-60nm nitride-based flash technologies, this interference effect cannot be ignored any more and should be well taken into consideration when defining the operation window of the memory products.

...read moreread less

Proceedings Article•DOI•

X-Route: An X-architecture full-chip multilevel router

[...]

Chen-Feng Chang¹, Yao-Wen Chang²•Institutions (2)

Synopsys¹, National Taiwan University²

01 Sep 2007

TL;DR: A progressive X-Steiner tree algorithm based on the delaunay triangulation approach for the X-architecture is developed, which reduces the respective wirelength and average delay by about 14.05% and 30.62%, respectively.

...read moreread less

Abstract: In this paper, we present an X-architecture multilevel full-chip router, called X-Route. Unlike the traditional Λ-shaped multilevel framework that adopts bottom-up coarsening followed by top-down uncoarsening, our multilevel framework runs in the Vshaped manner: top-down uncoarsening followed by bottom-up coarsening. The top-down uncoarsening stage performs octagonal global routing and X-detailed routing for local nets at each level and then refines the solution for the next level. Then, the bottom-up coarsening stage performs the X-detailed routing to reroute failed nets and refines the solution level by level. Since we perform top-down routing first, global long nets are routed earlier. To prevent a wrong decision from blocking the later nets, we keep a dynamic congestion map that records the updated routing congestion information based on the routed nets and the global-path prediction of the unrouted nets. To take full advantage of the X-architecture, we also develop a progressive X-Steiner tree algorithm based on the delaunay triangulation approach for the X-architecture. Compared with the state-of-the-art Λ-shaped multilevel routing for the X-architecture, experimental results show that our X-Route reduces the respective wirelength and average delay by about 14.05% and 30.62%, with better routing completion.

...read moreread less

Proceedings Article•DOI•

An efficient algorithm for statistical circuit optimization using Lagrangian relaxation

[...]

I-Jye Lin¹, Yao-Wen Chang¹•Institutions (1)

National Taiwan University¹

05 Nov 2007

TL;DR: This paper presents an effective and efficient alternative for multi-constrained statistical circuit optimization by both gate and wire sizing using Lagrangian relaxation (LR), and shows that the LR-based algorithm can achieve much better solution quality by reducing 21% area and obtain 560X speed-up over SOCP.

...read moreread less

Abstract: Due to the technology scaling down, process variation has become a crucial challenge on both interconnect delay and reliability. To handle the process variation, statistical optimization has emerged as a popular technique for yield improvement. As a relatively new technique, second-order conic programming (SOCP) has recently attracted very much attention in the literature for statistical circuit optimization. However, we observe significant limitations of SOCP in its flexibility, accuracy, and scalability for statistical circuit optimization, especially when interconnects are considered. We thus present in this paper an effective and efficient alternative for multi-constrained statistical circuit optimization by both gate and wire sizing using Lagrangian relaxation (LR). Compared with SOCP, experimental results show that our LR-based algorithm can achieve much better solution quality by reducing 21% area and obtain 560X speed-up over SOCP.

...read moreread less

Book Chapter•DOI•

Multilevel Full-Chip Routing Considering Crosstalk And Performance

[...]

Tsung-Yi Ho, Yao-Wen Chang, Sao-Jie Chen

01 Jan 2007