Author

Martti Forsell

Other affiliations: VTT Technical Research Centre of Finland

Bio: Martti Forsell is an academic researcher at the University of Eastern Finland. The author has contributed to research on the topics of shared memory and threads (computing). The author has an h-index of 8 and has co-authored 49 publications, which have received 1,523 citations. Previous affiliations of Martti Forsell include VTT Technical Research Centre of Finland.

Papers

Proceedings Article•DOI•

A network on chip architecture and design methodology

Shashi Kumar, Axel Jantsch, Juha-Pekka Soininen, Martti Forsell, Mikael Millberg¹, Johnny Öberg¹, Kari Tiensyrjä¹, Ahmed Hemani¹•Institutions (1)

Royal Institute of Technology¹

07 Aug 2002

TL;DR: A packet-switched platform for single-chip systems that scales well to an arbitrary number of processor-like resources; its architecture is essentially the on-chip communication infrastructure comprising the physical layer, the data link layer and the network layer of the OSI protocol stack.

Abstract: We propose a packet switched platform for single chip systems which scales well to an arbitrary number of processor like resources. The platform, which we call Network-on-Chip (NOC), includes both the architecture and the design methodology. The NOC architecture is an m × n mesh of switches and resources are placed on the slots formed by the switches. We assume a direct layout of the 2-D mesh of switches and resources providing physical- and architectural-level design integration. Each switch is connected to one resource and four neighboring switches, and each resource is connected to one switch. A resource can be a processor core, memory, an FPGA, a custom hardware block or any other intellectual property (IP) block, which fits into the available slot and complies with the interface of the NOC. The NOC architecture essentially is the onchip communication infrastructure comprising the physical layer, the data link layer and the network layer of the OSI protocol stack. We define the concept of a region, which occupies an area of any number of resources and switches. This concept allows the NOC to accommodate large resources such as large memory banks, FPGA areas, or special purpose computation resources such as high performance multi-processors. The NOC design methodology consists of two phases. In the first phase a concrete architecture is derived from the general NOC template. The concrete architecture defines the number of switches and shape of the network, the kind and shape of regions and the number and kind of resources. The second phase maps the application onto the concrete architecture to form a concrete product.
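The mesh topology described in this abstract can be illustrated with a minimal sketch. It is not taken from the paper: the function names `build_mesh` and `attach_resources` and the resource naming scheme are hypothetical, and the sketch assumes a plain (non-wraparound) mesh, so switches on the border have fewer than four neighboring switches.

```python
from typing import Dict, List, Tuple

Coord = Tuple[int, int]  # (row, column) of a switch in the mesh


def build_mesh(m: int, n: int) -> Dict[Coord, List[Coord]]:
    """Return switch-to-switch adjacency for an m x n mesh.

    Assumption: no wraparound links, so border switches have
    two or three neighbors instead of four.
    """
    if m < 1 or n < 1:
        raise ValueError("mesh dimensions must be positive")
    links: Dict[Coord, List[Coord]] = {}
    for r in range(m):
        for c in range(n):
            candidates = [(r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)]
            # Keep only neighbors that lie inside the mesh.
            links[(r, c)] = [
                (nr, nc) for nr, nc in candidates if 0 <= nr < m and 0 <= nc < n
            ]
    return links


def attach_resources(m: int, n: int) -> Dict[Coord, str]:
    """Give every switch exactly one resource (hypothetical naming)."""
    return {(r, c): f"resource_{r}_{c}" for r in range(m) for c in range(n)}


mesh = build_mesh(3, 4)
resources = attach_resources(3, 4)
```

In this sketch an interior switch such as `(1, 1)` has four neighboring switches and one attached resource, which matches the connectivity stated in the abstract; regions spanning several slots are not modeled.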

1,304 citations

Proceedings Article•DOI•

Extending platform-based design to network on chip systems

Juha-Pekka Soininen, Axel Jantsch, Martti Forsell, A. Pelkonen, Jari Kreku, Shashi Kumar

04 Jan 2003

TL;DR: A novel layered backbone-platform-system (BPS) design methodology for development of network-on-chip based products that combines and extends the distributed, parallel, embedded and platform-based design concepts in order to manage the diversity and complexity of NOC-based systems.

Abstract: Exploitation of silicon capacity will require improvements in design productivity and more scalable system paradigms. Asynchronous message passing networks on chip (NOC) have been proposed as backbones for billion-transistor ASICs. We present a novel layered backbone-platform-system (BPS) design methodology for development of network-on-chip based products. It combines and extends the distributed, parallel, embedded and platform-based design concepts in order to manage the diversity and complexity of NOC-based systems. The reuse of communication principles in various platforms, the reuse of platforms in product differentiation, and system-level decision-support methods are the cornerstones of our methodology. The presented mappability estimation and workload simulations demonstrate the feasibility of such methods.

22 citations

Journal Article•DOI•

Are multiport memories physically feasible?

Martti Forsell¹•Institutions (1)

University of Eastern Finland¹

01 Sep 1994 - ACM SIGARCH Computer Architecture News

TL;DR: This paper considers true multiport memory as a building block of efficient PRAM-style shared main memory, and shows that at least small multiport memories look physically feasible and that the power of the PRAM model can be fully exploited by computer systems with multiport memories.

Abstract: Parallel Random Access Machine (PRAM) is a popular model for parallel computation that promises easy programmability and great parallel performance, but only if efficient shared main memories can be built. This won't be easy, because the complexity of shared memories leads to difficult technical problems. In this paper we consider the idea of true multiport memory that can be used as building block of efficient PRAM-style shared main memory. Two possible structures of multiport memory chips are presented. We will also give preliminary cost-effectivity and performance analysis of memory systems using proposed multiport RAMs. Results are encouraging: At least small size multiport memories look physically feasible. Also the power of PRAM model can be fully exploited by computer systems with multiport memories.

21 citations

Journal Article•DOI•

Architectural differences of efficient sequential and parallel computers

Martti Forsell

01 Jul 2002 - Journal of Systems Architecture

TL;DR: The performance of eight general-purpose processor architectures, broadly representing both commercial and scientific processor designs, is evaluated in single-processor and multiprocessor setups, and it is concluded that there is no single optimal architecture for general-purpose computation.

17 citations

Book•DOI•

Euro-Par 2007 Workshops: Parallel Processing

L. Bougé, Martti Forsell, Jesper Larsson Träff, Achim Streit, Wolfgang Ziegler, Michael Alexander, Stephen Childs

01 Jan 2008

17 citations

Cited by

Journal Article•DOI•

A survey of research and practices of Network-on-chip

Tobias Bjerregaard¹, Shankar Mahadevan¹•Institutions (1)

Technical University of Denmark¹

29 Jun 2006 - ACM Computing Surveys

TL;DR: The research shows that NoC constitutes a unification of current trends of intrachip communication rather than an explicit new alternative.

Abstract: The scaling of microchip technologies has enabled large scale systems-on-chip (SoC). Network-on-chip (NoC) research addresses global communication in SoC, involving (i) a move from computation-centric to communication-centric design and (ii) the implementation of scalable communication structures. This survey presents a perspective on existing NoC research. We define the following abstractions: system, network adapter, network, and link to explain and structure the fundamental concepts. First, research relating to the actual network design is reviewed. Then system level design and modeling are discussed. We also evaluate performance analysis techniques. The research shows that NoC constitutes a unification of current trends of intrachip communication rather than an explicit new alternative.

1,720 citations

Proceedings Article•DOI•

A network on chip architecture and design methodology

Shashi Kumar, Axel Jantsch, Juha-Pekka Soininen, Martti Forsell, Mikael Millberg¹, Johnny Öberg¹, Kari Tiensyrjä¹, Ahmed Hemani¹•Institutions (1)

Royal Institute of Technology¹

07 Aug 2002

1,304 citations

Journal Article•DOI•

Performance evaluation and design trade-offs for network-on-chip interconnect architectures

Partha Pratim Pande¹, C. Grecu¹, M. Jones¹, Andre Ivanov¹, Resve A. Saleh¹•Institutions (1)

University of British Columbia¹

01 Aug 2005 - IEEE Transactions on Computers

TL;DR: This paper develops a consistent and meaningful evaluation methodology to compare the performance and characteristics of a variety of NoC architectures and explores design trade-offs that characterize the NoC approach and obtains comparative results for a number of common NoC topologies.

Abstract: Multiprocessor system-on-chip (MP-SoC) platforms are emerging as an important trend for SoC design. Power and wire design constraints are forcing the adoption of new design methodologies for system-on-chip (SoC), namely, those that incorporate modularity and explicit parallelism. To enable these MP-SoC platforms, researchers have recently pursued scaleable communication-centric interconnect fabrics, such as networks-on-chip (NoC), which possess many features that are particularly attractive for these. These communication-centric interconnect fabrics are characterized by different trade-offs with regard to latency, throughput, energy dissipation, and silicon area requirements. In this paper, we develop a consistent and meaningful evaluation methodology to compare the performance and characteristics of a variety of NoC architectures. We also explore design trade-offs that characterize the NoC approach and obtain comparative results for a number of common NoC topologies. To the best of our knowledge, this is the first effort in characterizing different NoC architectures with respect to their performance and design trade-offs. To further illustrate our evaluation methodology, we map a typical multiprocessing platform to different NoC interconnect architectures and show how the system performance is affected by these design trade-offs.

921 citations

Proceedings Article•DOI•

Bandwidth-constrained mapping of cores onto NoC architectures

Srinivasan Murali¹, G. De Micheli¹•Institutions (1)

Stanford University¹

16 Feb 2004

TL;DR: Presents NMAP, a fast algorithm that maps cores onto a mesh NoC architecture under bandwidth constraints while minimizing the average communication delay, for both single minimum-path routing and split-traffic routing.

Abstract: We address the design of complex monolithic systems, where processing cores generate and consume a varying and large amount of data, thus bringing the communication links to the edge of congestion. Typical applications are in the area of multi-media processing. We consider a mesh-based networks on chip (NoC) architecture, and we explore the assignment of cores to mesh cross-points so that the traffic on links satisfies bandwidth constraints. A single-path deterministic routing between the cores places high bandwidth demands on the links. The bandwidth requirements can be significantly reduced by splitting the traffic between the cores across multiple paths. In this paper, we present NMAP, a fast algorithm that maps the cores onto a mesh NoC architecture under bandwidth constraints, minimizing the average communication delay. The NMAP algorithm is presented for both single minimum-path routing and split-traffic routing. The algorithm is applied to a benchmark DSP design and the resulting NoC is built and simulated at cycle accurate level in SystemC using macros from the ×pipes library. Also, experiments with six video processing applications show significant savings in bandwidth and communication cost for NMAP algorithm when compared to existing algorithms.
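The bandwidth constraint mentioned in this abstract can be made concrete with a small check. The sketch below does not reproduce NMAP itself. It assumes dimension-ordered XY routing as the single deterministic minimum path, and the names `xy_route`, `link_loads`, `satisfies_bandwidth`, the core names, and the demand values are all hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

Coord = Tuple[int, int]     # (x, y) position of a mesh cross-point
Link = Tuple[Coord, Coord]  # directed link between adjacent cross-points


def xy_route(src: Coord, dst: Coord) -> List[Link]:
    """Directed links on the XY path: travel along x first, then along y."""
    path: List[Link] = []
    x, y = src
    while x != dst[0]:
        nx = x + (1 if dst[0] > x else -1)
        path.append(((x, y), (nx, y)))
        x = nx
    while y != dst[1]:
        ny = y + (1 if dst[1] > y else -1)
        path.append(((x, y), (x, ny)))
        y = ny
    return path


def link_loads(
    mapping: Dict[str, Coord], demands: Dict[Tuple[str, str], float]
) -> Dict[Link, float]:
    """Sum the bandwidth demand carried by every directed link."""
    loads: Dict[Link, float] = defaultdict(float)
    for (src_core, dst_core), bandwidth in demands.items():
        for link in xy_route(mapping[src_core], mapping[dst_core]):
            loads[link] += bandwidth
    return dict(loads)


def satisfies_bandwidth(
    mapping: Dict[str, Coord],
    demands: Dict[Tuple[str, str], float],
    capacity: float,
) -> bool:
    """True if no link carries more than `capacity` (same unit as demands)."""
    return all(load <= capacity for load in link_loads(mapping, demands).values())


# Hypothetical three-core example on a 2 x 2 mesh.
mapping = {"cpu": (0, 0), "dsp": (1, 0), "mem": (1, 1)}
demands = {("cpu", "mem"): 300.0, ("dsp", "mem"): 400.0}
loads = link_loads(mapping, demands)
```

Here both flows share the link from `(1, 0)` to `(1, 1)`, so that link carries the sum of the two demands. A mapping algorithm would search for placements where no such shared link exceeds its capacity, or would split traffic across several paths as the abstract describes.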

714 citations

Journal Article•DOI•

Energy- and performance-aware mapping for regular NoC architectures

Jingcao Hu¹, Radu Marculescu¹•Institutions (1)

Carnegie Mellon University¹

04 Apr 2005 - IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

TL;DR: An algorithm which automatically maps a given set of intellectual property onto a generic regular network-on-chip (NoC) architecture and constructs a deadlock-free deterministic routing function such that the total communication energy is minimized.

Abstract: In this paper, we present an algorithm which automatically maps a given set of intellectual property onto a generic regular network-on-chip (NoC) architecture and constructs a deadlock-free deterministic routing function such that the total communication energy is minimized. At the same time, the performance of the resulting communication system is guaranteed to satisfy the specified design constraints through bandwidth reservation. As the main theoretical contribution, we first formulate the problem of energy- and performance-aware mapping in a topological sense, and show how the routing flexibility can be exploited to expand the solution space and improve the solution quality. An efficient branch-and-bound algorithm is then proposed to solve this problem. Experimental results show that the proposed algorithm is very fast, and significant communication energy savings can be achieved. For instance, for a complex video/audio application, 51.7% communication energy savings have been observed, on average, compared to an ad hoc implementation.
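The objective described in this abstract, placing cores so that total communication energy is minimized, can be illustrated on a toy instance. The sketch below is not the paper's branch-and-bound algorithm: it uses exhaustive search, which is only practical for a handful of cores, and it assumes a simplified energy model in which energy is proportional to communication volume times hop count. It also ignores routing-function construction and bandwidth reservation. The names `manhattan`, `comm_energy`, `best_mapping`, and the example volumes are hypothetical.

```python
from itertools import permutations
from typing import Dict, List, Optional, Tuple

Coord = Tuple[int, int]


def manhattan(a: Coord, b: Coord) -> int:
    """Hop count between two tiles under minimal routing on a mesh."""
    return abs(a[0] - b[0]) + abs(a[1] - b[1])


def comm_energy(
    mapping: Dict[str, Coord],
    volumes: Dict[Tuple[str, str], float],
    energy_per_hop: float = 1.0,
) -> float:
    """Assumed model: energy = volume * hops * energy_per_hop, summed over flows."""
    return sum(
        volume * energy_per_hop * manhattan(mapping[src], mapping[dst])
        for (src, dst), volume in volumes.items()
    )


def best_mapping(
    cores: List[str], tiles: List[Coord], volumes: Dict[Tuple[str, str], float]
) -> Tuple[Optional[Dict[str, Coord]], float]:
    """Try every assignment of cores to distinct tiles and keep the cheapest."""
    best: Optional[Dict[str, Coord]] = None
    best_energy = float("inf")
    for placement in permutations(tiles, len(cores)):
        mapping = dict(zip(cores, placement))
        energy = comm_energy(mapping, volumes)
        if energy < best_energy:
            best, best_energy = mapping, energy
    return best, best_energy


# Hypothetical four-core example on a 2 x 2 mesh.
cores = ["a", "b", "c", "d"]
tiles = [(0, 0), (0, 1), (1, 0), (1, 1)]
volumes = {("a", "b"): 10.0, ("b", "c"): 10.0, ("a", "d"): 1.0}
placement, energy = best_mapping(cores, tiles, volumes)
```

In this example the search places every communicating pair on adjacent tiles, so each flow travels one hop. The factorial growth of the search space is the reason a pruning strategy such as branch-and-bound is needed for realistic designs.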

662 citations
