Home
/
Authors
/
Julio Sahuquillo

Author

Julio Sahuquillo

Other affiliations: Polytechnic University of Catalonia, University of Valencia

Bio: Julio Sahuquillo is an academic researcher from Polytechnic University of Valencia. The author has contributed to research in topics: Cache & Cache algorithms. The author has an hindex of 21, co-authored 171 publications receiving 1743 citations. Previous affiliations of Julio Sahuquillo include Polytechnic University of Catalonia & University of Valencia.

Topics: Cache, Cache algorithms, Cache pollution, CPU cache, Multi-core processor ...read more

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2002
2001
2000
1999
1998

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Multi2Sim: A Simulation Framework to Evaluate Multicore-Multithreaded Processors

[...]

Rafael Ubal, Julio Sahuquillo, Salvador Petit, Pedro López

19 Nov 2007

TL;DR: The Multi2Sim simulation framework is presented, which models the major components of incoming systems, and is intended to cover the limitations of existing simulators.

...read moreread less

Abstract: Current microprocessors are based in complex designs, integrating different components on a single chip, such as hardware threads, processor cores, memory hierarchy or interconnection networks. The permanent need of evaluating new designs on each of these components motivates the development of tools which simulate the system working as a whole. In this paper, we present the Multi2Sim simulation framework, which models the major components of incoming systems, and is intended to cover the limitations of existing simulators. A set of simulation examples is also included for illustrative purposes.

...read moreread less

164 citations

Proceedings Article•DOI•

Efficient interconnects for clustered microarchitectures

[...]

Joan-Manuel Parcerisa, Julio Sahuquillo¹, Antonio González², José Duato²•Institutions (2)

Polytechnic University of Catalonia¹, Polytechnic University of Valencia²

22 Sep 2002

TL;DR: This work investigates the design of on-chip interconnection networks for clustered microarchitectures and proposes point-to-point interconnects together with an effective latency-aware instruction steering scheme and shows that they achieve much better performance than bus-based interConnects.

...read moreread less

Abstract: Clustering is an effective microarchitectural technique for reducing the impact of wire delays, the complexity, and the power requirements of microprocessors. In this work, we investigate the design of on-chip interconnection networks for clustered microarchitectures. This new class of interconnects has different demands and characteristics than traditional multiprocessor networks. In a clustered microarchitecture, a low inter-cluster communication latency is essential for high performance. We propose point-to-point interconnects together with an effective latency-aware instruction steering scheme and show that they achieve much better performance than bus-based interconnects. The results show that the connectivity of the network together with latency-aware steering schemes are key for high performance. We also show that these interconnects can be built with simple hardware and achieve a performance close to that of an idealized contention-free model.

...read moreread less

154 citations

Proceedings Article•DOI•

A simple power-aware scheduling for multicore systems when running real-time applications

[...]

Diana Bautista, Julio Sahuquillo, Houcine Hassan, Salvador Petit, José Duato - Show less +1 more

14 Apr 2008

TL;DR: This paper proposes a novel soft power-aware real-time scheduler for a state-of-the-art multicore multithreaded processor, which implements dynamic voltage scaling techniques, and shows that using a fair scheduling policy, the proposed algorithm provides, on average, energy savings.

...read moreread less

Abstract: High-performance microprocessors, e.g., multithreaded and multicore processors, are being implemented in embedded real-time systems because of the increasing computational requirements. These complex microprocessors have two major drawbacks when they are used for real-time purposes. First, their complexity difficults the calculation of the WCET (worst case execution time). Second, power consumption requirements are much larger, which is a major concern in these systems. In this paper we propose a novel soft power-aware real-time scheduler for a state-of-the-art multicore multithreaded processor, which implements dynamic voltage scaling techniques. The proposed scheduler reduces the energy consumption while satisfying the constraints of soft real-time applications. Different scheduling alternatives have been evaluated, and experimental results show that using a fair scheduling policy, the proposed algorithm provides, on average, energy savings ranging from 34% to 74%.

...read moreread less

60 citations

Proceedings Article•DOI•

Exploiting temporal locality in drowsy cache policies

[...]

Salvador Petit¹, Julio Sahuquillo¹, Jose M. Such¹, David Kaeli²•Institutions (2)

Polytechnic University of Valencia¹, Northeastern University²

04 May 2005

TL;DR: A novel drowsy cache policy is proposed called Reuse Most Recently used On (RMRO), which makes use of reuse information to trade off performance versus energy consumption and improves the hit ratio fordrowsy lines by about 67%, while reducing the power consumption by about 11.7%.

...read moreread less

Abstract: Technology projections indicate that static power will become a major concern in future generations of high-performance microprocessors. Caches represent a significant percentage of the overall microprocessor die area. Therefore, recent research has concentrated on the reduction of leakage current dissipated by caches. The variety of techniques to control current leakage can be classified as non-state preserving or state preserving. Non-state preserving techniques power off selected cache lines while state preserving place selected lines into a low-power state. Drowsy caches are a recently proposed state-preserving technique. In order to introduce low performance overhead, drowsy caches must be very selective on which cache lines are moved to a drowsy statePast research on cache organization has focused on how best to exploit the temporal locality present in the data stream. In this paper we propose a novel drowsy cache policy called Reuse Most Recently used On (RMRO), which makes use of reuse information to trade off performance versus energy consumption. Our proposal improves the hit ratio for drowsy lines by about 67%, while reducing the power consumption by about 11.7% (assuming 70nm technology) with respect to previously proposed drowsy cache policies.

...read moreread less

53 citations

Journal Article•DOI•

A user-focused evaluation of web prefetching algorithms

[...]

Josep Domenech¹, Ana Pont¹, Julio Sahuquillo¹, José A. Gil¹•Institutions (1)

Polytechnic University of Valencia¹

01 Jul 2007-Computer Communications

TL;DR: This paper analyzes the perceived latency versus the traffic increase (both in bytes and in objects) to evaluate the benefits from the user's perspective and shows that higher algorithm complexity does not improve performance, object-based algorithms outperform those based on pages, and performance among object- based algorithms present minor differences in the object traffic increase.

...read moreread less

52 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36

Collapse

Cited by

PDF

Open Access

More filters

Fast parallel algorithms for short-range molecular dynamics

[...]

Steven J. Plimpton¹•Institutions (1)

Sandia National Laboratories¹

01 May 1993

TL;DR: Comparing the results to the fastest reported vectorized Cray Y-MP and C90 algorithm shows that the current generation of parallel machines is competitive with conventional vector supercomputers even for small problems.

...read moreread less

Abstract: Three parallel algorithms for classical molecular dynamics are presented. The first assigns each processor a fixed subset of atoms; the second assigns each a fixed subset of inter-atomic forces to compute; the third assigns each a fixed spatial region. The algorithms are suitable for molecular dynamics models which can be difficult to parallelize efficiently—those with short-range forces where the neighbors of each atom change rapidly. They can be implemented on any distributed-memory parallel machine which allows for message-passing of data between independently executing processors. The algorithms are tested on a standard Lennard-Jones benchmark problem for system sizes ranging from 500 to 100,000,000 atoms on several parallel supercomputers--the nCUBE 2, Intel iPSC/860 and Paragon, and Cray T3D. Comparing the results to the fastest reported vectorized Cray Y-MP and C90 algorithm shows that the current generation of parallel machines is competitive with conventional vector supercomputers even for small problems. For large problems, the spatial algorithm achieves parallel efficiencies of 90% and a 1840-node Intel Paragon performs up to 165 faster than a single Cray C9O processor. Trade-offs between the three algorithms and guidelines for adapting them to more complex molecular dynamics simulations are also discussed.

...read moreread less

29,323 citations

[서평]「Computer Organization and Design, The Hardware/Software Interface」

[...]

장훈

01 Nov 1997

TL;DR: Recognizing the mannerism ways to get this books computer organization and design the hardware software interface 4th fourth edition by patterson hennessy is additionally useful.

...read moreread less

Abstract: Recognizing the mannerism ways to get this books computer organization and design the hardware software interface 4th fourth edition by patterson hennessy is additionally useful. You have remained in right site to begin getting this info. acquire the computer organization and design the hardware software interface 4th fourth edition by patterson hennessy join that we manage to pay for here and check out the link.

...read moreread less

832 citations

Using Mpi Portable Parallel Programming With The Message Passing Interface

[...]

Christina Freytag

01 Jan 2016

TL;DR: Thank you very much for downloading using mpi portable parallel programming with the message passing interface for reading a good book with a cup of coffee in the afternoon, instead they are facing with some malicious bugs inside their laptop.

...read moreread less

Abstract: Thank you very much for downloading using mpi portable parallel programming with the message passing interface. As you may know, people have search hundreds times for their chosen novels like this using mpi portable parallel programming with the message passing interface, but end up in harmful downloads. Rather than reading a good book with a cup of coffee in the afternoon, instead they are facing with some malicious bugs inside their laptop.

...read moreread less

593 citations

技術解説 IEEE Computer

[...]

Thorsten von Eicken, Werner Vogels

18 Jan 1999

465 citations

Journal Article•DOI•

A tool for the generation of realistic network workload for emerging networking scenarios

[...]

Alessio Botta, Alberto Dainotti, Antonio Pescape

01 Oct 2012-Computer Networks

TL;DR: This paper describes the main properties that a network workload generator should have today, and presents a tool for the generation of realistic network workload that can be used for the study of emerging networking scenarios.

...read moreread less

434 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse