G
Guillaume Mercier
Researcher at L'Abri
Publications - 38
Citations - 1496
Guillaume Mercier is an academic researcher from L'Abri. The author has contributed to research in topics: Multi-core processor & Network topology. The author has an hindex of 17, co-authored 37 publications receiving 1421 citations. Previous affiliations of Guillaume Mercier include Argonne National Laboratory & University of Bordeaux.
Papers
More filters
Proceedings ArticleDOI
hwloc: A Generic Framework for Managing Hardware Affinities in HPC Applications
François Broquedis,Jérôme Clet-Ortega,Stéphanie Moreaud,Nathalie Furmento,Brice Goglin,Guillaume Mercier,Samuel Thibault,Raymond Namyst +7 more
TL;DR: The Hardware Locality (hwloc) software is introduced which gathers hardware information about processors, caches, memory nodes and more, and exposes it to applications and runtime systems in a abstracted and portable hierarchical manner.
Proceedings ArticleDOI
Design and evaluation of Nemesis, a scalable, low-latency, message-passing communication subsystem
TL;DR: Nemesis is a new low-level communication subsystem designed and implemented to be scalable and efficient both in the intranode communication context using shared-memory and in the internode communication case using high-performance networks and is natively multimethod-enabled.
Journal ArticleDOI
Process Placement in Multicore Clusters:Algorithmic Issues and Practical Techniques
TL;DR: This paper details the algorithm and techniques proposed to decrease the communication costs of parallel applications by matching the communication pattern to the underlying hardware architecture.
Book ChapterDOI
Near-optimal placement of MPI processes on hierarchical NUMA architectures
TL;DR: A novel algorithm called TreeMatch is described that maps processes to resources in order to reduce the communication cost of the whole application.
Book ChapterDOI
Towards an Efficient Process Placement Policy for MPI Applications in Multicore Environments
TL;DR: It is found out that a policy merely increasing the intranode communication ratio is not enough and that cache utilization is also an influential factor and a more sophisticated policy is required to observe performance improvements.