Decentralized in-order execution of a sequential task-based code for shared-memory architectures

Castes, Charly; Agullo, Emmanuel; Aumage, Olivier; Saillard, Emmanuelle

doi:10.1109/IPDPSW55747.2022.00095

2022

Abstract

The hardware complexity of modern machines makes the design of adequate programming models crucial for jointly ensuring performance, portability, and productivity in high-performance computing (HPC). Sequential task-based programming models paired with advanced runtime systems allow the programmer to write a sequential algorithm independently of the hardware architecture in a productive and portable manner, and let a third-party software layer (the runtime system) deal with the burden of scheduling a correct, parallel execution of that algorithm to ensure performance. Many HPC algorithms have been successfully implemented following this paradigm, a testimony to its effectiveness.

Developing algorithms that specifically require fine-grained tasks along this model is still considered prohibitive, however, due to per-task management overhead [1], forcing the programmer to resort to a less abstract, and hence more complex, "task+X" model. We thus investigate the possibility of offering a tailored execution model, trading dynamic mapping for efficiency by using a decentralized, conservative in-order execution of the task flow, while preserving the benefits of relying on the sequential task-based programming model. We propose a formal specification of the execution model as well as a prototype implementation, which we assess on a shared-memory multicore architecture with several synthetic workloads. The results show that, provided the programmer supplies a proper task mapping, the pressure on the runtime system is significantly reduced and the execution of fine-grained task flows is much more efficient.
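
To make the idea of a decentralized, in-order execution with a programmer-supplied mapping more concrete, here is a minimal Python sketch of one possible reading of the abstract. It is not the authors' prototype or formal specification. The names `Handle` and `run_in_order`, the `(owner, func, handles)` task format, and the choice to treat every data access as a read-write access are all assumptions made for illustration. Each worker replays the whole sequential task flow, executes only the tasks mapped to it, and synchronizes through per-datum counters rather than through a central scheduler.

```python
import threading

class Handle:
    """Hypothetical shared datum with a counter of completed accesses."""
    def __init__(self, value):
        self.value = value
        self.done = 0                      # accesses completed so far
        self.cond = threading.Condition()  # guards and signals `done`

def run_in_order(task_flow, num_workers):
    """Every worker walks the full task flow in submission order.

    task_flow is a list of (owner, func, handles) tuples; `owner` is the
    static mapping supplied by the programmer (an assumption of this sketch).
    """
    def worker(rank):
        seen = {}  # local count of accesses met so far, per handle
        for owner, func, handles in task_flow:
            if owner == rank:
                # Conservative rule: wait until every earlier access
                # to each handle of this task has completed.
                for h in handles:
                    with h.cond:
                        h.cond.wait_for(lambda: h.done == seen.get(id(h), 0))
                func(*handles)
                for h in handles:
                    with h.cond:
                        h.done += 1
                        h.cond.notify_all()
            # Tasks owned by other workers only update local bookkeeping.
            for h in handles:
                seen[id(h)] = seen.get(id(h), 0) + 1

    threads = [threading.Thread(target=worker, args=(r,))
               for r in range(num_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()

def add_one(x):
    x.value += 1

def double(x):
    x.value *= 2

def add_into(dst, src):
    dst.value += src.value

a, b = Handle(1), Handle(10)
flow = [
    (0, add_one, (a,)),     # worker 0 writes a
    (1, double, (b,)),      # worker 1 writes b, independent of the task above
    (1, double, (a,)),      # worker 1 must wait for worker 0's write to a
    (0, add_into, (b, a)),  # worker 0 must wait for both earlier writes
]
run_in_order(flow, num_workers=2)
print(a.value, b.value)
```

In this sketch no task graph is built and no worker ever picks a task dynamically; the cost per task is a few counter updates, which is the kind of trade-off (static mapping in exchange for lower per-task overhead) the abstract describes. A fuller model would distinguish read from write accesses so that concurrent readers do not serialize.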

Details

Title Decentralized in-order execution of a sequential task-based code for shared-memory architectures

Author(s) Castes, Charly ; Agullo, Emmanuel ; Aumage, Olivier ; Saillard, Emmanuelle

Published in 2022 IEEE 36th International Parallel and Distributed Processing Symposium Workshops (IPDPSW 2022)

Series IEEE International Symposium on Parallel and Distributed Processing Workshops

Pages 552-561

Conference 36th IEEE International Parallel and Distributed Processing Symposium (IEEE IPDPS), May 30-Jun 03, 2022, ELECTR NETWORK

Date 2022-01-01

Publisher Los Alamitos, IEEE Computer Society

ISSN 2164-7062

ISBN 978-1-6654-9747-3

DOI https://doi.org/10.1109/IPDPSW55747.2022.00095

Laboratories DCSL

Record Appears in Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > DCSL - Data Center Systems Laboratory
Peer-reviewed publications
Conference Papers
Work produced at EPFL
Published

Record creation date 2022-10-10