Characterizing and mitigating work time inflation in task parallel programs
Author(s) -
Stephen L. Olivier,
Bronis R. de Supinski,
Martin Schulz,
Jan F. Prins
Publication year - 2013
Publication title -
2012 international conference for high performance computing, networking, storage and analysis
Language(s) - English
Resource type - Conference proceedings
SCImago Journal Rank - 0.363
H-Index - 56
eISSN - 2167-4337
pISSN - 2167-4329
ISBN - 978-1-4673-0806-9
DOI - 10.1109/sc.2012.27
Subject(s) - computing and processing
Task parallelism raises the level of abstraction in shared memory parallel programming to simplify the development of complex applications. However, task parallel applications can exhibit poor performance due to thread idleness, scheduling overheads, and work time inflation -- additional time spent by threads in a multithreaded computation beyond the time required to perform the same work in a sequential computation. We identify the contributions of each factor to lost efficiency in various task parallel OpenMP applications and diagnose the causes of work time inflation in those applications. Increased data access latency can cause significant work time inflation in NUMA systems. Our locality framework for task parallel OpenMP programs mitigates this cause of work time inflation. Our extensions to the Qthreads library demonstrate that locality-aware scheduling can improve performance up to 3X compared to the Intel OpenMP task scheduler.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom