Workshop program
Tuesday, June 14 LaSalle I
8:00 a.m. - 5:00 p.m.
Registration/Check-in
9:00 a.m.

Robert Harrison, Joint Institute of Computational Sciences UT/ORNL
Many are now questioning whether our current approaches to developing software for science and engineering are sustainable. In particular, can we deliver to society and the nation the full benefits expected from high-performance simulation at the peta- and exascales? Or is innovative science being stifled by the increasing complexities of all aspects of our problem space (rapidly changing hardware, software, multidisciplinary physics, etc.)?
Focusing on applications in chemistry and materials science, and motivated co-design of exascale hardware and software, I will discuss many of these issues including how chemistry has already been forced to adopt solutions that differ quite sharply to those in the mainstream, and how these solutions position us well for the technology transitions now underway.
Many are now questioning whether our current approaches to developing software for science and engineering are sustainable. In particular, can we deliver to society and the nation the full benefits expected from high-performance simulation at the peta- and exascales? Or is innovative science being stifled by the increasing complexities of all aspects of our problem space (rapidly changing hardware, software, multidisciplinary physics, etc.)?
Focusing on applications in chemistry and materials science, and motivated co-design of exascale hardware and software, I will discuss many of these issues including how chemistry has already been forced to adopt solutions that differ quite sharply to those in the mainstream, and how these solutions position us well for the technology transitions now underway.
10:00 a.m.
Break
Lasalle Foyer
10:30 a.m.
Session 1: Using OpenMP with Applications
Parallelising Computational Microstructure Simulations for Metallic Materials with OpenMP (PDF)
"Hybrid Programming Model for Implicit PDE Simulations on Multicore Architectures"
An Experimental Model to Analyze OpenMP Applications for System Utilization (PDF)
Parallelising Computational Microstructure Simulations for Metallic Materials with OpenMP (PDF)
Ralph Altenfeld, Markus Apel, An Mey Dieter, Bernd Boettger, Stefan Benke and Christian Bischof
"Hybrid Programming Model for Implicit PDE Simulations on Multicore Architectures"
Dinesh Kaushik, David Keyes, Satish Balay and Barry Smith
An Experimental Model to Analyze OpenMP Applications for System Utilization (PDF)
Mark Woodyard
Noon
Lunch
Lasalle Foyer
1:30 p.m.
Session 2: Extensions for OpenMP
Thread-Local Storage Extension to Support Thread-Based MPI/OpenMP Applications (PDF)
OpenMP Extensions for Heterogeneous Architectures (PDF)
Thread-Local Storage Extension to Support Thread-Based MPI/OpenMP Applications (PDF)
Patrick Carribault, Marc Perache and Herve Jourdren
OpenMP Extensions for Heterogeneous Architectures (PDF)
Leo White
2:30 p.m.
Break
Lasalle Foyer
3:00 p.m.
Session 3: Extensions for OpenMP
OpenMP for Accelerators (PDF)
Unifying Barrier and Point-to-Point Synchronization in OpenMP with Phasers (PDF)
OpenMP for Accelerators (PDF)
James Beyer, Eric Stotzer, Alistair Hart and Bronis De Supinski
Unifying Barrier and Point-to-Point Synchronization in OpenMP with Phasers (PDF)
Jun Shirako, Kamal Sharma and Vivek Sarkar
4:00 p.m.
OpenMP Language Committee Report (PDF)
Bronis R. de Supinski, Lawrence Livermore National Laboratory
An OpenMP 3.1 Specification draft was released for comment earlier this year. Concurrent with that, the committee has reviewed all sections in detail. Based on feedback from the community and that review, the final version will soon be voted on by the OpenMP Architecture Review Board (ARB). We expect the final version to be adopted in July. In addition, the Language Committee continues its efforts to develop a more comprehensive 4.0 update that will position OpenMP for current and future systems in which locality is increasingly important and accelerators are common. This talk will detail the changes in OpenMP 3.1 as well as directions for 4.0.
An OpenMP 3.1 Specification draft was released for comment earlier this year. Concurrent with that, the committee has reviewed all sections in detail. Based on feedback from the community and that review, the final version will soon be voted on by the OpenMP Architecture Review Board (ARB). We expect the final version to be adopted in July. In addition, the Language Committee continues its efforts to develop a more comprehensive 4.0 update that will position OpenMP for current and future systems in which locality is increasingly important and accelerators are common. This talk will detail the changes in OpenMP 3.1 as well as directions for 4.0.
6:00 p.m.
Reception
Market House Totunda
Wednesday, June 15 Ohio Room
9:00 a.m.

Arch Robison, Intel
Intel® Threading Building Blocks (Intel® TBB) is a library approach to parallel programmingno compiler extensions are required. In this talk, I'll explore the motivation for the taking that approach and how some of the consequent challenges were solved. I'll trace TBB's lineage to earlier systems, how its work-stealing scheduler evolved over the last five years, and where it might be headed. I'll also compare TBB with OpenMP and other Intel paradigms for parallel programming such as Cilk Plus.
Intel® Threading Building Blocks (Intel® TBB) is a library approach to parallel programmingno compiler extensions are required. In this talk, I'll explore the motivation for the taking that approach and how some of the consequent challenges were solved. I'll trace TBB's lineage to earlier systems, how its work-stealing scheduler evolved over the last five years, and where it might be headed. I'll also compare TBB with OpenMP and other Intel paradigms for parallel programming such as Cilk Plus.
10:00 a.m.
Break
Ohio Room
10:30 a.m.
Session 4: Tools for OpenMP
"OMPVerify: Polyhedral Analysis for the OpenMP Programmer"
A Dynamic Optimization Framework for OpenMP (PDF)
Towards NUMA Support with Distance Information (PDF)
"OMPVerify: Polyhedral Analysis for the OpenMP Programmer"
Vamshi Basupal li, Tomofumi Yuki, Sanjay Rajopadhye, Antoine Morvan, Steven Derrien, Patrice Quinton, and D. Wonnacott
A Dynamic Optimization Framework for OpenMP (PDF)
Besar Wicaksono, Ramachandra C. Nanjegowda and Barbara Chapman
Towards NUMA Support with Distance Information (PDF)
Dirk Schmidl, Christian Terboven and Dieter An Mey
Noon
Lunch
Ohio Room
1:30 p.m.
Session 5: Implementation and Performance
Performance Evaluation of OpenMP Applications on Virtualized Multicore Machines (PDF)
Performance Analysis and Tuning of Automatically Parallelized OpenMP Applications (PDF)
"A Runtime Implementation of OpenMP Tasks"
Performance Evaluation of OpenMP Applications on Virtualized Multicore Machines (PDF)
Jie Tao, Karl Furlinger and Holger Marten
Performance Analysis and Tuning of Automatically Parallelized OpenMP Applications (PDF)
Dheya Mustafa, Aurangzeb and Rudolf Eigenmann
"A Runtime Implementation of OpenMP Tasks"
James Lagrone, Ayodunni Aribuki, Cody Addison and Barbara Chapman
3:00 p.m.
Break
Ohio Room
3:30 p.m.
OpenMP Language Committee Accelerator Sub-committee Report
James Beyer and Eric Stotzer
Co-chairs, OpenMP Language committee accelerator sub-committee
The two dominate programming models for GPUs, the most common accelerator, are Compute Unified Device Architecture (CUDA) and Open Computing Language (OpenCL), both of these programming models give the programmer the power to extract performance from the accelerator. However, they also require the programmer to significantly rewrite their codes. Rather than require the programmer to rewrite the code, it would be preferable to use a standard set of high level language directives that would help the compiler to generate optimized code to run on the heterogeneous system. The OpenMP language committee accelerator sub-committee is working on a set of OpenMP extensions to support non-cache coherent and cache coherent accelerators. This talk will present some of the work already accomplished by the sub-committee, as well as present some of the major task that remain to be completed.
Co-chairs, OpenMP Language committee accelerator sub-committee
The two dominate programming models for GPUs, the most common accelerator, are Compute Unified Device Architecture (CUDA) and Open Computing Language (OpenCL), both of these programming models give the programmer the power to extract performance from the accelerator. However, they also require the programmer to significantly rewrite their codes. Rather than require the programmer to rewrite the code, it would be preferable to use a standard set of high level language directives that would help the compiler to generate optimized code to run on the heterogeneous system. The OpenMP language committee accelerator sub-committee is working on a set of OpenMP extensions to support non-cache coherent and cache coherent accelerators. This talk will present some of the work already accomplished by the sub-committee, as well as present some of the major task that remain to be completed.
5:00 p.m.
Closing




