Search
Search Results
-
Introducing the Metric Proxy for Holistic I/O Measurements
High-Performance Computing (HPC) systems face a wide spectrum of I/O patterns from various sources including workflows, in-Situ data operations, or... -
Euro-Par 2024: Parallel Processing 30th European Conference on Parallel and Distributed Processing, Madrid, Spain, August 26–30, 2024, Proceedings, Part II
The three-volume set LNCS 14801, 14802, and 14803 constitutes the proceedings of the 30th European Conference on Parallel and Distributed Processing,...
-
Euro-Par 2024: Parallel Processing 30th European Conference on Parallel and Distributed Processing, Madrid, Spain, August 26–30, 2024, Proceedings, Part I
The three-volume set LNCS 14801, 14802, and 14803 constitutes the proceedings of the 30th European Conference on Parallel and Distributed Processing,...
-
Euro-Par 2024: Parallel Processing 30th European Conference on Parallel and Distributed Processing, Madrid, Spain, August 26–30, 2024, Proceedings, Part III
The three-volume set LNCS 14801, 14802, and 14803 constitutes the proceedings of the 30th European Conference on Parallel and Distributed Processing,...
-
Towards Smarter Schedulers: Molding Jobs into the Right Shape via Monitoring and Modeling
High-performance computing is not only a race towards the fastest supercomputers but also the science of using such massive machines productively to... -
Exploring Space-Time Trade-Off in Backtraces
The backtrace is one of the most common operations done by profiling and debugging tools. It consists in determining the nesting of functions leading... -
Tracking Memory Usage in OpenSHMEM Runtimes with the TAU Performance System
As the exascale era approaches, it is becoming increasingly important that runtimes be able to scale to very large numbers of processing elements.... -
Unifying the Analysis of Performance Event Streams at the Consumer Interface Level
Several instrumentation interfaces have been developed for parallel programs to make observable actions that take place during execution and to make... -
Performance Analysis of OpenSHMEM Applications with TAU Commander
The TAU Performance System® (TAU) is a powerful and highly versatile profiling and tracing tool ecosystem for performance engineering of parallel... -
Gleaming the Cube: Online Performance Analysis and Visualization Using MALP
Multi-Application onLine Profiling (MALP) is a performance tool which has been developed as an alternative to the trace-based approach for... -
Profiling Production OpenSHMEM Applications
Developing high performance OpenSHMEM applications routinely involves gaining a deeper understanding of software execution, yet there are numerous... -
Profiling Non-numeric OpenSHMEM Applications with the TAU Performance System
The recent development of a unified SHMEM framework, OpenSHMEM, has enabled further study in the porting and scaling of applications that can benefit... -
Integrated Measurement for Cross-Platform OpenMP Performance Analysis
The ability to measure the performance of OpenMP programs portably across shared memory platforms and across OpenMP compilers is a challenge due to... -
Score-P: A Joint Performance Measurement Run-Time Infrastructure for Periscope, Scalasca, TAU, and Vampir
This paper gives an overview about the Score-P performance measurement infrastructure which is being jointly developed by leading HPC performance... -
Advances in the TAU Performance System
Evolution and growth of parallel systems requires continued advances in the tools to measure, characterize, and understand parallel performance. Five... -
Hands-on Practical Hybrid Parallel Application Performance Engineering
This tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on the Score-P community instrumentation and... -
Improving the Scalability of Performance Evaluation Tools
Performance evaluation tools play an important role in helping understand application performance, diagnose performance problems and guide tuning... -
An Approach to Creating Performance Visualizations in a Parallel Profile Analysis Tool
With increases in the scale of parallelism the dimensionality and complexity of parallel performance measurements has placed greater challenges on... -
-
Score-P: A Unified Performance Measurement System for Petascale Applications
The rapidly growing number of cores on modern supercomputers imposes scalability demands not only on applications but also on the software tools...