O

Allinea MAP - MPI, OpenMP, C, C++ and F90 profiling

Software performance matters. Allinea MAP is a profiler that shows exactly where and why your code is losing performance.

More results, faster

We believe scientists and developers should be set free to spend their time and energy doing great science and code - not battling with arcane or unnecessarily complex tools.

Allinea MAP is our way of giving time back to you. Profiling your C++, C or F90 code on Linux is as simple as running "map -profile my_program.exe".

There are no extra steps and no complicated instrumentation.  MPI, OpenMP, pthreads or scalar codes can all be profiled with Allinea MAP.

Allinea MAP imbalance code

The graphical results are precise, straightforward to interpret and bottlenecks are shown directly in the source code.

Integration and a common interface with the debugger Allinea DDT in the Allinea Forge tool suite makes moving between tools a breeze, saving you time and energy throughout each stage of the development cycle.

Deepen your insight into the performance of your code

Existing performance tools can give a powerful view, when you run them and spend the time analyzing their output, but when was the last time you ran a profiler on your production code?

Allinea MAP imbalance code
We built Allinea MAP with less than 5% runtime overhead so you can run it every day, on every change, giving you fascinating and powerful insights into the performance of real codes under real conditions.

I/O profiling

As systems get larger, more and more codes are being affected by poor I/O performance. Often, this goes unnoticed or mislabelled as poor application scaling. Allinea MAP shows you exactly where your file I/O bandwidth is being used, helping to diagnose overloaded shared filesystems, poor read/write patterns and even system misconfiguration issues.

Profiling threads and OpenMP code

Getting performance from multithreaded code can be a challenge - but Allinea MAP makes it easy to see where thread synchronization is costing cycles and where threads are spending their time. 

With views of CPU core activity, and code profiling by actual per-core walltime, Allinea MAP is the thread profiler that threaded code has been waiting for.  Our OpenMP profiling blog explores how MAP profiles threads.

MPI Profiling

At the core of Allinea MAP is a scalable architecture that lets it profile hundreds, or thousands of parallel processes - such as those in HPC's main communication library, MPI.  It gathers performance data from each process, and merges the information to present why and where your MPI or multiprocess code is slow.  Allinea MAP works with almost every MPI implementation to make using it seamless, so that your users have the fastest and most pain free experience of profiling their MPI code.

Memory Profiling

Memory usage and whole node memory usage

As your application progresses, Allinea MAP can show you the real memory usage across all processes in the application and all compute nodes/servers. The memory usage helps you identify imbalance, or changes caused by phases in your application - - and MAP shows this alongside your source code.  The visible high-water mark of usage helps to track down applications that rely on 3rd party libraries which temporarily consume memory and push memory usage over the edge.  Applications that use increasing memory over time - memory leaks - can then be address with Allinea DDT's in-built memory debugging.

Additionally, the time spent in memory accesses is one of the key metrics profiled so that poor memory access patterns and cache use are found easily.

Accelerator Metrics

Allinea MAP supports the latest NVIDIA GPUs and helps you to understand the impact of GPUs and CPUs together on application performance. Profiling enables you to see how your CPU waits for GPU completion - and view GPU time in global memory access, GPU utilization and even GPU temperature.

Energy Profiling

Energy consumption and peak power usage is increasingly important for high-performance applications and their users.  With Allinea MAP's Energy Pack,developers can optimize for time and energy

The latest Sandy Bridge and above Intel processors are supported (including Haswell and Broadwell chips) - via their in-built Intel RAPL power measurement capability - for CPU power measurement.  GPU power measurement is available on any NVIDIA GPU with power monitoring support.  Node-level measurement is also available for systems supporting the Intel Energy Checker API or the Cray HSS energy counters (XK6 and XC30 and above).

Read more about our Energy Pack.

Compare the performance of different clusters and architectures

Allinea MAP can provide all its data in an open XML format, making it ideal for post-processing to characterize and compare the performance of key codes on different hardware platforms.

Allinea MAP imbalance code

Even without access to the original source code, Allinea MAP tracks and reports on CPU, memory and MPI performance metrics over time, giving you everything you need to evaluate and compare potential new platforms.

Download a trial

Free up support staff time to solve key challenges

HPC consultants and support staff have a deep understanding of performance and optimization tools. Yet again and again they tell us much of their time is spent diagnosing the same basic mistakes new MPI programmers make over and over.

Allinea MAP imbalance code

We designed Allinea MAP so that new MPI developers can see the cause of common performance problems at once, freeing up experts to dive deeper into complex and leadership-class optimization problems.

Download a trial