Contents
- Introduction
- Debug Solution for Intel® MIC
- Debugging Applications with Offload Extension
- Debugging Native Coprocessor Applications
Introduction
Intel® Xeon Phi™ coprocessor is a product based on the Intel® Many Integrated Core Architecture (Intel® MIC). Intel® offers a debug solution for this architecture that can debug applications running on an Intel® Xeon Phi™ coprocessor.
There are many reasons for the need of a debug solution for Intel® MIC. Some of the most important ones are the following:
- Developing native Intel® MIC applications is as easy as for IA-32 or Intel® 64 hosts. In most cases they just need to be cross-compiled (/Qmic).
Yet, Intel® MIC Architecture is different to host architecture. Those differences could unveil existing issues. Also, incorrect tuning for Intel® MIC could introduce new issues (e.g. alignment of data, can an application handle more than hundreds of threads?, efficient memory consumption?, etc.) - Developing offload enabled applications induces more complexity as host and coprocessor share workload.
- General lower level analysis, tracing execution paths, learning the instruction set of Intel® MIC Architecture, …
Debug Solution for Intel® MIC
For Windows* host, Intel offers a debug solution, the Intel® Debugger Extension for Intel® MIC Architecture Applications. It supports debugging offload enabled application as well as native Intel® MIC applications running on the Intel® Xeon Phi™ coprocessor.
How to get it?
To obtain Intel’s debug solution for Intel® MIC Architecture on Windows* host, you need the following:
- Intel® Composer XE (for C/C++ or Fortran):
http://software.intel.com/en-us/intel-composer-xe
- Only Intel® Composer XE contains the debug solution for Windows* host. However, Intel® Manycore Platform Software Stack (Intel® MPSS) is also required for this to work:
http://software.intel.com/en-us/articles/intel-manycore-platform-software-stack-mpss
(currently 3.2.1 for Windows*)
Debug Solution as Integration
Debug solution from Intel® based on GNU* GDB 7.5:
- Full integration into Microsoft Visual Studio*, no command line version needed
- Available with Intel® Composer XE 2013 SP1 and later
Why integration into Microsoft Visual Studio*?
- Microsoft Visual Studio* is established IDE on Windows* host
- Integration reuses existing usability and features
- Fortran support added with Intel® Fortran Composer XE
Components Required
The following components are required to develop and debug for Intel® MIC Architecture:
- Intel® Xeon Phi™ coprocessor
- Windows* Server 2008 RC2, Windows* 7 or later
- Microsoft Visual Studio* 2012 or later
Support for Microsoft Visual Studio* 2013 was added with Intel® Composer XE 2013 SP1 Update 1. - Intel® MPSS 3.1 or later
- C/C++ development:
Intel® C++ Composer XE 2013 SP1 for Windows* or later - Fortran development:
Intel® Fortran Composer XE 2013 SP1 for Windows* or later
Configure & Test
It is crucial to make sure that the coprocessor setup is correctly working. Otherwise the debugger might not be fully functional.
Setup Intel® MPSS:
- Follow Intel® MPSS readme-windows.pdf for setup
- Verify that the Intel® Xeon Phi™ coprocessor is running
Before debugging applications with offload extensions:
- Use official examples from:
C:\Program Files (x86)\Intel\Composer XE 2013 SP1\Samples\en_US - Verify that offloading code works
Prerequisite for Debugging
Debugger integration for Intel® MIC Architecture only works when debug information is being available:
- Compile in debug mode with at least the following option set:
/Zi (compiler) and /DEBUG (linker) - Optional: Unoptimized code (/Od) makes debugging easier
(due to removed/optimized away temporaries, etc.)
Applications can only be debugged in 64 bit
- Set platform to x64
- Verify that /MACHINE:x64 (linker) is set!
Debugging Applications with Offload Extension
Start Microsoft Visual Studio* IDE and open or create an Intel® Xeon Phi™ project with offload extensions. Examples can be found in the Samples directory of Intel® Composer XE, that is:
C:\Program Files (x86)\Intel\Composer XE 2013 SP1\Samples\en_US
- C++\mic_samples.zip or
- Fortran\mic_samples.zip
We’ll use intro_SampleC from the official C++ examples in the following.
Compile the project with Intel® C++/Fortran Compiler.
Characteristics of Debugging
- Set breakpoints in code (during or before debug session):
- In code mixed for host and coprocessor
- Debugger integration automatically dispatches between host/coprocessor
- Run control is the same as for native applications:
- Run/Continue
- Stop/Interrupt
- etc.
- Offloaded code stops execution (offloading thread) on host
- Offloaded code is executed on coprocessor in another thread
- IDE shows host/coprocessor information at the same time:
- Breakpoints
- Threads
- Processes/Modules
- etc.
- Multiple coprocessors are supported:
- Data shown is mixed:
Keep in mind the different processes and address spaces - No further configuration needed:
Debug as you go!
- Data shown is mixed:
Setting Breakpoints
Note the mixed breakpoints here:
The ones set in the normal code (not offloaded) apply to the host. Breakpoints on offloaded code apply to the respective coprocessor(s) only.
The Breakpoints window shows all breakpoints (host & coprocessor(s)).
Start Debugging
Start debugging as usual via menu (shown) or <F5> key:
While debugging, continue till you reach a set breakpoint in offloaded code to debug the coprocessor code.
Thread Information
Information of host and coprocessor(s) is mixed. In the example above, the threads window shows two processes with their threads. One process comes from the host, which does the offload. The other one is the process hosting and executing the offloaded code, one for each coprocessor.
Debugging Native Coprocessor Applications
Pre-Requisites
Create a native Intel® Xeon Phi™ application and transfer & execute the application to the coprocessor target:
- Use micnativeloadex.exe provided by Intel® MPSS for an application C:\Temp\mic-examples\bin\myApp, e.g.:
> "C:\Program Files\Intel\MPSS\sdk\coi\tools\micnativeloadex\micnativeloadex.exe""C:\Temp\mic-examples\bin\myApp" -d 0
- Option –d 0 specifies the first device (zero based) in case there are multiple coprocessors per system
- This application is executed directly after transfer
Using micnativeloadex.exe also takes care about dependencies (i.e. libraries) and transfers them, too.
Other ways to transfer and execute native applications are also possible (but more complex):
- SSH/SCP
- NFS
- FTP
- etc.
Debugging native applications with Start Visual Studio* IDE is only possible via Attach to Process…:
- micnativeloadex.exe has been used to transfer and execute the native application
- Make sure the application waits till attached, e.g. by:
static int lockit = 1; while(lockit) { sleep(1); }
- After having attached, set lockit to 0 and continue.
- No Visual Studio* solution/project is required.
Only one coprocessor at a time can be debugged this way.
Configuration
Open the options via TOOLS/Options… menu:
It tells the debugger extension where to find the binary and sources. This needs to be changed every time a different coprocessor native application is being debugged.
The entry solib-search-path directories works the same as for the analogous GNU* GDB command. It allows to map paths from the build system to the host system running the debugger.
The entry Host Cache Directory is used for caching symbol files. It can speed up lookup for big sized applications.
Attach
Open the options via TOOLS/Attach to Process… menu:
Specify the Intel(R) Debugger Extension for Intel(R) MIC Architecture. Set the IP and port the GDBServer is running on; the default port of the GDB-Server is 2000, so use that.
After a short delay the processes of the coprocessor card are listed. Select one to attach.
Note:
Checkbox Show processes from all users does not have a function for the coprocessor as user accounts cannot be mapped from host to target and vice versa (Linux* vs. Windows*).