16.1.1 release update includes:
- Fix for the known incompatibility issue with the CPU Kernel Debugger from the Intel® SDK for OpenCL™ Applications 2016 R2 and the CPU only runtime package version 16.1.
- Performance optimizations:
- Compiler vectorizer heuristic tuning for a set of workloads
- Workgroup fusion optimization improvements
- Performance enhancements of the vload()/vstore() built-in functions
- Fix for the issue reported on the forum (https://software.intel.com/en-us/comment/1844607#comment-1844607): vectorizer produces incorrect code on SSE42 architectures when using the samplerless read_imagef() built-in function with image2d_t and int2 coordinates as arguments.
- cl_khr_gl_sharing extension was disabled due to incompatibility with the Microsoft* Basic Display Adapter. To use this extension, please install OpenCL Driver for Intel® Iris™ Graphics and HD Graphics for Windows* OS from https://software.intel.com/en-us/articles/opencl-drivers#iris. The driver package includes the OpenCL Runtime package for CPUs.
- Due to performance bug Threading Building Blocks (TBB) library was downgraded from 4.2,Interface version 7001, Oct 2 2013" to 4.2, Interface version 7005 , Jun 1 2014