We have created a data analytics acceleration project on github, to help accelerate data analytics applications. We have placed the Intel® Data Analytics Acceleration Library (Intel® DAAL), the high performance analytics (for "Big Data") library for x86 and x86-64, into open source to create this project.
Intel DAAL helps accelerate big data analytics by providing highly optimized algorithmic building blocks for all data analysis stages (preprocessing, transformation, analysis, modeling, validation, and decision making) for batch, online and distributed processing modes of computation. It’s designed for use with popular data platforms including Hadoop*, Spark*, R, and Matlab* for highly efficient data access. Intel DAAL is available for Linux*, OS X* and Windows* and is licensed with the Apache 2.0 license. The DAAL project is available on github for download, feedback and contributions.
Intel DAAL has benefited from customer feedback since its initial release in 2015. Following a year of intense feedback and additional development as a full product, we are excited to introduce it as a very solid open source project ready for use and participation. Intel DAAL remains an integral part of Intel's software developer tools and is backed by Intel with support and future development investments.
WHERE
- Download the source or prebuilt binaries for Intel DAAL - github - and give feedback, and contribute!
- Getting starting information and training for Intel DAAL - software.intel.com/en-us/intel-daal-support/training
- Join the ongoing beta for Intel Parallel Studio 2017 - including Intel DAAL - tinyurl.com/ipsbeta2017
ACCELERATE DATA ANALYTICS
The Intel Data Analytics Acceleration Library (Intel DAAL) is a library delivering high performance machine learning and data analytics algorithms. Intel DAAL is an essential component of Intel’s overall machine learning solution including Intel® Xeon® Processor E7 Family, the Trusted Analytics Platform and Intel® Xeon Phi™ Processors (Knights Landing). Intel DAAL works with a wide selection of data platforms and programming languages including Hadoop, Spark, Python, Java and C++. Intel DAAL was first released in 2015 without source code to give us time to evolve some interfaces on our path to open sourcing this year. We appreciate the many users who have given feedback and encouraged us to get where we are today. Previous versions of Intel DAAL required separate installation of the Intel Math Kernel Library (Intel MKL) and Intel Integrated Performance Primitives (Intel IPP). The latest version of Intel DAAL actually comes with the necessary binary parts of Intel MKL (for BLAS and LAPACK) as well as Intel IPP (compression and decompression) so that the tremendous performance from these key routines are available automatically with no additional downloads needed! In order to make the most of multicore and many-core parallelism, and for superior threading interoperability, it is notable that the threading in Intel DAAL relies on the open source project known as "TBB" (Intel Threading Building Blocks).
EXPERIENCE PERFORMANCE
In the exciting and rapidly-evolving data analytics market, this key Intel performance library can really boost performance. At the Intel Developers Forum in 2015, Capital One discussed significant acceleration (over 200X - see slide 26) as an early user of Intel DAAL. We've seen numerous examples across many industries in the first year of product of substantial performance improvements using Intel DAAL - it is definitely worth a try!
Many more details about the product are available on the product page including some benchmarking data to share more related to the potential performance gains when using DAAL.
SPEEDING TOWARD 2017 - JOIN US!
DAAL is currently speeding toward a "2017" release (expected in late Q3 2016) in conjunction with Intel's award winning Intel Parallel Studio suite of developer tools. Precompiled binaries with installers are available for free as part of the beta program. Registration for the beta is available at tinyurl.com/ipsbeta2017.
The open source project feeds the product; there are no features held exclusively for the product version. The only difference when purchased is that Intel's Premier support is included for the entire product.
Support for all users of Intel DAAL is available online through the online Intel DAAL forum.