Knights Landing is the codename for Intel's 2nd generation Intel® Xeon Phi™ Product Family, which will deliver massive thread parallelism, data parallelism and memory bandwidth – with improved single-thread performance and Intel® Xeon® processor binary-compatibility in a standard CPU form factor. Additionally, Knights Landing will offer integrated Intel® Omni-Path fabric technology, and also be available in the traditional PCIe* coprocessor form factor.
The following is a list of public disclosures that Intel has previously made about the forthcoming product:
PERFORMANCE |
3+ TeraFLOPS of double-precision peak theoretical performance per single socket node⁰ |
INTEGRATION |
Intel® Omni-Path fabric integration |
High-performance on-package memory (MCDRAM): |
  Over 5x STREAM vs. DDR4¹ ⇒ Over 400 GB/s |
  Up to 16GB at launch |
  NUMA support |
  Over 5x Energy Efficiency vs. GDDR5² |
  Over 3x Density vs. GDDR5² |
  In partnership with Micron Technology |
  Flexible memory modes including cache and flat |
SERVER PROCESSOR |
Standalone bootable processor (running host OS) and a PCIe coprocessor (PCIe end-point device) |
Platform memory capacity comparable to Intel® Xeon® Processors |
Reliability (“Intel server-class reliability”) |
Power Efficiency (Over 25% better than discrete coprocessor)⁴ ⇒ Over 10 GF/W |
Density (3+ KNL with fabric in 1U)⁵ |
MICROARCHITECTURE |
Based on Intel’s 14 nanometer manufacturing technology |
Binary compatible with Intel® Xeon® Processors⁶ |
Support for Intel® Advanced Vector Extensions 512 (Intel® AVX-512) |
3x Single-Thread Performance compared to Knights Corner⁷ |
Cache-coherency |
60+ cores in a 2D Mesh architecture |
“Based on Intel® Atom™ core (based on Silvermont microarchitecture) with many HPC enhancements”: |
  4 Threads / Core |
  Deep Out-of-Order Buffers |
  Gather/scatter in hardware |
  Advanced Branch Prediction |
  High cache bandwidth |
Most of today’s parallel optimizations carry forward to KNL |
Multiple NUMA domain support per socket |
ROADMAP |
Knights Hill is the codename for the 3rd generation of the Intel® Xeon Phi™ product family: |
  Based on Intel’s 10 nanometer manufacturing technology |
  Integrated 2nd generation Intel® Omni-Path Fabric |
AVAILABILITY |
First commercial HPC systems in 2H’15 |
Knights Corner to Knights Landing upgrade program available today |
MOMENTUM |
Cori Supercomputer at NERSC (National Energy Research Scientific Computing Center at LBNL/DOE) became the first publicly announced Knights Landing-based system, with over 9,300 nodes slated to be deployed in mid-2016 |
“Trinity” Supercomputer at NNSA (National Nuclear Security Administration), a $174 million deal awarded to Cray, will feature Haswell and Knights Landing, with acceptance phases in late 2015 and 2016. |
Expecting over 50 system providers for the KNL host processor, in addition to many more PCIe*-card based solutions. |
>100 Petaflops of committed customer deals to date |
DISCLAIMERS
*Other names and brands may be claimed as the property of others.
All products, computer systems, dates and figures specified are preliminary based on current expectations, and are subject to change without notice.
All projections are provided for informational purposes only. Any difference in system hardware or software design or configuration may affect actual performance.
0 Over 3 Teraflops of peak theoretical double-precision performance is preliminary and based on current expectations of cores, clock frequency and floating point operations per cycle.
1 Projected result based on internal Intel analysis of STREAM benchmark using a Knights Landing processor with 16GB of ultra high-bandwidth memory versus DDR4 memory with all channels populated.
2 Projected result based on internal Intel analysis comparison of 16GB of ultra high-bandwidth memory to 16GB of GDDR5 memory used in the Intel® Xeon Phi™ coprocessor 7120P.
3 Compared to 1st Generation Intel® Xeon Phi™ 7120P Coprocessor (formerly codenamed Knights Corner)
4 Projected result based on internal Intel analysis using estimated performance and power consumption of a rack sized deployment of Intel® Xeon® processors and Knights Landing coprocessors as compared to a rack with KNL processors only
5 Projected result based on internal Intel analysis comparing a discrete Knights Landing processor with integrated fabric to a discrete Intel fabric component card.
6 Binary compatible with Intel® Xeon® Processors v3 (Haswell) with the exception of Intel® TSX (Transactional Synchronization Extensions)
7 Projected peak theoretical single-thread performance relative to 1st Generation Intel® Xeon Phi™ Coprocessor 7120P
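
Footnote 0 ties the 3+ TeraFLOPS figure to three inputs: cores, clock frequency and floating point operations per cycle. The short Python sketch below only illustrates how those three inputs combine into a peak theoretical number. The core count of 60 is the lower end of the "60+ cores" disclosure. The 32 double-precision operations per cycle per core and the 1.6 GHz clock are illustrative assumptions, not figures from this disclosure list, and the function names are hypothetical.

```python
def peak_tflops(cores: int, clock_ghz: float, flops_per_cycle: int) -> float:
    """Peak theoretical TFLOPS = cores x clock (GHz) x FLOPs per cycle per core / 1000."""
    return cores * clock_ghz * flops_per_cycle / 1000.0


def min_clock_ghz(target_tflops: float, cores: int, flops_per_cycle: int) -> float:
    """Clock (GHz) required to reach a target peak with a given core count."""
    return target_tflops * 1000.0 / (cores * flops_per_cycle)


# ASSUMPTIONS for illustration only (not Intel disclosures):
ASSUMED_CORES = 60             # lower end of "60+ cores"
ASSUMED_FLOPS_PER_CYCLE = 32   # assumed double-precision FLOPs per cycle per core
ASSUMED_CLOCK_GHZ = 1.6        # hypothetical clock frequency

if __name__ == "__main__":
    peak = peak_tflops(ASSUMED_CORES, ASSUMED_CLOCK_GHZ, ASSUMED_FLOPS_PER_CYCLE)
    needed = min_clock_ghz(3.0, ASSUMED_CORES, ASSUMED_FLOPS_PER_CYCLE)
    print(f"Peak under these assumptions: {peak:.3f} TFLOPS")
    print(f"Clock needed for 3 TFLOPS: {needed:.4f} GHz")
```

Under these assumed inputs, any clock above roughly 1.56 GHz would put the product of the three terms over 3 TeraFLOPS. Actual core counts, frequencies and per-cycle throughput are subject to the disclaimers above.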