Revision for “In-Depth Comparison of NVIDIA Tesla Kepler GPU Accelerators” created on March 9, 2022 @ 12:04:02
In-Depth Comparison of NVIDIA Tesla Kepler GPU Accelerators
|
<em>This article provides in-depth details of the NVIDIA Tesla K-series GPU accelerators (codenamed "Kepler"). "Kepler" GPUs improve upon the previous-generation "Fermi" architecture.
For more information on other Tesla GPU architectures, please refer to:</em> <h2>Important changes available in the "Kepler" GPU architecture include:</h2> <h2>"Kepler" Tesla GPU Specifications</h2> "Currently-shipping
<table> <thead> <tr> <th>Feature</th> <th>Tesla K80</th> <th>Tesla K40</th> </tr> </thead> <tbody> <tr><td class="rowhead">GPU Chip(s)</td><td>2x Kepler GK210</td><td>Kepler GK110b</td></tr> <tr><td class="rowhead">Peak Single Precision (base clocks)</td><td>5.60 TFLOPS (both GPUs combined)</td><td>4.29 TFLOPS</td></tr> <tr><td class="rowhead">Peak Double Precision (base clocks)</td><td>1.87 TFLOPS (both GPUs combined)</td><td>1.43 TFLOPS</td></tr> <tr><td class="rowhead">Peak Single Precision (GPU Boost)</td><td>8.73 TFLOPS (both GPUs combined)</td><td>5.04 TFLOPS</td></tr> <tr><td class="rowhead">Peak Double Precision (GPU Boost)</td><td>2.91 TFLOPS (both GPUs combined)</td><td>1.68 TFLOPS</td></tr> <tr><td class="rowhead">Onboard GDDR5 Memory<sup>1</sup></td><td>24GB (12GB per GPU)</td><td>12 GB</td></tr> <tr><td class="rowhead">Memory Bandwidth<sup>1</sup></td><td>480 GB/s (240 GB/s per GPU)</td><td>288 GB/s</td></tr> <tr><td class="rowhead">PCI-Express Generation</td><td colspan=2>3.0</td></tr> <tr><td class="rowhead">Achievable PCI-E transfer bandwidth</td><td>12 GB/s</td><td>12 GB/s</td></tr> <tr><td class="rowhead"># of SMX Units</td><td>26 (13 per GPU)</td><td>15</td></tr> <tr><td class="rowhead"># of CUDA Cores</td><td>4992 (2496 per GPU)</td><td>2880</td></tr> <tr><td class="rowhead">Memory Clock</td><td>2500 MHz</td><td>3004 MHz</td></tr> <tr><td class="rowhead">GPU Base Clock</td><td>560 MHz</td><td>745 MHz</td></tr> <tr><td class="rowhead">GPU Boost Support</td><td>Yes – Dynamic</td><td>Yes – Static</td></tr> <tr><td class="rowhead">GPU Boost Clocks</td><td>23 levels between 562 MHz and 875 MHz</td><td>810 MHz<br />875 MHz</td></tr> <tr><td class="rowhead">Architecture features</td><td colspan=2>SMX, Dynamic Parallelism, Hyper-Q</td></tr> <tr><td class="rowhead">Compute Capability</td><td>3.7</td><td>3.5</td></tr> <tr><td class="rowhead">Workstation Support</td><td>-</td><td>Yes</td></tr> <tr><td class="rowhead">Server Support</td><td colspan=2>Yes</td></tr> <tr><td class="rowhead">Wattage (TDP)</td><td>300W (plus Zero Power Idle)</td><td>235W</td></tr> </tbody> </table> <em>1. Measured with ECC disabled. Memory capacity and performance are reduced with ECC enabled.</em> "Previous
The models listed below are still available for sale in certain scenarios, but are not generally recommended. They offer lower performance than Tesla K40 or K80 (and do not cost any less).
<table> <h2>Comparison between "Fermi" and "Kepler" GPU Architectures</h2> |