October 2014
The Tesla K20 is designed for double precision applications and the broader
supercomputing market. It delivers 3x the double precision performance of the
previous-generation Fermi-based Tesla M2090. The Tesla K20 features a single
GK110 Kepler GPU that includes Dynamic Parallelism and Hyper-Q. With more than
one teraflop of peak double precision performance, the Tesla K20 is ideal for a
wide range of high performance computing workloads, including climate and
weather modelling, CFD, CAE, computational physics, biochemistry simulations
and computational finance.
TESLA K20 - GPU Computing Accelerator Common Features
| Feature | Description |
|---|---|
| ECC Memory Error Protection | Meets a critical requirement for computing accuracy and reliability in datacenters and supercomputing centers. External DRAM is ECC protected in Tesla K10, and both external and internal memories are ECC protected in Tesla K20X. |
| System Monitoring Features | Integrates the GPU subsystem with the host system’s monitoring and management capabilities such as IPMI or OEM-proprietary tools. IT staff can thus manage the GPU processors in the computing system using widely used cluster/grid management solutions. |
| L1 and L2 caches | Accelerates algorithms such as physics solvers, ray-tracing, and sparse matrix multiplication where data addresses are not known beforehand. |
| Asynchronous Transfer with dual DMA engines | Turbocharges system performance by transferring data over the PCIe bus while the computing cores are crunching other data. |
| Flexible programming environment with broad support of programming languages and APIs | Choose OpenACC or the CUDA toolkits for C, C++, or Fortran to express application parallelism and take advantage of the innovative Kepler architecture. |
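The benefit of overlapping PCIe transfers with computation, as described for the dual DMA engines, can be estimated with a simple pipeline model. The sketch below is a back-of-the-envelope illustration, not a measurement of the Tesla K20. The function names, the chunk count, and the per-chunk timings are all hypothetical. It assumes the work is split into equal chunks and that upload, compute, and download can each proceed concurrently on separate resources.

```python
def serial_time(n_chunks, h2d, kernel, d2h):
    """Total time when every chunk is uploaded, computed, and downloaded in turn."""
    return n_chunks * (h2d + kernel + d2h)


def overlapped_time(n_chunks, h2d, kernel, d2h):
    """Total time for an ideal three-stage pipeline of identical chunks.

    The first chunk passes through all three stages; each further chunk
    adds only the duration of the slowest stage (the bottleneck).
    """
    stages = (h2d, kernel, d2h)
    return sum(stages) + (n_chunks - 1) * max(stages)


if __name__ == "__main__":
    # Hypothetical per-chunk timings in milliseconds (illustrative only).
    n, h2d, kernel, d2h = 8, 2.0, 5.0, 2.0
    t_serial = serial_time(n, h2d, kernel, d2h)
    t_overlap = overlapped_time(n, h2d, kernel, d2h)
    print(f"serial:     {t_serial:.1f} ms")
    print(f"overlapped: {t_overlap:.1f} ms")
    print(f"speedup:    {t_serial / t_overlap:.2f}x")
```

In this model the gain is largest when transfer and compute times are comparable; when one stage dominates, the overlapped time approaches the time of that stage alone.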
Specifications:
| Specification | Description |
|---|---|
| GPU Chips | GK110 |
| CUDA Cores | 2496 |
| Single Precision Performance | 3.52 TFLOPS |
| Double Precision Performance | 1.17 TFLOPS |
| Memory Size | 5 GB GDDR5 |
| Memory Bandwidth (ECC off) | 208 GB/s |
| Memory I/O | 320-bit |
| Memory Clock | 2600 MHz |
| SMX Units | 13 |
| Architecture Features | SMX, Dynamic Parallelism, Hyper-Q |
| System Interface | PCI Express 2.0 x16 |
| Energy Star Enabling | Yes |
| Compute Capability | 3.5 |
| Semiconductor Manufacturing Size | 28 nm |
| Wattage (TDP) | 225 W |
| GPU Applications | Reservoir simulation; CAE (structural analysis); molecular dynamics, numerical analytics; computational visualization (ray tracing) |
| Thermal Solution | Ultra-quiet Active Fansink |
| Form Factor | 110 mm (H) × 265 mm (L), Dual Slot, Full-Height |
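Several of the listed figures can be cross-checked against one another. The sketch below is a rough consistency check, not an official derivation. It rests on two assumptions that the table does not state: each CUDA core is counted as 2 floating-point operations per clock (one fused multiply-add), and memory transfers twice per listed memory clock. The function and constant names are hypothetical.

```python
# Figures copied from the specification table.
CUDA_CORES = 2496
SP_TFLOPS = 3.52
DP_TFLOPS = 1.17
MEMORY_BUS_BITS = 320
MEMORY_CLOCK_MHZ = 2600

# Assumptions (not stated in the table).
FLOPS_PER_CORE_PER_CLOCK = 2  # one fused multiply-add counted as 2 FLOPs
TRANSFERS_PER_CLOCK = 2       # data moves twice per listed memory clock


def memory_bandwidth_gb_per_s():
    """Peak bandwidth in GB/s from bus width and memory clock."""
    bytes_per_transfer = MEMORY_BUS_BITS / 8
    # bytes * MHz gives MB/s; divide by 1000 for GB/s.
    return bytes_per_transfer * MEMORY_CLOCK_MHZ * TRANSFERS_PER_CLOCK / 1000


def implied_core_clock_mhz():
    """Core clock that the listed single precision peak would imply."""
    flops_per_clock = CUDA_CORES * FLOPS_PER_CORE_PER_CLOCK
    return SP_TFLOPS * 1e12 / flops_per_clock / 1e6


def dp_to_sp_ratio():
    """Ratio of double to single precision peak throughput."""
    return DP_TFLOPS / SP_TFLOPS


if __name__ == "__main__":
    print(f"memory bandwidth:   {memory_bandwidth_gb_per_s():.0f} GB/s")
    print(f"implied core clock: {implied_core_clock_mhz():.0f} MHz")
    print(f"DP/SP ratio:        {dp_to_sp_ratio():.3f}")
```

Under these assumptions the bus width and memory clock reproduce the listed 208 GB/s, the single precision figure implies a core clock of roughly 705 MHz, and double precision throughput comes out at about one third of single precision.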