Menu Content/Inhalt
Home

Tesla K20 Print
October 2014
tesla_k40.jpg

 

Tesla K20 is designed for double precision applications and the broader supercomputing market, the Tesla K20 delivers 3x the double precision performance compared to the previous generation Fermi-based Tesla M2090. Tesla K20 features a single GK110 Kepler GPU that includes Dynamic parallelism and Hyper-Q. With more than one teraflop of peak double precision performance, the Tesla K20 is ideal for a wide range of high performance computing workloads including climate and weather modelling, CFD, CAE, computational physics, biochemistry simulations and computational finance.


TESLA K20 - GPU Computing Accelerator Common Features

ECC Memory Error Protection
Meets a critical requirement for computing accuracy and reliability in datacenters and supercomputing centers. External DRAM is ECC protected in Tesla K10 and both external and internal memories are ECC protected in Tesla K20X.
System Monitoring Features Integrates the GPU subsystem with the host system’s monitoring and management capabilities such as IPMI or OEM-proprietary tools. IT staff can thus manage the GPU processors in the computing system using widely used cluster/grid management solutions.
L1 and L2 caches Accelerates algorithms such as physics solvers, ray-tracing, and sparse matrix multiplication where data addresses are not known beforehand
Asynchronous Transfer with dual DMA engines Turbocharges system performance by transferring data over the PCIe bus while the computing cores are crunching other data.
Flexible programming environment with broad support of programming languages and APIs Choose OpenACC, CUDA toolkits for C, C++, or Fortran to express application parallelism and take advantage of the innovative Kepler architecture.

Specifications:

Specification Description
GPU Chips
 GK110
Cuda Cores
 2496
Single Precision Performance
 3.52 TFLPOS
Double Precision Performance  1.17 TFLPOS
Memory Size
 5GB GDDR5
Memory Bandwidth(ECC off)
 208 Gbytes/sec
Memory I/O
 320-bit
Memory Clock
 2600 MHz
SMX Units
 13
Architecture features  SMX, Dynamic Parallelism, Hyper-Q
System Interface  PCI Express 2.0 x16
 Energy Star Enabling
 Yes
Compute Capabilty
 3.5
Semiconductor Manufacturing Size  28nm
Wattage (TDP)  225
GPU Applications  > Reservoir simulation
 > CAE (structural analysis)
 > Molecular dynamics, Numerical analytics
 > Computational visualization (ray tracing) 
Thermal Solution  Ultra-quiet Active Fansink
Form Factor  110 mm (H) × 265 mm (L) Dual Slot, Full-Height