NVIDIA’s Ada Lovelace architecture offers 18,176 CUDA cores compared to the previous generation NVIDIA A40 with Ampere architecture with up to 10,752 CUDA cores. This GPU is designed to take advantage of the PCIe 4th Generation connection interface with a bi-directional transfer speed of 64GB/s, which is the double the 3rd Gen PCIe throughput. 142 3rd Generation Ray Tracing (RT) Cores deliver 2x to 3x the speed for Lovelace architecture cards compared to previous generations. Data science and AI model training has been improved with 568 4th Generation Tensor Cores providing up to 90.1 TF32 TFLOPS.
The L40 GPU supports 48GB of GDDR6 memory with Error Correction Code (ECC). It has a memory bandwidth of 864GB/s. Using NVIDIA RTX Virtual Workstation (RTX vWS) virtual GPU (vGPU) software it is possible to allocate memory to multiple users across different teams such as creative, data science, and design.
Maximum power consumption on this card is rated at 300W and uses a 16-pin power connector. The NVIDIA L40 GPU features bi-directional passive cooling and depends on the cooling provided by the host server chassis to maintain operational temperatures.
Offering impressive performance for simulation, ray-tracing, AI modeling, and virtual production, the NVIDIA L40 GPU Accelerator is a great general-purpose GPU and ideal demanding datacenter workloads. 48GB of GGRD6 memory are available on this card, as well as 142x 3rd generation RT Cores, 18176x CUDA cores and 568x 4th generation Tensor cores.