Google Cloud announced a set of infrastructure innovations for an AI-first future, aimed at improving AI infrastructure performance, ease of use, and cost efficiency for customers:

- Trillium, the sixth-generation TPU, delivers significant gains over TPU v5e: over 4x higher training performance, up to 3x higher inference throughput, and a 67% increase in energy efficiency.
- A3 Ultra VMs, powered by NVIDIA H200 Tensor Core GPUs, offer a significant leap over previous generations, with 2x the GPU-to-GPU networking bandwidth and up to 2x higher LLM inference performance.
- Hypercompute Cluster streamlines the management of AI accelerator clusters, enabling reliable and repeatable deployments via a single API call.
- C4A VMs, based on custom Axion processors, are now generally available, offering improved performance and efficiency for a wide range of general-purpose workloads.
- Titanium host offload and the Jupiter data center network have been enhanced for AI workloads; the new Titanium ML network adapter delivers 3.2 Tbps of GPU-to-GPU traffic over RoCE.
- Hyperdisk ML, now generally available, provides high-performance storage for AI and HPC workloads, with faster data load times and greater cost efficiency.