Mountain View, California, USA - March 28, 2018: Google sign at Google's headquarters in Silicon Valley. Google is an American technology company.

Google Expands Cost-Effective AI-Optimised Infrastructure Portfolio For Customers

Bijay Pokharel, August 30, 2023 0 2 min read

Google on Wednesday expanded its artificial intelligence (AI)-optimized infrastructure portfolio that is both cost-effective and scalable for its Cloud customers.

The company is expanding its AI-optimised infrastructure portfolio with ‘Cloud TPU v5e’, the most cost-efficient, versatile, and scalable Cloud TPU to date, which is also now available in preview.

“Cloud TPU v5e is purpose-built to bring the cost-efficiency and performance required for medium- and large-scale training and inference. TPU v5e delivers up to 2x higher training performance per dollar and up to 2.5x inference performance per dollar for LLMs and gen AI models compared to Cloud TPU v4,” Google said in a blog post.

According to the company, TPU v5e is also incredibly versatile, with support for eight different virtual machine (VM) configurations, ranging from one chip to more than 250 chips within a single slice, allowing customers to choose the right configurations to serve a wide range of LLM and gen AI model sizes.

Cloud TPU v5e also provides built-in support for leading AI frameworks such as JAX, PyTorch, and TensorFlow, along with popular open-source tools like Hugging Face’s Transformers and Accelerate, PyTorch Lightning, and Ray.

Moreover, the tech giant announced that its A3 VMs, based on Nvidia H100 GPUs, delivered as a GPU Supercomputer, will be generally available next month to power customers large-scale AI models.

“Today, we’re thrilled to announce that A3 VMs will be generally available next month. Powered by Nvidia’s H100 Tensor Core GPUs, which feature the Transformer Engine to address trillion-parameter models, Nvidia’s H100 GPU, A3 VMs are purpose-built to train and serve especially demanding gen AI workloads and LLMs,” Google said.

The A3 VM features dual next-generation 4th Gen Intel Xeon scalable processors, eight Nvidia H100 GPUs per VM, and 2TB of host memory.

Built on the latest Nvidia HGX H100 platform, the A3 VM delivers 3.6 TB/s bisectional bandwidth between the eight GPUs via fourth-generation Nvidia NVLink technology.

IT NEWS & UPDATES

Instagram’s New Feature To Let Creators Highlight Comments In Stories

GOOGLE

Google Discontinues Its Pixel Pass Subscription For New Devices & Renewals

Bijay Pokharel

Bijay Pokharel is the creator and owner of Abijita.com. He is a freelance technology writer focusing on all things pertaining to Cyber Security. The topics he writes about include malware, vulnerabilities, exploits, internet defense, women's safety and privacy, as well as research and innovation in information security. He is a tech enthusiast, keen learner, rational and cool person in his professional activities and challenges.

Subscribe

Cybersecurity Newsletter

You have Successfully Subscribed!

Recent Posts

Microsoft’s Bing Hits 140 Million Daily User Milestone

Microsoft to Invest $1.7 Billion in Cloud, AI Infrastructure in Indonesia

Samsung Q1 Operating Profit Soars, Chip Business Back to Profit

FBI Warns of “Free Verification” Scam Targeting Dating App Users

Google Protects Users by Blocking Over 2.5 Million Harmful Apps in 2023

Microsoft Introduces Offline Mode for OneDrive on the Web

Advertisement

Subscribe

Cybersecurity Newsletter

You have Successfully Subscribed!

SIGN UP FOR NEWSLETTERS

Please confirm your email address.

Subscribe

Cybersecurity Newsletter

You have Successfully Subscribed!

Google Expands Cost-Effective AI-Optimised Infrastructure Portfolio For Customers

Bijay Pokharel

Related posts

Google Docs To Now Automatically Add Line Numbers

Google Launches Open-Source Software Bug Bounty Program

Google Ends Its ‘Chrome Cleanup Tool’ With Latest Update

YouTube Showing Election-Fraud Videos To Users Skeptical About 2020 US Polls

Android 8.1 Oreo New Features Roundup

Google Raises Nest Aware Subscription Prices Up To $30 Per Year

Leave a Reply Cancel reply

Recent Posts

Microsoft’s Bing Hits 140 Million Daily User Milestone

Microsoft to Invest $1.7 Billion in Cloud, AI Infrastructure in Indonesia

Samsung Q1 Operating Profit Soars, Chip Business Back to Profit

FBI Warns of “Free Verification” Scam Targeting Dating App Users

Google Protects Users by Blocking Over 2.5 Million Harmful Apps in 2023

Microsoft Introduces Offline Mode for OneDrive on the Web

Advertisement

Subscribe

Cybersecurity Newsletter

You have Successfully Subscribed!

SIGN UP FOR NEWSLETTERS

Please confirm your email address.