The raw computation is just a bunch of matrix multiplications in a row; most of the algorithmic complexity/secret stuff would be around scaling & efficiency.
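To be concrete, here's a toy JAX sketch of what "matrix multiplications in a row" means, just a plain feed-forward stack. Obviously nothing like the actual model; the layer count, dimensions, and gelu are placeholders:

```python
# Toy sketch of "a bunch of matmuls in a row" -- not the real architecture,
# just the general shape of a stacked feed-forward computation in JAX.
import jax
import jax.numpy as jnp

def forward(x, weights):
    # Each layer is just x @ W followed by a cheap nonlinearity.
    for w in weights:
        x = jax.nn.gelu(x @ w)
    return x

key = jax.random.PRNGKey(0)
d = 512  # placeholder hidden size
weights = [jax.random.normal(jax.random.fold_in(key, i), (d, d)) * 0.02
           for i in range(4)]  # placeholder: 4 layers
x = jax.random.normal(key, (8, d))  # batch of 8 token embeddings
print(forward(x, weights).shape)  # (8, 512)
```

The interesting IP isn't in that kind of code; it's in how you shard, schedule, and serve it efficiently.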
For training the model the HW matters much more, since you need to scale up to as many chips as possible without being bottlenecked by the network.
This would just be inference, and it doesn't need to be very efficient since it's for on-prem usage, not selling API access. So you could strip out any efficiency secrets, and it would probably look like a bigger Gemma (their open-source model).
I wonder if they would/could try to strip out stuff like whatever tricks they use for long context + video support (both of which they are a bit ahead of everyone else on).
The model itself is likely built on their own open-source framework JAX, so it should be usable on Nvidia hardware. Of course, cost efficiency is going to be a different story.
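That portability is basically what JAX/XLA gives you for free: the same jitted code runs on whatever backend is present. A generic example (nothing Gemini-specific here):

```python
# Generic JAX example: the same jitted function compiles via XLA to whatever
# backend is available (TPU, Nvidia GPU, or CPU).
import jax
import jax.numpy as jnp

print(jax.devices())  # e.g. CUDA devices on an Nvidia box, TPU devices on a TPU host

@jax.jit
def layer(x, w):
    return jnp.dot(x, w)

x = jnp.ones((8, 1024))
w = jnp.ones((1024, 1024))
print(layer(x, w).shape)  # (8, 1024) on any backend; cost per token is what differs
```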