> a real-time router that quickly decides which model to use based on conversati...

hirako2000 · 2025-08-07T20:11:31 1754597491

Consider it a low level routing. Keeping in mind it allows the other non active parts to not be in memory. Mistral afaik came up with this concept, quite a while back.

ItsHarper · 2025-08-08T02:54:40 1754621680

It's actually just a high-level routing between the reasoning and non-reasoning models that only applies to ChatGPT.

ItsHarper · 2025-08-08T02:53:40 1754621620

That only applies to ChatGPT. The API has direct access to specific models.

willy_k · 2025-08-08T21:22:57 1754688177

The router seems to only apply to the ChatGPT version, not the API, so it’s not really anything new. Gemini already has functionally dynamic reasoning effort.