Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Consider it a low level routing. Keeping in mind it allows the other non active parts to not be in memory. Mistral afaik came up with this concept, quite a while back.


It's actually just a high-level routing between the reasoning and non-reasoning models that only applies to ChatGPT.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: