I want to see more models that can be streamed to a browser and run locally via ...

firejake308 · 2025-05-24T16:05:51 1748102751

After experimenting with 1B models, I am starting to think that any model with 1B parameters or less will probably lack a lot of the general intelligence that we observe in the frontier models, because it seems physically impossible to encode that much information into so few parameters. I believe that in the range of very small models, the winner will be models that are fine tuned to a small range of tasks or domains, such as a model that can translate between English and any other language, or a legal summarization model, etc.

vindex10 · 2025-05-24T16:59:08 1748105948

Have you heard of Transformers.js? They are running onnx inside browser:

https://huggingface.co/docs/transformers.js/en/index

relaxing · 2025-05-24T16:10:31 1748103031

Why? Just so user data stays local?

dainiusse · 2025-05-24T17:02:00 1748106120

Yes. And also, cost to run it.

relaxing · 2025-05-25T12:15:23 1748175323

Sounds like a case where egress could easily cost more than compute.