Maybe go all in on local, on-device, low-latency LLMs? They make their own hardware, so with enough focus and investment they could probably do it, and do it well.
8GB of RAM isn't great for this. The M chips are only known for AI because unified memory lets you get huge amounts of "VRAM", but if you're not putting absurd amounts of RAM on the phone (and they're not going to), that advantage largely goes away. It's not like the NPU is some world-beating AI accelerator.
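As a rough back-of-envelope sketch (hypothetical model sizes, weights only, ignoring KV cache, activations, and OS/app overhead), here's why 8GB is tight for on-device LLMs:

```python
# Rough back-of-envelope: memory needed just to hold quantized LLM weights.
# Numbers are illustrative assumptions, not any specific shipping model.

def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight memory in GB for a model of the given size."""
    bytes_total = params_billion * 1e9 * bits_per_weight / 8
    return bytes_total / 1e9

for params, bits in [(3, 4), (7, 4), (7, 8), (13, 4)]:
    print(f"{params}B params @ {bits}-bit: ~{weight_memory_gb(params, bits):.1f} GB")

# On an 8 GB phone, even a 7B model at 4-bit (~3.5 GB of weights) leaves
# little headroom once the OS and foreground apps take their share, whereas
# a Mac with 64+ GB of unified memory has room to spare.
```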
That seems like the opposite of what they were suggesting? Unless by “edge compute” you meant users’ devices, but I assume you intended the usual meaning of CDN edges.