UpCloud, Vultr, Scaleway, Linode, DigitalOcean, etc. all include managed k8s now. I think the bar has moved up and it's pretty much a standard feature of even the bare-bones clouds.
It's not easy to move a community to a new platform, and very few people are using Lemmy currently. The thing about Reddit, despite its flaws, is that it gives you access to many large communities with a single account.
The fact that this runs on commodity hardware makes ggml extremely impressive and puts the tech in the hands of everyone. I recently reported my experience running a 7B model with llama.cpp on a 15-year-old Core 2 Quad [1] - when that machine came out it was a completely different world, and I certainly never imagined what AI would look like today. That was around when the first iPhone was released and everyone began talking about how smartphones would become the next big thing. We saw what happened 15 years later...
Today, with the new k-quants, users are reporting that 30B models run with 2-bit quantization on machines with 16GB of RAM or VRAM [2]. That puts these models within reach of millions of consumers, and the optimizations will only improve from here.
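As a rough sanity check on the 16GB claim, here's a back-of-the-envelope sketch. The effective bits-per-weight figures are my own assumptions for the ggml formats (k-quants carry some overhead for per-block scales and mins, so e.g. Q2_K lands closer to ~2.6 bits than 2), not exact numbers from the llama.cpp source:

    # Approximate weight memory for a 30B-parameter model at
    # different quantization levels. Bits/weight values are assumed,
    # including per-block scale/min overhead.
    PARAMS = 30e9

    formats = {
        "fp16": 16.0,  # unquantized half precision
        "q8_0": 8.5,   # 8-bit + per-block scale
        "q4_k": 4.5,   # 4-bit k-quant
        "q2_k": 2.6,   # 2-bit k-quant
    }

    for name, bits in formats.items():
        gib = PARAMS * bits / 8 / 2**30
        print(f"{name}: ~{gib:.1f} GiB of weights")

At ~2.6 bits/weight the weights alone come to roughly 9 GiB, which leaves headroom for the KV cache and scratch buffers on a 16GB machine, whereas 4-bit is already pushing the limit.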
I wonder if on-die RAM is less susceptible to memory errors?
I suspect that it is. Feels like less can go wrong. You have physically shorter interconnects, and the RAM is perhaps more of a known quantity relative to $SOME_RANDOM_MANUFACTURERS_DIMMS. But that is only a guess.
That said, I don't know if it's actually true - it's not necessarily any more resistant to random cosmic rays or whatever.