People generally can't maintain the discipline of timing when they pass on tickets so as to hide changes in their output, unless it's forced by constant anxiety.
Developers are also not very good at estimating how long something is supposed to take. If there had been even a 10% jump in profitability in the software department, it would have been obvious to the bean counters and managers. You'd also see a massive recruitment spree, because large organisations ramp up activities that make money in the short term.
"The smallest deployment unit for Kimi-K2 FP8 weights with 128k seqlen on mainstream H200 or H20 platform is a cluster with 16 GPUs with either Tensor Parallel (TP) or "data parallel + expert parallel" (DP+EP)."
16 GPUs costing ~$30k each. No one is running a ~$500k server at home.
For most people, before it makes sense to just buy all the hardware yourself, you should probably be renting GPUs by the hour from the various providers serving that need. On Modal, I think it should cost about $72/hr to serve Kimi K2: https://modal.com/pricing
Once that's running, it can serve the needs of many users/clients simultaneously. It'd be too expensive and underutilized for almost any individual to use regularly, but it's not unreasonable to spin it up in short intervals just to play around with it. And in a lot of cases it might actually be reasonable for a small number of students or coworkers to share a $70/hr deployment for ~40hr/week; in other cases, that $70/hr expense could be shared across a large number of coworkers or product users if they each use it somewhat infrequently.
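Back-of-envelope on the sharing math (the per-GPU rate below is an assumption based on typical on-demand H100-class pricing, not an actual Modal quote, so swap in your provider's numbers):

    # Rough cost sketch for renting a 16-GPU deployment by the hour.
    gpus = 16
    rate_per_gpu_hr = 4.50                   # assumed on-demand $/GPU-hr
    cluster_rate = gpus * rate_per_gpu_hr    # ~$72/hr for the whole cluster

    hours_per_week = 40
    weekly_cost = cluster_rate * hours_per_week   # ~$2,880/week

    for team_size in (5, 20, 100):
        per_user = weekly_cost / team_size
        print(f"{team_size} people sharing: ~${per_user:,.0f} per person per week")

At 5 people sharing it's roughly the price of a serious workstation every month or two; at 100 it's a rounding error.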
So maybe you won't host it at home, but it's actually quite feasible to self-host, and is it ever really worth physically hosting anything at home except as a hobby?
How does multi-user serving work, and how many users could it handle concurrently? My only experience is running much smaller models, and they easily peg my GPU at ~90 tokens/s. So maybe I could run 5-10 users at <10 t/s? Does software like llama.cpp or ollama handle this?
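The naive math I'm doing just splits single-stream throughput evenly across users; as I understand it, real servers batch concurrent requests and get more aggregate tokens/s out of the GPU, so treat this as a floor:

    # Pessimistic sketch: divide one GPU's single-stream decode rate evenly.
    # Batched serving usually does better than this, not worse.
    single_stream_tps = 90   # tokens/s observed with one request on a small model
    for users in (1, 5, 10):
        print(f"{users} concurrent users -> ~{single_stream_tps / users:.0f} tok/s each (no batching)")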
I think what GP means is that because the (hopefully) pending OpenAI release is also "too big to run at home", these two models may be close enough in size that they seem more directly comparable, meaning that it's even more important for OpenAI to outperform Kimi K2 on some key benchmarks.
This is a dumb question I know, but how expensive is model distillation? How much training hardware do you need to take something like this and create a 7B and 12B version for consumer hardware?
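From what I understand, the loss itself is the easy part; the expense is running the big teacher in inference over a large corpus and then training the 7B/12B student for a lot of GPU-hours. A minimal sketch of the usual logit-distillation objective (PyTorch; the temperature, mixing weight, and toy shapes are placeholders, not anything from an actual published recipe):

    import torch
    import torch.nn.functional as F

    def distill_loss(student_logits, teacher_logits, targets, T=2.0, alpha=0.5):
        # Soft term: student matches the teacher's temperature-softened distribution.
        soft = F.kl_div(
            F.log_softmax(student_logits / T, dim=-1),
            F.softmax(teacher_logits / T, dim=-1),
            reduction="batchmean",
        ) * (T * T)
        # Hard term: ordinary next-token cross-entropy against the real labels.
        hard = F.cross_entropy(student_logits, targets)
        return alpha * soft + (1 - alpha) * hard

    # Toy shapes: a batch of 4 token positions over a 32k-entry vocabulary.
    student = torch.randn(4, 32000)
    teacher = torch.randn(4, 32000)
    labels = torch.randint(0, 32000, (4,))
    print(distill_loss(student, teacher, labels))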
An on-premise, open-source Chinese model for my business, or a closed-source American model from a company that's a defense contractor. Shouldn't be too difficult a decision to make.
Even if they provide the code/data and not just the weights, aren't you taking their word for it that the weights were trained using that code, and not modified? Or is there some way to verify that?
Agreed it looks like slop, and it's IMO a bad sign. I think a big part of the appeal of old computers is the fact that they're simple enough for a single human to completely understand.
Generative AI is a black box that's impossible to completely understand. Using generative AI signals to me that whoever did it probably doesn't find any inherent value in understanding things, and sees understanding only as a means to an end. Old computers have little practical use, so this leaves nostalgia as the main appeal, and nostalgia has less stringent requirements.
I expressed that poorly. Just 'boring' alone doesn't warrant a flag.
There's a subjective element.
As an example of something I would flag: a one sentence 'hamas supporter!' or 'genocide denier!' accusation in reply to someone's thoughtful comment. If the same sentiment were expressed in a more original way, I might upvote.
Edit: In regard to news stories, sometimes a story breaks and the main and 'new' pages wind up with a dozen links to it. At some point, I might flag that. I'm not sure if that's kosher, but there's little purpose in having users wade through identical articles. Maybe @tomhow or @dang can set me straight if they happen to read this.
Hard disagree. Of course technically they didn't do anything explicitly against the public guidance (the checks and balances would never let them), but naming a model with a date very strongly implies immutability.
It's the same logic as why UB in C/C++ isn't a license for the compiler to do whatever it wants. We're humans, and we operate on implications, common-sense assumptions, and trust.
"At Preview, products or features are ready for testing by customers. Preview offerings are often publicly announced, but are not necessarily feature-complete, and no SLAs or technical support commitments are provided for these. Unless stated otherwise by Google, Preview offerings are intended for use in test environments only. The average Preview stage lasts about six months."
There hasn't been a non-preview Gemini since...November? The previews follow the same release cadence as everyone else's releases; "preview" is just a magic wand that means the Launchcal (Google's internal signoff tool, i.e. "Wave will never happen again") needs fewer signoffs. Then it got to the point where date-pinned models were getting swapped in, in the name of doing us a favor, and that's a...novel idea, we can both agree on that at the least.
I bet someone at Google would be a bit surprised to see someone jumping to legalese to act like this...novelty...is inherently due to the preview status, and is based on anything more than a sense that there's no net harm done to us if it costs the same and is better.
I'm not sure they're wrong.
But it also leads to a sort of "nobody knows how anything works because we have 2^N configs and 5 bits" situation - for instance, 05-06 was also upgraded to 06-05. Except it wasn't: if you sent variable thinking to 05-06 after the upgrade, it'd fail. (And don't get me started on the 5 different thinking configurations for Gemini 2.5 Flash thinking vs. Gemini 05-06 vs. 06-05 and 0 thinking.)
So you don't have anything to contribute beyond, and aren't interested in anything beyond, citing the terms?
Why are you in the comments section of an engineering news site?
(note: beyond your, excuse me while I'm direct now, boorish know-nothing reply, the terms you are citing have nothing to do with the thing people are actually discussing around you, despite your best efforts. It doesn't say "we might swap in a new service, congrats!", nor does it have anything to say about that. Your legalese at most describes why they'd pull 05-06, not forward 05-06 to 06-05. This is a novel idea.)
This case was simply a matter of people not understanding the terms of service. There is nothing more to be said. It's that simple. The "engineers" should know that before deploying to prod. Basic competence.
And I mean I genuinely do not understand what you are trying to say. Couldn't parse it.
> And I mean I genuinely do not understand what you are trying to say. Couldn't parse it.
It’s always worth considering that this may be your problem. If you still don’t get it, the only valuable reply is one which asks a question. Also, including “it’s not that complicated” only serves to inflame.
John, do you understand that the thing you're quoting says "We reserve the right to pull things", not "We reserve the right to swap in a new service"?
Do you understand that even if it did say that, that wasn't true either? It was some weird undocumentable half-beast?
I have exactly your attitude about their cavalier use of preview for all things Gemini, and even people's use of the preview models.
But I've also been on this site for 15 years and am a bit wow'd by your interlocution style here -- it's quite rare to see someone flip "the 3P provider swapped the service on us!" into "well, they said they could turn it off, of course you should expect it to be swapped for the first time ever!" [insert dull sneer about the quality of other engineers].
Well, no. Well, sure. You're done, but we're not going in circles. It'd just do too much damage to you to have to answer the simple question "Where does the legalese say they can swap in a new service?", so you have to pretend this is circular and just all-so-confusing; de facto, we have to pretend it's confusing and/or obviously wrong to use any Gemini 2+ model at all.
It's a cute argument; as I noted, I'm even emotionally sympathetic to it, it's my favorite "get off my lawn." However, I've also been on the Internet long enough to know you write back, at length, when people try anti-intellectualism and why-are-we-even-talking-about-this as a mode of interaction.
"b. Disclaimer. PRE-GA OFFERINGS ARE PROVIDED “AS IS” WITHOUT ANY EXPRESS OR IMPLIED WARRANTIES OR REPRESENTATIONS OF ANY KIND. Pre-GA Offerings (i) may be changed, suspended or discontinued at any time without prior notice to Customer and (ii) are not covered by any SLA or Google indemnity. Except as otherwise expressly indicated in a written notice or Google documentation, (A) Pre-GA Offerings are not covered by TSS, and (B) the Data Location Section above will not apply to Pre-GA Offerings."
Been here for 15 years, and there are standards for interaction, especially for 19-day-old accounts. I recommend other sites if you're expecting to be dismissive and rude without strong intellectual pushback.
There's a very large gulf between "what makes sense to Google" and "what makes sense to Human Beings". I have so many rants about Google's poor treatment of "customers" that they feel like Oracle to me now. Like every time I use them, I'm really just falling prey to my own misguided idea that this time I won't get screwed over.
The users aren't random "human beings" in this case. They are professional software developers who are expected to understand the basics. Deploying that model into production shows a lack of basic competence. It is clearly marked "preview" and is for test only.
That may be true, but it doesn't make the customer's claims untrue. What Google did was counter-intuitive. That's a fact. Pointing at some fine print and saying "uhh actually, technically your stupid human brain is the problem, not us! we're technically allowed to do anything we want, just look at the fine print!!" does not make things better. We are human beings; we are flawed. That much should be obvious to any human organization. If you don't know how to make things that don't piss off human beings, the problem isn't with the humans.
If the "preview release" you were using was v0.3, and suddenly it started being v0.6 without warning, that would be insane. The only point of providing a version number is to give people an indicator of consistency. The datestamp is a version number. If they didn't want us to expect consistency, they should not have given it a version number. That's the whole point of rolling release branches, they have no version. You don't have "v2.0" of a rolling release, you just have "latest". They fucked up by giving it a datestamp.
This is an extremely old and well-known problem with software interfaces. Either you version it or you don't. If you do version it, and change it, you change the version, and give people dependent on the old version some time to upgrade. Otherwise it breaks things, and that pisses people off. The alternative is not versioning it, which is a signal that there is no consistency to be expected. Any decent software developer should have known all this.
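To put the distinction in code terms (purely hypothetical identifiers, not Google's actual model names or API):

    # A date-stamped identifier reads as a pin: same string, same behavior.
    PINNED_MODEL = "example-model-2025-05-06"   # implied contract: immutable snapshot

    # A rolling alias makes no such promise; it explicitly means "whatever is newest".
    ROLLING_MODEL = "example-model-latest"      # implied contract: may change under you

    # Quietly changing what the date-stamped name resolves to breaks the first
    # contract, which is the whole complaint here.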
And while I'm at it: what's with the name flip-flopping? In 2014, GCP issued a press release explaining it was no longer using "Preview", but "Alpha" and "Beta" (https://cloudplatform.googleblog.com/2014/10/new-release-pha...). But the link you showed earlier says "Alpha" and "Beta" are now deprecated. No press release for that? I guess that's our bad for not constantly re-reading the fine print and expecting the terminology to revert to something from 11 years ago.