Instead of having my own computer be the one running Claude Code and executing tasks, I might prefer to offload that to my other homelab servers: they would run agents for me, working pretty much like traditional CI/CD, except with LLMs working on various tasks in Docker containers, each on the same or different codebases, each with its own branch/worktree, submitting pull/merge requests to a self-hosted Gitea/GitLab instance or whatever.
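A minimal sketch of what that dispatch could look like, assuming a hypothetical `agent-runner` image with Claude Code and git preinstalled (the image name, paths, and task wiring are all made up):

```bash
#!/usr/bin/env bash
# Hypothetical dispatcher: one throwaway container per task, each task
# isolated on its own git worktree and branch.
set -euo pipefail

REPO=/srv/repos/myproject   # made-up path to the host clone
TASK_ID=$1                  # e.g. "fix-flaky-tests"
PROMPT=$2                   # task description handed to the agent

# Give the task an isolated worktree on its own branch.
git -C "$REPO" worktree add "/srv/worktrees/$TASK_ID" -b "agent/$TASK_ID"

# Run Claude Code headless (-p, print mode) inside the container,
# passing the API key through from the host environment.
docker run --rm \
  -v "/srv/worktrees/$TASK_ID:/work" -w /work \
  -e ANTHROPIC_API_KEY \
  agent-runner \
  claude -p "$PROMPT" --dangerously-skip-permissions

# A real setup would then commit, push the branch, and open the PR/MR
# against the self-hosted forge.
```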
However, you're not really supposed to use it with your Claude Max subscription; instead you're meant to use an API key and pay per token (which doesn't seem nearly as affordable compared to the Max plan). Nobody would probably mind if I ran it on homelab servers, but if I put it on work servers for a bit, technically I'd be in breach of the rules:
> Unless previously approved, Anthropic does not allow third party developers to offer claude.ai login or rate limits for their products, including agents built on the Claude Agent SDK. Please use the API key authentication methods described in this document instead.
It just feels a tad more hacky than simply copying an API key the way you would when using the API directly. There is stuff like https://github.com/anthropics/claude-code/issues/21765, but also `claude setup-token` (which you probably don't want to lean on too much, given the token lifetime?)
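For reference, the two auth paths look roughly like this. The `CLAUDE_CODE_OAUTH_TOKEN` variable name comes from the Claude Code GitHub Actions setup; it's an assumption that the same variable works in other headless environments:

```bash
# Option 1: the sanctioned path, a pay-per-token API key from the console.
export ANTHROPIC_API_KEY=sk-ant-...
claude -p "run the test suite and summarize failures"

# Option 2: a long-lived token minted from your subscription login.
# `claude setup-token` prints it interactively; it does eventually
# expire, hence the lifetime caveat above.
export CLAUDE_CODE_OAUTH_TOKEN=...
claude -p "same task, billed against the subscription"
```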
Good idea! We're already showing the transaction price per share from the SEC filing. Are you thinking more along the lines of showing the current stock price alongside it, or maybe a price chart showing the stock's performance since the insider trade?
An anecdote: on one project, I use a skill + custom CLI, `/babysit-pr`, to assist in getting PRs through a sometimes long and winding CI process.
This includes regularly polling CI checks using `gh`. My skill/CLI are broken right now:
`gh pr checks 8174 --repo [repo] 2>&1`
Error: Exit code 1
Non-200 OK status code: 429 Too Many Requests
Body:
{
"message": "This endpoint is temporarily being throttled. Please try again later. For more on scraping GitHub and how it may affect your rights, please review our Terms of Service (https://docs.github.com/en/site-policy/github-terms/github-terms-of-service)",
"documentation_url": "https://docs.github.com/graphql/using-the-rest-api/rate-limits-for-the-rest-api",
"status": "429"
}
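One way to make the skill tolerate this is to wrap the `gh` call in a backoff loop instead of treating any non-zero exit as fatal. A rough sketch; the 429 detection is a plain grep over the error text, which is brittle by nature:

```bash
#!/usr/bin/env bash
# Retry `gh pr checks` with exponential backoff when GitHub throttles us.
pr=8174
repo="owner/repo"   # placeholder, as in the original command
delay=15

for attempt in 1 2 3 4 5; do
  if out=$(gh pr checks "$pr" --repo "$repo" 2>&1); then
    echo "$out"
    exit 0
  fi
  if echo "$out" | grep -q "429"; then
    echo "Throttled (attempt $attempt), sleeping ${delay}s..." >&2
    sleep "$delay"
    delay=$((delay * 2))
  else
    # Note: gh also exits non-zero when checks are failing or pending;
    # those cases land here and get surfaced immediately.
    echo "$out" >&2
    exit 1
  fi
done

echo "Still throttled after 5 attempts" >&2
exit 1
```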
1. Sit out and buy the tech you need from competitors.
2. Spend to the tune of ~$100B+ in infra and talent, with no guarantee that the effort will be successful.
Meta picked option 2, but Apple has always had great success with option 1 (the search partnership with Google, hardware partnerships with Samsung, etc.), so they are applying the same philosophy to AI as well. Their core competency is building consumer devices, and they are happy to outsource everything else.
This whole thread is about whether the most valuable startup of all time will be able to raise enough money to see the next calendar year.
It's definitely rational to decide to pay wholesale for LLMs given:
- consumer adoption is unclear. No vendor has yet shipped the "killer app" for OS integration.
- owning SOTA foundation models can put you into a situation where you need to spend $100B with no clear return. This money gets spent up front regardless of how much value consumers derive from the product, or if they even use it at all. This is a lot of money!
- as Apple has "missed" the last couple of years of the AI craze, there have been no meaningful ill effects on their business. Beyond the tech press, nobody cares yet.
I mean, they tried. They just tried and failed. It may work out for them, though — two years ago it looked like lift-off was likely, or at least possible, so having a frontier model was existential. Today it looks like you might be able to save many billions by being a fast follower. I wouldn’t be surprised if the lift-off narrative comes back around though; we still have maybe a decade until we really understand the best business model for LLMs and their siblings.
I think you are right. Their generative AI was clearly underwhelming. They have been losing many staff from their AI team.
I’m not sure it matters though. They just had a stonking quarter. iPhone sales are surging ahead. Their customers clearly don’t care about AI or Siri’s lacklustre performance.
> Their customers clearly don’t care about AI or Siri’s lacklustre performance.
I would rather say their products just didn’t lose value for not getting an improvement there. Everyone agrees that Siri sucks, but I’m pretty sure they tried to replace it with a natural-language version built from the ground up and realised it just didn’t work out yet. Yes, they have a bad but at least kinda-working voice assistant with lots of integrations into other apps. Replacing that with something that promises to do stuff and then does nothing, takes long to respond, and has fewer integrations due to the lack of keywords would have been a bad idea if the technology wasn’t there yet.
We do know that they made a number of promises on AI[1] and then had to roll them back because the results were so poor[2]. They then went on to fire the person responsible for this division[3].
That doesn't sound like a financial decision to me.
They tried to do something that probably would have looked like Copilot integration into Windows, and they chose not to do that, because they discovered that it sucked.
So, they failed in an internal sense, which is better than the externalized kind of failure that Microsoft experienced.
I think the nut that hasn't been cracked is: how do you get LLMs to replace the OS shell and the core set of apps that folks use? I think Microsoft is trying by shipping stuff that sucks and pissing off customers, while Apple tried internally and declined to ship it. OpenClaw might be the most interesting stab in that direction, but even that doesn't feel like the last word on the subject.
Well, they tried and they failed. In that case, maybe the smartest move is not to play. It looks like the technology is largely turning into a commodity in the long run anyway, so sitting this out and letting others make the mistakes first might not be the worst of all ideas.
Sure, Siri is, but do people really buy their phone based off of a voice assistant? We're nowhere near having an AI-first UX a la "Her" and it's unclear we'll even go in that direction in the next 10 years.
I think Apple is waiting for the bubble to deflate, and then they'll do something different. And they have a ready-made user base to provide whatever they can make money from.
If they were taking that approach, they would have absolutely first-class integration between AI tools and user data, complete with proper isolation for security and privacy and convenient ways for users to give agents access to the right things. And they would bide their time for the right models to show up at the right price with the right privacy guarantees.
They apparently are working on, and are going to release, 2(!) different versions of Siri. IDK, that just screams "leadership doesn't know what to do and can't make a tough decision" to me. But who knows? Maybe two versions of Siri is what people will want.
It sounds like the first one, based on Gemini, will be a more limited version of the second ("competitive with Gemini 3"). IDK if the second is also based on Gemini, but I'd be surprised if that weren't the case.
Seems like it's more a ramp-up than two completely separate Siri replacements.
For CC, I suspect it would also need to test and label separate runs against subscription, public API, and Bedrock-served models?
It’s a terrific idea to provide this. Something like isitdownorisitjustme for LLMs would be the canary in the coal mine that could at least inform the multitude of discussion threads about suspected dips in performance (beyond HN).
What we could also use is similar stuff for Codex, and eventually Gemini.
Really, the providers themselves should be running these tests and publishing the data.
Availability status alone is no longer sufficient to gauge service delivery, because the output is by nature non-deterministic.
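A minimal version of such a probe, assuming an API key in the environment; the model id is a placeholder, and a real harness would score the outputs rather than just log them:

```bash
#!/usr/bin/env bash
# Canary probe: send the same fixed prompt on a schedule and log
# latency + output so quality dips become visible over time.
model="claude-sonnet-4-5"   # placeholder model id
start=$(date +%s%3N)

resp=$(curl -sS https://api.anthropic.com/v1/messages \
  -H "x-api-key: $ANTHROPIC_API_KEY" \
  -H "anthropic-version: 2023-06-01" \
  -H "content-type: application/json" \
  -d '{
        "model": "'"$model"'",
        "max_tokens": 64,
        "messages": [{"role": "user",
                      "content": "Reply with exactly: OK-CANARY-7"}]
      }')

latency=$(( $(date +%s%3N) - start ))
echo "$(date -Is) latency_ms=$latency resp=$(echo "$resp" | head -c 200)" >> canary.log
```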
This is a great question; would love some feedback on this.
I assume they stuck with RealSense for proper depth maps. However, those are both limited to about a 6 meter range, and their depth imaging can't resolve features smaller than the native resolution allows (it gets worse past 3 m too, as there is less and less parallax, among other issues). I wonder how they approached that as well.