More

ramesh31 · 2025-04-01T05:03:26 1743483806

Cline, full stop. It's approaching literal magic at this point.

ramesh31 · 2025-03-31T19:27:41 1743449261

I genuinely can't understand the thought process of a Yankees fan. If it's just a tradition thing, then sure whatever. But someone who watches them play and goes "yeah that's my team" is just mindblowing. They'll have a batting lineup that costs more than the opponents entire field, knowing full well they are all just hired guns who will be gone the moment the contract is up, and then you watch them in the playoffs against regular teams and it's just visually hilarious at this point. Like watching a bunch of NFL linebackers playing teeball.

techdmn · 2025-03-31T19:33:40 1743449620

Some people want to back a winner, and they don't really get too worked up about the details. Another example would be Ferrari in early 2000s in F1. Biggest budget, most skilled driver, all the dirty tricks at all levels (on-track, technical, political), plenty of fans.

dionidium · 2025-03-31T19:43:47 1743450227

> I genuinely can't understand the thought process of a Yankees fan.

There is very little free agency in American sports fandom. People are (for the most part) fans of the team local to where they grew up. (This kind of bums me out as someone raising kids in New England, which is not where I'm from, and so not whose teams I root for.)

bluGill · 2025-03-31T20:24:41 1743452681

Backing the local team always makes the most sense. In NYC you can choose Mets or Yankees (though where you live in the city affects even that). Choosing a team from some other city means you see your team play much less often and only after much effort. Worse there are less people to talk about the game as nobody has seen your team play and you didn't see their team plan. (except when your team plans the local team)

testing22321 · 2025-04-02T08:00:20 1743580820

That is true of all Poe sport in the USA. No salary cap and teams are a franchise that exist to make money.

ramesh31 · 2025-03-31T19:22:27 1743448947

Hard to imagine whatever advantage this is affording won't be nullified by pitchers within a season. The arms race is getting insane.

ramesh31 · 2025-03-31T15:19:42 1743434382

More capability, less reliability please. I want something that can achieve superhuman results 1 out of 10 times, not something that gives mediocre human results 9 out of 10 times.

All of reality is probabilistic. Expecting that to map deterministically to solving open ended complex problems is absurd. It's vectors all the way down.

klabb3 · 2025-03-31T17:06:52 1743440812

Reality is probabilistic yes but it’s not black box. We can improve our systems by understanding and addressing the flaws in our engineering. Do you want probabilistic black-box banking? Flight controls? Insurance?

”It works when it works” is fine when stakes are low and human is in the loop, like artwork for a blog post. And so in a way, I agree with you. AI doesn’t belong in intermediate computer-to-computer interactions, unless the stakes are low. What scares me is that the AI optimists are desperately looking to apply LLMs to domains and tasks where the cost of mistakes are high.

soulofmischief · 2025-03-31T15:35:08 1743435308

Stability is the bedrock of the evolution of stable systems. LLMs will not democratize software until an average person can get consistently decent and useful results without needing to be a senior engineer capable of a thorough audit.

ramesh31 · 2025-03-31T15:42:39 1743435759

>Stability is the bedrock of the evolution of stable systems.

So we also thought with AI in general, and spent decades toiling on rules based systems. Until interpretability was thrown out the window and we just started letting deep learning algorithms run wild with endless compute, and looked at the actual results. This will be very similar.

klabb3 · 2025-03-31T17:13:00 1743441180

This can be explained easily – there are simply some domains that were hard to model, and those are the ones where AI is outperforming humans. Natural language is the canonical example of this. Just because we focus on those domains now due to the recent advancements, doesn’t mean that AI will be better at every domain, especially the ones we understand exceptionally well. In fact, all evidence suggests that AI excels at some tasks and struggles with others. The null hypothesis should be that it continues to be the case, even as capability improves. Not all computation is the same.

skydhash · 2025-03-31T17:10:49 1743441049

Rules based systems are quite useful, not for interacting with an untrained human, but for getting things done. Deep learning can be good at exploring the edges of a problem space, but when a solution is found, we can actually get to the doing part.

soulofmischief · 2025-03-31T17:47:48 1743443268

Stability and probability are orthogonal concepts. You can have stable probabilistic systems. Look no further than our own universe, where everything is ultimately probabilistic and not "rules-based".

recursive · 2025-03-31T17:15:54 1743441354

> Expecting that to map deterministically to solving open ended complex problems is absurd.

TCP creates an abstraction layer with more reliability than what it's built on. If you can detect failure, you can create a retry loop, assuming you can understand the rules of the environment you're operating in.

ramesh31 · 2025-03-31T23:34:41 1743464081

>If you can detect failure, you can create a retry loop, assuming you can understand the rules of the environment you're operating in

Indeed, this is what makes autonomous agentic tool using systems robust as well. Those retry loops become ad-hoc where needed, and the agent can self correct based on error responses, compared to a defined workflow that would get stuck in said loop if it couldn't figure things out, or just error out the whole process.

Jianghong94 · 2025-03-31T15:50:00 1743436200

Superhuman results 1/10 are, in fact, a very strong reliability guarantee (maybe not up to today's nth 9 decimal standard that we are accustomed to, but probably much higher than any agent in real-world workflow).

deprave · 2025-03-31T15:46:13 1743435973

What would be a superhuman result for booking a flight?

mjmsmith · 2025-03-31T16:06:21 1743437181

10% of the time the seat on either side of you is empty, 90% of the time you land in the wrong country.

ramesh31 · 2025-03-31T14:09:44 1743430184

You guys really need a Docker build. This dependency chain with submodules is a nightmare.

phkahler · 2025-03-31T19:22:18 1743448938

I'm a hater of complexity and build systems in general. Following the instructions for building solvespace on Linux worked for me out of the box with zero issues and is not difficult. Just copy some commands:

https://github.com/solvespace/solvespace?tab=readme-ov-file#...

ramesh31 · 2025-03-31T20:14:01 1743452041

>I'm a hater of complexity and build systems in general.

But you already have a complex cmake build system in place. Adding a standard Docker image with all the deps for devs to compile on would do nothing but make contributing easier, and would not affect your CI/CD/testing pipeline at all. I followed the readme and spent half an hour trying to get this to build for MacOS before giving up.

If building your project for all supported environments requires anything more than a single one-line command, you're doing it wrong.

phkahler · 2025-04-03T15:44:21 1743695061

>> But you already have a complex cmake build system in place.

I didn't build it :-(

>> Adding a standard Docker image with all the deps for devs to compile on would do nothing but make contributing easier, and would not affect your CI/CD/testing pipeline at all.

I understand, but to me that's just more stuff to maintain and learn. Everyone wants to push their build setup upstream - snap packages, flatpak, now we need docker... And then you and I complain that the build system is complex, partly because it supports so many options. But it looks like the person taking up the AI challenge here is using Docker, so maybe we'll get that as a side effect :-)

jcheng · 2025-04-02T00:20:21 1743553221

I'm sympathetic in general, but in this case:

"You will need git, XCode tools, CMake and libomp. Git, CMake and libomp can be installed via Homebrew"

That really doesn't seem like much. Was there more to it than this?

Edit: I tried it myself and the cmake configure failed until I ran `brew link --force libomp`, after which it could start to build, but then failed again at:

    [ 55%] Building CXX object src/CMakeFiles/solvespace-core.dir/bsp.cpp.o
    c++: error: unknown argument: '-Xclang -fopenmp'

semi-extrinsic · 2025-03-31T17:01:12 1743440472

Alternative perspective: you kids with your Docker builds need to roll up your sleeves and learn how to actually compile a semi-complicated project if you expect to be able to contribute back to said project.

Philpax · 2025-03-31T18:56:46 1743447406

If your project is hard to build, that's your problem, not mine. I'll simply spend my time working on projects that respect it.

disgruntledphd2 · 2025-03-31T18:03:49 1743444229

I can see both perspectives! But honestly, making a project easier to build is almost always a good use of time if you'd like new people to contribute.

ramesh31 · 2025-03-31T20:12:03 1743451923

>"Alternative perspective: you kids with your Docker builds need to roll up your sleeves and learn how to actually compile a semi-complicated project if you expect to be able to contribute back to said project."

Well, that attitude is probably why the issue has been open for 2 years.

ramesh31 · 2025-03-28T15:04:55 1743174295

We thought modern industrialized nations could never go to total war again because people simply wouldn't accept it. Turns out they did.

pjc50 · 2025-03-28T15:47:00 1743176820

It only takes one dictator, then the wishes of the people become irrelevant. Or propagandized; I'm sure the war is quite popular in Russia still despite horrific casualties.

ben_w · 2025-03-28T17:44:32 1743183872

To an extent.

As I understand it, Russia hasn't been able to actually call it a "war" domestically, they did burn through prisoners rather than trained forces until they ran out of people willing to believe the chances of surviving to enjoy early release, and Russian forces have been only partially rather than fully mobilised with their conscripts mostly kept back from the front line for a while now due to domestic concerns.

ramesh31 · 2025-03-27T16:59:48 1743094788

Microsoft is the official owner of Playwright

ramesh31 · 2025-03-27T16:57:12 1743094632

>I burned though $25 in just 3 hours. Claude code will be great when they can get the cost down. If the cost is like 1/10th of that I’d be using it all the time, but +/- $10 / hour is too much.

I've been trying to figure this out, and I don't think it's malicious, but it's just a matter of incentives. Anthropic devs are certainly not paying retail prices for Claude usage, so their benchmark (or just intuition) of efficiency is probably much different than the average user. Without that hard constraint the incentive just isn't there for them to squeeze out a few more pennies, and it ends up way more expensive than stuff like Cline or Cursor.

ramesh31 · 2025-03-27T16:29:31 1743092971

There's something so quaint and comforting in revisiting the world of peak post-modernist "sarcastic irony" that infused everything in this era. It was just so damned sure of itself.

ramesh31 · 2025-03-27T12:10:37 1743077437

>And we expose more tool discovery flexibility rather than just list_tools()

Extremely curious about this as just directly listing all tools to an agent will obviously not scale well. What does the interface look like?