In the same way that finding waste while increasing the federal budget isn't efficiency.
Technically, maybe you can squint and find small pieces that are more efficient, but in the grand scheme of things the goal doesn't seem to be a smaller government.
It's also worth noting that the NLRB has a proposed budget of $320M for the 2025 fiscal year and a total of around 1,300 employees [1].
I'm a strong proponent of small government and don't know enough about the NLRB to say if I would find them useful, but that is well within the range of a small federal department today.
I ended up drastically cutting back on Amazon purchases when they started getting flooded with brands like that.
It's absolutely on Amazon to maintain quality. There are certain brands and types of products I'll order there because they're just harder to find otherwise, but it's mostly a last resort these days given that Amazon doesn't care to curate what is on their "shelves".
I assume we'll see backups on both sides. Containers backed up in Chinese ports and a huge backlog of unclaimed packages and delayed tariff bills waiting for USPS/UPS/FedEx to process them.
> This highlights an opportunity for organizations to better support their developers’ interest in AI tools, considering local regulations.
This is a funny one to see included in GitHub's report. If I'm not mistaken, GitHub is now using the same approach as Shopify with regards to requiring LLM use and including it as part of a self-report survey for annual review.
I guess they took their 2024 survey to heart and are ready to 100x productivity.
I mean, I spent years learning to code in school and at home, but never managed to get a job doing it, so I just do what I can in my spare time, and LLMs help me feel like I haven't completely fallen off. I can still hack together cool stuff and keep learning.
I actually meant it as a good thing! Our industry plays very loose with terms like "developer" and "engineer". We never really defined them well and it's always felt more like gatekeeping.
IMO if someone uses what tools they have, whether that's an LLM or vim, and is able to ship software, they're a developer in my book.
Probably. There is a similar question: if you ask ChatGPT / Midjourney to generate a drawing, are you an artist? (To me, yes, which would mean that AI "vibe coders" are actual developers in their own way.)
If your daughter could draw a house with enough detail that someone could take it and actually build it then you'd be more along the lines of the GP's LLM artist question.
Not really, the point was contrasting sentimental labels with professionally defined titles, which seems precisely the distinction needed here. It's easy enough to look up the agreed-upon term for software engineer / developer and agree that it's more than someone who copy-pastes code until it just barely runs.
EDIT: To clarify I was only talking about vibe coder = developer. In this case the LLM is more of the developer and they are the product manager.
Do we have professionally defined titles for developer or software engineer?
I've never seen it clarified, so I tend to default to the lowest common denominator: if you're making software in some way, you're a developer. The tools someone uses don't really factor into it for me (even if that is copy/pasting from stackoverflow).
I don't know, if you actually design in some way and deliver the solution for the structure of the bridge, aren't you THE structural engineer for that project?
That probably depends on whether you consider LLMs, or human artists, as tools.
If someone uses an LLM to make code, I consider the LLM to be a tool that will only be as good as the person prompting it. The person, then, is the developer while the LLM is a tool they're using.
I don't consider auto complete, IDEs, or LSPs to take away from my being a developer.
This distinction likely goes out the window entirely if you consider an LLM to actually be intelligent, sentient, or conscious though.
This argument isn't particularly compelling in my opinion.
I don't actually like the stochastic parrot argument either to be fair.
I feel like the author is ignoring the various knobs (randomization factors may be a better term) applied to the models during inference that are tuned specifically to make the output more believable or appealing.
Turn the knobs too far and the output is unintelligible garbage. Don't turn them far enough and the output feels very robotic or mathematical; it's obvious that the output isn't human. The other risk of not turning the knobs far enough would be copyright infringement, but I don't know if that happens often in practice.
Claiming that LLMs aren't stochastic parrots without dealing with the fact that we forced randomization factors into the mix misses a huge potential argument that they are just cleverly disguised stochastic parrots.
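For concreteness, here's a minimal, purely illustrative sketch of two of those knobs at sampling time (temperature and top-k truncation). The token scores are made up, and real inference stacks layer on top-p, repetition penalties, and more:

```python
import math
import random

def sample_next_token(logits, temperature=0.8, top_k=3):
    """Toy sketch of inference-time randomization knobs.

    Temperature rescales the scores before normalizing; top-k keeps only
    the k highest-scoring tokens. Very low temperature approaches greedy
    ("robotic") output; very high temperature flattens the distribution
    toward garbage. Values and tokens here are hypothetical.
    """
    # Keep only the top_k highest-scoring candidate tokens.
    top = sorted(logits.items(), key=lambda kv: kv[1], reverse=True)[:top_k]
    # Softmax with temperature over the surviving candidates.
    weights = [math.exp(score / temperature) for _, score in top]
    total = sum(weights)
    probs = [w / total for w in weights]
    # Draw one token at random according to those probabilities.
    return random.choices([tok for tok, _ in top], weights=probs, k=1)[0]

# Hypothetical next-token scores after "The cat sat on the".
logits = {"mat": 9.1, "sofa": 8.7, "moon": 5.2, "rug": 4.9, "banana": 1.3}
print(sample_next_token(logits, temperature=0.8, top_k=3))
```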
This seems like it was inevitable. Most people do not understand the meaning of the word "stochastic" and so they're likely to simply ignore it in favour of reading the term as "_____ parrot."
What you have described, a probability distribution with carefully-tuned parameters, is perfectly captured by the word stochastic as it's commonly used by statisticians.
Human brains are similarly finely tuned and have similar knobs, it seems to me. People with no short term memory have the same conversations over and over again. Drunk people tend to be very predictable. There are circuits that give us an overwhelming sense of impending doom, or euphoria, or the conviction that our loved ones have been replaced by imposters. LLMs with very perturbed samplers bear, sometimes, a striking resemblance to people on certain mind-altering substances.
And that's really a core of the problem, we don't well understand how the human mind works and we can't really define or identify "intelligence."
I mentioned I don't like the stochastic parrot argument, and that I find this article's argument lacking. Both are for the same reason, the arguments are making claims that we simply can't make while missing the fundamental understanding of what intelligence really is and how human (and other animals) brains work.
You seem to be coming with the assumption that the difference between parrots and what many would consider intelligence is math, or that math is a reliable indicator of those different groups.
Solving hard math problems requires understanding the structure of complex mathematical reasoning. No animal is known to be capable of that.
Most definitions and measurements of intelligence by most laypeople and psychologists include the ability to reason, with mathematical reasoning widely accepted as part of or a proxy for it. They are imperfect but “intelligence” does not have a universally accepted definition.
Math is a contrived system though, there are no fundamental laws of nature that require math to be done the way we do it.
A human society may develop their own math in a base 13 system, or an entirely different way of representing the same concepts. When they can't solve our base 10 math problems in a way that matches how we expect, does that mean they are parrots?
Part of the problem here is that we still have yet to land on a clear, standard definition of intelligence that most people agree with. We could look to IQ, and all of its problems, but then we should be giving LLMs an IQ test to answer rather than a math test.
The fact that much of physics can be so elegantly described by math suggests the structures of our math could be quite universal, at least in our universe.
Check out the problems in the MATH dataset, especially Level 5 problems. They are fairly advanced (by most people's standards) and most are not dependent on which base-N system is used to solve them. The answers would be different of course, but the structures of the problems and solutions remain largely intact.
> Solving hard math problems requires understanding the structure of complex mathematical reasoning. No animal is known to be capable of that.
Except, it doesn't. Maybe some math problems do -- or maybe all of them do, when the text isn't in the training set -- but it turns out that most problems can be solved by a machine that regurgitates text, randomly, from all the math problems ever written down.
One of the ways that this debate ends in a boring cul-de-sac is that people leap to conclusions about the meaning of the challenges that they're using to define intelligence. "The problem has only been solved by humans before", they exclaim, "therefore, the solution of the problem by machine is a demonstration of human intelligence!"
We know from first principles what transformer architectures are doing. If the problem can be solved within the constraints of that simple architecture, then by definition, the problem is insufficient to define the limits of capability of a more complex system. It's very tempting to instead conclude that the system is demonstrating mysterious voodoo emergent behavior, but that's a bit like concluding that the magician really did saw the girl in half.
Please check out the post on Math-Perturb-Hard conveniently linked to above before making a comment without responding to it.
A relevant bit:
“for MATH-P-Hard, we make hard perturbations, i.e., small but fundamental modifications to the problem so that the modified problem cannot be solved using the same method as the original problem. Instead, it requires deeper math understanding and harder problem-solving skills.”
For the skeptics: Scoring just 10% or so below the original MATH Level 5 (hardest) dataset on MATH-P-Hard seems in line with, or actually better than, what most people would do.
Gemini 2.5 Pro:
“The sentence argues that even if a model's score drops by about 10% on the "Math-Perturb-Hard" dataset compared to the original "MATH Level 5" (hardest) dataset, this is actually a reasonable, perhaps even good, outcome. It suggests this performance decrease is likely similar to or better than how most humans would perform when facing such modified, difficult math problems.”
I think 'nopinsight' and the paper are arguing that the drop is 10%, not that the final score is 10%. For example, Deepseek-R1 dropped from 96.30 to 85.19. Are you actually arguing that a child guessing randomly would be able to score the same, or was this a misunderstanding?
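To make the two readings concrete, here's the arithmetic for the DeepSeek-R1 numbers quoted above (a quick sketch using only the figures cited in this thread):

```python
# Reported accuracy: original MATH Level 5 vs. MATH-P-Hard (from the comment above).
original, perturbed = 96.30, 85.19
absolute_drop = original - perturbed            # ~11.1 percentage points
relative_drop = absolute_drop / original * 100  # ~11.5% relative decline
print(f"drop: {absolute_drop:.2f} points ({relative_drop:.1f}% relative)")
```

So the model still scores ~85% after perturbation; the "10% or so" refers to the size of the drop, not the final score.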