
Does anyone want to talk about the hack itself? Can anyone give more details than "left their database open"? I came to this site hoping for a real discussion about that and didn't see it here yet...


Someone unminified the JS, and it turned out that a bunch of the REST endpoints it knew about were just unauthenticated CRUD endpoints for the site.

https://archive.ph/2025.02.14-132833/https://www.404media.co...
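For anyone who can't read the write-up: a minimal sketch of what that kind of flaw looks like, assuming a Next.js App Router API route (every name here is illustrative, not taken from the actual site):

    // Hypothetical unauthenticated CRUD endpoint.
    import { NextResponse } from "next/server";
    import { db } from "@/lib/db"; // assumed database helper

    export async function POST(req: Request) {
      const body = await req.json();
      // No session or token check: anyone who finds this route in the
      // unminified client JS can write straight to the database.
      await db.offices.update(body.id, body);
      return NextResponse.json({ ok: true });
    }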


Smells exactly like an LLM-created solution.


Or just what happens when you hire a bunch of 20-year-olds and let them loose.

That's currently how I model my usage of LLMs in code. A smart veeeery junior engineer that needs to be kept on a veeeeery short leash.


Yes. LLMs are very much like a smart intern you hired with no real experience who is very eager to please you.


IMO, they're worse than that. You can teach an intern things, correct their mistakes, help them become better and your investment will lead to them performing better.

LLMs are an eternal intern that can only repeat what it's gleaned from some articles it skimmed last year or whatever. If your expected response isn't in its corpus, or isn't in it frequently enough, and it can't just regurgitate an amalgamation of the top N articles you'd find on Google anyway, tough luck.


The Age of the Eternal Intern


LLMs are to interns what house cats are to babies. They seem more self sufficient at first, but soon the toddler grows up and you're stuck with an animal who will forever need you to scoop its poops.


And the content online is now written by Fully Automated Eternal September


Today is Friday the 11490th of September 1993.


Without a mechanism to detect output from LLMs, we're essentially facing an eternal model collapse with each new ingestion of information, from academic journals to blogs to art. [1][2]

[1] https://en.m.wikipedia.org/wiki/Model_collapse

[2] https://thebullshitmachines.com/lesson-16-the-first-step-fal...


> You can teach an intern things, correct their mistakes, help them become better and your investment will lead to them performing better.

You can't do it the same way you do with a human developer, but you can get a somewhat effective form of it through things like .cursorrules files and the like.
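For instance, a sketch of the kind of rules such a file might carry (contents are illustrative, not a known-good recipe):

    # .cursorrules (illustrative example)
    - Every API route and server action must verify the caller's session
      before reading or writing data.
    - Use parameterized queries only; never interpolate user input into SQL.
    - Escape or sanitize all user-supplied content before rendering it as HTML.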


Even at 20 years old I would not have done this.


The difference is that today's digital natives regard computers as magic and most don't know what's really happening when their framework du jour spits out some "unreadable" text.


So much this. I was interning at a government entity at 20 and already knew you needed credentials to do shit. Most frameworks give you this by default, for free; we're so incredibly screwed with these folks running rampant and destroying the government.


One who thinks "open source" means blindly copy/pasting code snippets found online.


It's definitely both. A bunch of 20-year-olds were let loose to be "super efficient." So, to be efficient, they used LLMs to implement what should be a major government oversight webpage. Even after the fix, the list is a few half-baked document excerpts and some sentences saying, "look how great we are!" It's embarrassing.


Does it? At least my experience is that ChatGPT goes super hard on security, heavily promoting the use of best practices.

Maybe they used Grok ;P


> At least my experience is that ChatGPT goes super hard on security, heavily promoting the use of best practices.

Not my experience at all. Every LLM produces lots of trivial SQLi/XSS/other-injection vulnerabilities. Worse, they seem to completely skip authorization, business logic, error handling, and logging, even when prompted to include them.
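To make the first point concrete, here's the shape of the trivial SQLi that LLM output tends to contain, next to the parameterized version (a sketch using node-postgres; the users table is hypothetical):

    import { Pool } from "pg"; // node-postgres
    const db = new Pool(); // assumes connection config via PG* env vars

    // Vulnerable: user input interpolated straight into the SQL string.
    async function findUserUnsafe(name: string) {
      return db.query(`SELECT * FROM users WHERE name = '${name}'`);
    }

    // Safe: a parameterized query; the driver handles escaping.
    async function findUserSafe(name: string) {
      return db.query("SELECT * FROM users WHERE name = $1", [name]);
    }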




Does it, though? The saying goes that we shouldn't attribute to malice what incompetence explains, but that requires more charity than usual for Musk's retinue.

Smells like getting a backdoor in early.


Apparently they get backdoors in as incompetently as they create efficiency.


My first guess is that this is an unauthenticated server action.[0]

0 - https://blog.arcjet.com/next-js-server-action-security/
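If that guess is right, the bug would look roughly like this (a sketch under that assumption; the action and field names are made up). Next.js exposes every "use server" function as a POST endpoint, so authorization has to happen inside the action itself:

    "use server";

    import { db } from "@/lib/db"; // assumed database helper

    // Hypothetical unauthenticated server action: any client that replays
    // the action's POST request can run it, since nothing checks a session.
    export async function saveOrgEntry(formData: FormData) {
      await db.orgChart.upsert({
        id: String(formData.get("id")),
        name: String(formData.get("name")),
      });
    }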


Maybe DOGE should have used an LLM to generate defenses


They did, and this is what they got.


Just checked the DOGE website; I'm not too sure about this theory, given that POST requests are blocked, the only APIs you can find (e.g. /api/offices) only support GET requests, and if the UUID doesn't match, it 404s.

I don't see any CRUD endpoints for modifying the database
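For reference, the sort of probe being described (the path comes from the comment above; the responses noted are what I'd expect today, not guaranteed):

    // Requires Node 18+ for global fetch; run as an ES module for top-level await.
    const res = await fetch("https://doge.gov/api/offices", {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({ name: "test" }),
    });
    console.log(res.status); // POST reportedly blocked; GET with a bad UUID 404s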


DOGE noticed. They might have "fixed" the vulnerability by now

https://doge.gov/workforce?orgId=69ee18bc-9ac8-467e-84b0-106... is what's linked to by the "Workforce" header, and it now looks different than the screenshots


Good thing we have the best and brightest at DOGE!


well they pay for a blue checkmark, they _must_ be the cleverest we have


It's been a while since I last saw a CMS pulling data from a database... It's a miracle the website didn't crumble under the load.


Put a CMS behind a well-configured CDN and it's essentially a static site generator. If you have cache invalidation figured out, you get all the speed and scalability benefits of a static site without ever having to regenerate your content.
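Concretely, that can be as simple as the origin sending cache headers the CDN honors and purging on publish. A sketch, with illustrative header values and an assumed CMS render call:

    declare function renderPageFromCms(): Promise<string>; // assumed CMS call

    export async function GET() {
      const html = await renderPageFromCms();
      return new Response(html, {
        headers: {
          // The CDN may hold the page for a day; a publish hook purges it.
          "Cache-Control":
            "public, max-age=60, s-maxage=86400, stale-while-revalidate=3600",
          "Content-Type": "text/html; charset=utf-8",
        },
      });
    }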


I’m guessing it didn’t have much in front of it because the management endpoints were accessible from the public Internet. I think you mentioning the “well configured CDN” is key here. If there was a CDN in front of it, it wasn’t well configured.

BTW, I spent a lot of my career configuring load balancing, caches, proxies, sharding, and CDNs for websites running Plone (a CMS that's popular with governments).


Yeah sorry, I didn't mean to imply these folks have any clue what they're doing. I misread your comment as "it's been a while since I saw a CMS-based site, big sites are all static now" instead of "it's been a while since I saw a CMS rawdogging it."



According to a source of mine, there were unsecured API endpoints for modification


> The database it is pulling from can be and has been written to by third parties, and will show up on the live website.

Not enough detail to say for sure; could be SQL injection, could be credentials exposed in the frontend.


...or endpoints not requiring any credentials at all.


… Oh, yes. After reading more carefully I see it, er, IS that. Where the hell did Musk find these people? 1996?


I'm not too sure about this theory; I just went on the DOGE site, and the API endpoints don't allow POST requests, and I can't find anything that lets me upload.


My bet is on SQL injection


They used one of those databases that are easy to connect directly to the internet; it's the same story as in about 90% of modern data breaches.

Every generation we make things much easier, lower the bar, and are rewarded when amateurs make amateur mistakes like this.


We made it so easy to program that any idiot could do it. So they do.


I found my new signature quote.


No way this is real.


In the year of our lord 2025? I doubt it. I'd put money on "some third party cloud service was configured in a silly way".

But, I would love to see details.


The article is paywalled, but it sounds like this is isolated to the site-displayed Twitter feed; basically, the site was hosted on Cloudflare, and you could insert your own fake tweets into what was shown on the site (but not into the actual DOGE Twitter feed). I don't think any data was actually compromised.


I can't speak to any data that may or may not be compromised, but this isn't about inserting fake tweets. Anything in their "government org chart" can be edited unauthenticated.


Yeah, it's just tremendously embarrassing. These are supposed to be the tech geniuses who can parse 50 years of accumulated legacy code and find all the government waste? In 3 weeks?


Data science and websites are different beasts.


I'm not yet sure whether they are even doing data science.

Anecdote time (pinch of salt required):

A relative of mine who studies accounting went to the DOGE site to see the "audit" and "analytics" records, after an acquaintance she was arguing with said "see the DOGE site!" as proof.

What she found when visiting the site was no "audit" at all, but instead a word count of how often objectionable terms appear in legislation or on government sites (DEI? Trans? LGBTQ?).

Being in the analytics/data engineering space myself, I was pretty amused to hear that was the quality of "analytics" being done.

Wasn't "word count" the "hello world" example for Hadoop big data back in 2013?


Some of the "data science" people I've met certainly believed that they could architect entire software systems just because they understood how to structure data in databases.


Surely technical competence is strongly correlated across the two beasts.



