I will be crucified for this, but I think you are doing it wrong.
I would split it in 2 steps.
First, just move it to Svelte, maintaining the same functionality, and ideally wrap it in some tests. As mentioned, you want something that can be used as a pass/no-pass filter, as in: yes, the migration did not change the functionality.
Then, apply another pass to go from bad-quality Svelte to good-quality Svelte.
Here the trick is that "good quality" is quite subjective and varies a lot between codebases. I have found the models not quite able to grasp what "good quality" means in a given codebase.
For the second pass, ideally you would feed in examples of good modules from your codebase to follow, plus a description of what you think is important.
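To make the first pass concrete, here is a minimal sketch of the kind of pass/no-pass test I mean, using Vitest and @testing-library/svelte. Counter.svelte, its props, and the roles it exposes are made-up placeholders; the point is that the assertions encode the legacy behavior, so a green run means the port did not change it.

```typescript
// Hypothetical regression test for a component migrated to Svelte.
// The assertions mirror the pre-migration behavior: if they pass,
// the port is a "pass"; if not, the model changed the functionality.
import { describe, it, expect } from 'vitest';
import { render, screen, fireEvent } from '@testing-library/svelte';
import Counter from './Counter.svelte'; // placeholder component name

describe('Counter (post-migration)', () => {
  it('renders the same initial state as the legacy widget', () => {
    render(Counter, { props: { start: 5 } });
    expect(screen.getByRole('status').textContent).toBe('5');
  });

  it('increments exactly like the legacy widget did', async () => {
    render(Counter, { props: { start: 5 } });
    await fireEvent.click(screen.getByRole('button', { name: 'Increment' }));
    expect(screen.getByRole('status').textContent).toBe('6');
  });
});
```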
My partner and I have been working to invert the overall model.
She started grading the conversations that the students have with LLMs.
From the questions that the students ask, it is obvious who knows the material and who is struggling.
We do have a custom setup: she creates a homework assignment, and there is a custom prompt to stop the LLM from answering the homework questions outright. But that's pretty much it.
The results seem promising, with students spending 30 minutes or so going back and forth with the LLM.
If any educator wants to try it or is interested in more information, let me know and we can see how we might collaborate.
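For anyone curious what the guardrail looks like, here is a rough sketch of the idea, not our actual setup; the prompt wording and the model name are illustrative, and it uses the stock OpenAI SDK rather than whatever LMS plumbing you have.

```typescript
// Illustrative tutor endpoint: the system prompt forbids giving away
// answers and steers the model toward Socratic hints instead.
import OpenAI from 'openai';

const client = new OpenAI(); // reads OPENAI_API_KEY from the environment

const TUTOR_PROMPT = `You are a tutor for this homework assignment.
Never state the final answer to any homework question, even if asked directly.
Instead, ask what the student has tried, point at the relevant concept,
and give hints that move them one step forward.`;

async function tutorReply(studentMessage: string): Promise<string> {
  const res = await client.chat.completions.create({
    model: 'gpt-4o-mini', // placeholder; any chat model works
    messages: [
      { role: 'system', content: TUTOR_PROMPT },
      { role: 'user', content: studentMessage },
    ],
  });
  return res.choices[0].message.content ?? '';
}
```

The full back-and-forth transcript is what gets graded afterwards, so beyond this the setup only needs to log the conversation.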
This makes some sense, but my first question would be how do you define a clear, fair grading rubric? Second, this sounds like it could work for checking who is smart, but can it motivate students to put in work to learn the material?
I believe the broader question would be whether a free market is always USEFUL and DESIRABLE for individuals and the community as a whole. And what freedom is when individual and community interests are not necessarily the same.
What you're really asking is whether fundamental individual human rights are desirable for individuals and the community as a whole, which is of course a hotly debated topic. So yeah, it goes all the way down to fundamental questions like whether we should have freedom of association.
Just to echo the point on MCPs: they seem cool, but in my experience just using a CLI is orders of magnitude faster to write and to debug (I just run the CLI myself, put tests in the code, etc.).
Yep, and it doesn't bloat the context unnecessarily. The agent can call --help when it needs it. Just imagine a kubectl MCP with all the commands as individual tools; it doesn't make any sense whatsoever.
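To illustrate the alternative: instead of hundreds of per-command tool schemas, the agent gets one generic "run a CLI" tool and discovers flags itself via --help. A sketch in Node/TypeScript; the allowlist and names are made up.

```typescript
// One tool instead of a kubectl-sized tool catalog: the agent runs a
// whitelisted binary with arguments and reads whatever it prints.
import { execFile } from 'node:child_process';
import { promisify } from 'node:util';

const run = promisify(execFile);

async function cliTool(binary: string, args: string[]): Promise<string> {
  const allowed = new Set(['kubectl', 'git', 'ls']); // keep this tight
  if (!allowed.has(binary)) throw new Error(`binary not allowed: ${binary}`);
  const { stdout, stderr } = await run(binary, args, { timeout: 30_000 });
  return stdout + stderr;
}

// The agent self-serves documentation only when it needs it, e.g.:
//   await cliTool('kubectl', ['get', '--help']);
```

The context cost is a single tool schema, and the docs are loaded on demand rather than sitting in every request.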
And this is why I usually use simple system prompts/direct chat for "heavy" problems/development that require reasoning. The context bloat is getting pretty nutty, and it is definitely detrimental to performance.
The point of this stuff is to increase reliability. Sure, the LLM has a good chance of figuring out the skill by itself, but the idea is that it's less likely to fuck up with the skill available. This is an engineering advancement that makes it easier for businesses to rely on LLMs for routine stuff with less oversight.
4o on ChatGPT.com vs. Opus in an IDE is like cooking food without kitchen tools vs. using them. 4o is neither a coding-optimized model nor a reasoning model in general.
You're not pushing them hard enough if you're not seeing a vast difference between 4o and Opus. Or possibly they're equivalent in the field you're working in but I suspect it's the former.
If it were me, yeah, park it in bonds and live off the interest on a tropical beach. Spend my days spearfishing and drinking beers with the locals. Have no concerns except how even my tan is (and tbh I don't see myself caring too much about that).
I am interested in improving the lives of the many people who cannot afford to be stockholders.
The reason I'm interested in this is twofold.
First, I think the current system is exploitative. I don't advocate for communism or anything, but the current system of extracting value from the lower class is disgusting.
Second, they outnumber the successful people by a vast margin, and I don't want them to have a reason to re-invent the guillotine.
I agree. I just personally wouldn’t want to wander around exploring it continuously for months without more interesting work/goals. Even though cultures and geography may be wonderfully varied, their ranges are way smaller than what could be.
If you want to improve the lives of many, by all means go for it. I think that is a wonderful ambition to have in life and something I strive for, too!
But we are talking about an ad company here, trying to branch out into AI to sell more ads, right? Meta existing is without a doubt a net negative for mankind.
I met a youngster on a Bocas del Toro island in Panama a decade or so ago. I was about to be fired from my FAANG job, so I used up years and years of vacation for one big trip before I was let go. We hung out for a few days while I was there (I don't recommend the place at all, btw). He had cashed out from early Twitter and was setting up surf schools all over the world. All he did was travel, surf, drink, and fuck. I'm still angry that I laughed at all the dumb startups in the late 2000s instead of joining them. But this guy did what you're suggesting, and I think there are many more unknown techbros who did it too.
I met a traveller from an antique land,
Who said: “Two vast and trunkless legs of stone
Stand in the desert. Near them, on the sand,
Half sunk, a shattered visage lies, whose frown,
And wrinkled lip, and sneer of cold command,
Tell that its sculptor well those passions read
Which yet survive, stamped on these lifeless things,
The hand that mocked them and the heart that fed;
And on the pedestal these words appear:
"My name is Ozymandias, king of kings:
Look on my works, ye Mighty, and despair!"
Nothing beside remains. Round the decay
Of that colossal wreck, boundless and bare,
The lone and level sands stretch far away.
I take that more as a rumination on the futility of vanity and self-aggrandizement rather than "ruling the world", which in the modern day comes down to politics. Yes, there is considerable overlap with ego, but there's more to that topic than pure self-worship.
> Also, do you have a better way to spend that money?
Yes, I do.
I am aware of some quite deep scientific results that would have a major impact (and thus likely bring a lot of business value) if they were applied in practice.
Downsize Facebook back to, like, a couple thousand people max, use the resulting savings to retire, and start your own AI instead of doing the whole shadow-artist thing: "I'll hire John Carmack/a top AI researcher to work for me because deep down I can't believe I'd ever be as good as them, and my ego is too afraid to look foolish, so I won't even try, even if deep down that's what I want more than being a capricious billionaire."
Or am I just projecting my beliefs onto Mark Zuckerberg here?
Retire? Anyone with more than about 10-20 million who continues to work has some sort of pathology that leaves them unsatisfied. Normal people rarely even get to that level because they are too busy enjoying life. Anyone making billions has some serious issues that they are likely stuck with, because hubris won't let them seek meaningful help.
In fairness, their design does not seem to be regional, with problems in one region bringing down other regions that are apparently not as unrelated as they should be.
With this kind of architecture, this sort of problem is just bound to happen.
During my time at AWS, region independence was a must. And some services were able to operate, at least for a while, without degrading even when some core dependencies were not available. Think losing S3.
And after that, the service would keep operating, but with a degraded experience.
I am stunned that this level of isolation is not common in GCP.
Global dependencies were disallowed back in 2018 with a tiny handful of exceptions that were difficult or impossible to make fully regional. Chemist, the service that went down, was one of those.
Generally GCP wants regionality, but because it offers so many higher-level inter-region features, some kind of a global layer is basically inevitable.
AWS regions are fundamentally different from GCP regions. GCP marketing tries really hard to make it seem otherwise, or to suggest that GCP has all the advantages of AWS regions plus the advantages of their own approach, which leans heavily on "effectively global" services. There are tradeoffs: for example, multi-region in GCP is often trivial, and GCP can enforce fairness across regions, but that comes at the cost of availability. Which would be fine (GCP SLAs reflect the fact that they rarely consider regions to be reliable fault containers), but GCP marketing, IMO, creates a dangerous situation by pretending regions are something they aren't.
Even in the mini incident report, they went through extreme linguistic gymnastics trying to claim the service is regional. Describing the service that caused the outage, which is responsible for global quota enforcement and is configured using a data store that replicates data globally in near real time, with apparently no option to delay replication, they said:
> Service Control is a regional service that has a regional datastore that it reads quota and policy information from. This datastore metadata gets replicated almost instantly globally to manage quota policies for Google Cloud and our customers.
Not only would AWS call this a global service, the whole concept of global quotas would not fly at AWS.
How does AWS do that though? Do they re-implement all the code in every region? Because even the slightest re-use of code could trigger simultaneous (possibly delayed) downtime across all regions.
> Do they re-implement all the code in every region?
Everyone does.
The difference is that AWS very strongly ensures that regions are independent failure domains. The GCP architecture is global, with all the pros and cons that implies: e.g. GCP has a truly global load balancer, while AWS cannot, since everything there is regional at its core.
They definitely roll out code (at least for some services) one region at a time. That doesn't prevent old bugs/issues from coming up but it definitely helps prevent new ones from becoming global outages.
Regions (and even availability zones) in AWS are independent. The regions all use overlapping IPv4 address space, so direct cross-region connectivity is impossible.
So it's actually really hard to accidentally make cross-region calls, if you're working inside the AWS infrastructure. The call has to happen over the public Internet, and you need a special approval for that.
Deployments also happen gradually, typically only a few regions at a time. There's an internal tool that allows things to be gradually rolled out and automatically rolled back if monitoring detects that something is off.
Does Route53 depend on services in us-east-1 though? Or maybe it's something else, but I recall us-east-1 downtime causing downtime for global services.
As far as I remember, Route53 is semi-regional. The master copy is kept in us-east-1, but individual regions have replicated data. So if us-east-1 goes down, the individual regions will keep working with the last known state.
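That "keep working with the last known state" pattern is simple to sketch. This is a conceptual illustration, not Route53 internals; fetchFromPrimary and the types are hypothetical stand-ins.

```typescript
// Data plane keeps answering from the last replicated snapshot even
// when the primary region / control plane is unreachable.
type RecordSet = Map<string, string>;

let lastKnownGood: RecordSet = new Map();

async function refresh(fetchFromPrimary: () => Promise<RecordSet>): Promise<void> {
  try {
    lastKnownGood = await fetchFromPrimary(); // e.g. replication from us-east-1
  } catch {
    // Primary unreachable: skip the update and keep serving the
    // last known state instead of failing reads.
  }
}

function resolve(name: string): string | undefined {
  return lastKnownGood.get(name);
}
```

Writes stop propagating during the outage, but reads stay up, which is exactly the semi-regional behavior described above.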
Static stability is a good start, but isn't enough.
In this outage, my service (on GCP) had static stability, which was great. However, some other similar services failed, and we got more load, but we couldn't start additional instances to handle the load because of the outage, and so we had overloaded servers and poor service quality.
Mayhaps we could have adjusted load across regions to manage instance load, but that's not something we normally do.
One of the core pieces of static stability (at least in one definition, it's an overloaded term) is being able to handle failure scenarios from a steady state.
The classic example is overprovisioning so that you can handle the extra zonal load in the event of a zonal outage without needing to scale up.
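The arithmetic behind that overprovisioning is worth spelling out; a tiny helper, with made-up numbers:

```typescript
// With n zones and peak load L, surviving the loss of one zone means each
// remaining zone must absorb L / (n - 1) instead of L / n.
function perZoneCapacity(peakLoad: number, zones: number): number {
  if (zones < 2) throw new Error('need at least 2 zones to survive losing one');
  return peakLoad / (zones - 1);
}

// Example: 3 zones at 900 rps peak. Each zone normally serves 300 rps
// but must be provisioned for 450 rps, i.e. run at ~67% utilization.
console.log(perZoneCapacity(900, 3)); // 450
```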