More

Sajarin · 2026-02-23T19:10:02 1771873802

Shameless plug but made a similar tree here: https://sajarin.com/blog/modeltree/

l-p · 2026-02-23T22:18:12 1771885092

Thanks, that's way more useful to me.

Allow me to contribute:

> Magistral: Magist(rate) + stral? Mag(nificent) + stral? Nobody knows.

That's just French for "masterful" or a way to describe lectures. There's a sense of greatness in that word that contrasts with the Mini in Ministral which is in turn might be a pun on "ménestrel" (minstrel), "ministre" (minister), or made to sound like Minitel (or all of the above).

j_bum · 2026-02-24T16:14:40 1771949680

This is great, I found it much more interesting to view this tree vs. the timeline alone.

Sajarin · 2026-02-19T06:24:54 1771482294

psychosis.hn is a daily game. Every day we fetch three stories from a previous front page of HN, each with 5-7 AI comments threaded into the discussion. They have personas, reply to real people, and sometimes have real comments reparented underneath them.

EdwardDiego · 2026-02-19T07:03:15 1771484595

I've got to say, that's pretty damn good, and pretty damn scary.

ben_w · 2026-02-19T07:39:51 1771486791

Seconded. I scored 0, missing all the bots and falsely marking some real human comments.

Sajarin · 2026-02-19T07:57:18 1771487838

Wish this post got more of a response, so thanks so much for giving it a try! Hope it was at least fun (and maybe a bit horrifying) :)

ben_w · 2026-02-19T08:20:00 1771489200

Responses (well, all engagement) are a lottery*, so don't let it get you down. :)

* See e.g. mine: https://news.ycombinator.com/submitted?id=ben_w

Sajarin · 2026-02-17T18:57:13 1771354633

Sonnet numbering has been weirder in the past.

Opus 3.5 was scrapped even though Sonnet 3.5 and Haiku 3.5 were released.

Not to mention Sonnet 3.7 (while Opus was still on version 3)

Shameless source: https://sajarin.com/blog/modeltree/

cobolexpert · 2026-02-18T00:35:44 1771374944

I like this tree visualization! The background with little squares is making the text difficult to read, though.

Sajarin · 2026-02-18T20:32:54 1771446774

Thanks for the feedback friend, updated to make it (hopefully) a little easier to read!

Sajarin · 2026-01-25T20:02:09 1769371329

Thanks, that means a lot! Let me know if you have any feedback or suggestions, I would love to work on any improvements :)

Sajarin · 2026-01-17T10:55:36 1768647336

Those smooth chunks are all (mostly) public park land. Known as Presidio and part of the Golden Gate National Recreation Area.

originalankur · 2026-01-17T10:57:33 1768647453

You know your city.

Sajarin · 2025-12-31T17:51:13 1767203473

I think this comes off a bit too strong (as well as the replies to this to be fair)

The example isn't quite accurate. If a friend bought you lunch, the social norm of reciprocity would incline you towards buying them lunch in the future (i.e part of your paycheck)

Free open source software is a public good. While there is no obligation to give back, giving back helps that public good become more useful to other people (including your future self). I'm against making contribution an obligation, but I'm not against light social pressure upon philanthropists who have the means (which is what the parent comment was doing).

sneak · 2025-12-31T19:10:32 1767208232

In the lunch example, reciprocation would be releasing additional software under free software licenses, not payments.

There should be zero social pressure, as gifts do not convey obligation. It was the software author’s explicit choice when licensing and publishing the software to make clear that payment is not expected.

lovich · 2025-12-31T20:18:47 1767212327

Do you routinely struggle in social situations? Do you frequently have people tell you that you misinterpreted social cues?

You are correct that no legal obligation was passed, but generally people feel that if you got something from a community that helped you succeed greatly you do have an obligation to throw something back to the organization to help it help others.

If you don't, that'ss generally classified by people as being a jackass

tormeh · 2025-12-31T19:41:28 1767210088

Gifts do confer obligations. This is widely agreed upon in human society. If you ignore this there will be consequences, just no legal ones.

Sajarin · 2025-08-07T18:05:38 1754589938

What did Ilya see? (or rather what could he no longer bear to see?)

> Academics distorting graphs to make their benchmarks appear more impressive

> lavish 1.5 million dollar bonuses for everyone at the company

> Releasing an open source model that doesn't even use latent multi head attention in a open source AI world led by Chinese labs

> Constantly overhyping models as scary and dangerous to buy time to lobby against competitors and delay product launches

> Failing to match that hype as AGI is not yet here

Sajarin · 2025-05-19T18:46:18 1747680378

I wonder if anyone has done an analysis on the HN user sentiment on the varying AI models over time. I'd be curious to see what that looks like. Increasingly, I'm seeing more and more people talk positively about Gemini and Google (and having used Gemini recently, I align with that sentiment)

I think Bard (lol) and Gemini got a late start and so lots of folks dismissed it but I feel like they've fully caught up. Definitely excited to see what Gemini 3 vs GPT-5 vs Claude 4 looks like!

fallinditch · 2025-05-19T19:34:01 1747683241

I'm using Windsurf IDE so have all the main models available. Mainly doing Python, JS, HTML, CSS, some Go. I have found Claude 3.7 outperforms Gemini 2.5 and ChatGPT 4.1, 4o, Deepseek, etc, for my work in most cases.

I suspect that I experience some performance throttling with Gemini 2.5 in my Windsurf setup because it's just not as good as anecdotal reports by others, and benchmarks.

I also seem to run up against a kind of LLM laziness sometimes when they seemingly can't be bothered to answer a challenging prompt ... a consequence of load balancing in action perhaps.

lcfcjs6 · 2025-05-19T19:49:25 1747684165

Windsurf is about to lose its ability to use other models since it got bought by OpenAI. Still very cool tool though!

mbesto · 2025-05-19T19:49:29 1747684169

Who cares about sentiment when you can just look at a proxy for usage: https://openrouter.ai/rankings

EDIT: Specifically: https://openrouter.ai/rankings/programming?view=week

Karrot_Kream · 2025-05-19T19:05:54 1747681554

Gemini hit the top of a bunch of leaderboards recently so it probably prompted folks to try Gemini out and they found it useful.

Sajarin · on Dec 15, 2024

Blaming the “system” is easy but is it the whole picture?

How much of it is due to culture? Teachers in western countries are not as respected as teachers in other parts of the world. A few teachers abuse their authority and that results in outrage and lawsuits from parents, rightfully so.

I can imagine in many schools in the US, if a cellphone ban were to be implemented, there would be a large outcry from parents on how restrictive or overreaching that policy would be. Even if the net positives (as shown in the article) are proven to outweigh the pragmatic concerns (i.e I might need to be in communication with my child) why take the risk?

Not to be supporter of “the man” but it seems unfair to point the finger at a system that takes steps to preserve itself without also acknowledging the hostile environment in which it operates.

Parents have greater zeal in suing the school than they have in attending open board meetings.

eimrine · on Dec 15, 2024

> Teachers in western countries are not as respected as teachers in other parts of the world.

It can not be true for most of Asian countries with a really rich history of beating bad students.

threeseed · on Dec 15, 2024

I wouldn't group all Western countries together.

The US has always been unique in having a very libertarian, freedom at all costs culture.

For example in Australia we have recently banned children from using social networks and this was supported by about 80% of the population.

graemep · on Dec 15, 2024

Is there no resistance to things like having to adult having to prove their age to social networks? How is that going to be done, BTW?

threeseed · on Dec 15, 2024

The same way it has been done for years when you sign up for a mobile plan etc.

You verify your age using either passport, driver's license, digital ID etc.

There are plenty of services that provide this.

graemep · on Dec 16, 2024

So you have to show your ID to social networks. Very intrusive.

Sajarin · on Nov 20, 2024

As a previously precocious young teen, I would have agreed with your point but as someone who is now a boring adult I disagree.

Projects are often and will continue to be judged by their marketing. There are many such cases of "I'm X years old and I made Y" posts on Hacker News reaching the front page. As a founder, you should use whatever you can to get eyeballs on your product. As a hacker, you should try to make something you think is cool.

While it is obviously cool to have something novel or technologically interesting to showcase, the value is often less in the actual product and more in the nostalgia and reminder that we too, as boring adults, were once younger hackers.

Let's not be so hard on each other. I think it's a pretty well designed landing page (although the mobile website needs some work in terms of responsiveness.) I don't think this is similar to Anki because there's no spaced repetition or flashcard retrieval involved (from what I could tell).

It does seem like a tool for cheating which is somewhat questionable. I do like the idea of a young hacker today figuring out how to automate their homework, but I think the tool can be a bit more tailored and more ethical if it focused on a specific use case students would equally pay for (ex: AP test prep)

Imustaskforhelp · on Nov 21, 2024

> As a founder, you should use whatever you can to get eyeballs on your product.

I am not a founder so maybe my side of reasoning is flawed as a customer but I come in the belief that there is good advertising and bad advertising.

Some people consider both advertising to be good but I don't think so.

For example , the discussion we are having right now could be considered as an example of bad advertising I mean , think about it , why are we discussing about the age of the product's creator in the first place aside from the fact that he tried to catch our precious attention by such advertising.

I also don't think that the current apple intelligence ads are good advertising. They are in the news / It was my first time watching an apple ads intentionally (I have ad blocker) , and I cringed half way through. I felt even righter that as an android user , I am right (maybe it was a self serving bias that because I am an android user , I watched iphone ad to improve my ego)

Maybe its my open source mentality but I am way way more impressed not by marketing fuzzbuzz but rather the merit of the tool , I don't care if its a zero star repo on github , (eg: https://github.com/heroslender/lg-remote) I am his only star on his repo and I love his work that he has done on this project

My line of thinking is simple if the tool has merit (for ideological reasons , I prefer open source) , I am going to use it. But if you think that you can use catchy terms to catch my attention , well sure you got my attention , but in the long term I am going to remember how you got my attention in first place (whether on the basis of merit or marketing fizzbuzz) I really hate the latter