Because most of these things are not multi-trillion-dollar ideas. "We found a way to make illustrators, copyeditors, and paralegals, and several dozen other professions, somewhat obsolete" in no way justifies the valuations of OpenAI or Nvidia.
>Because most of these things are not multi-trillion-dollar ideas.
That's right, but there's more. When you think about the cost of compute and power for these LLM companies, they have no choice. It MUST be a multi-trillion-dollar idea or it's completely uninvestable. That's the only way they can sucker more and more money into this scheme.
We paid for newspapers and they ran ads. We paid for cable TV and it had ads. We went to the cinema and watched ads.
Ad-free paid services were a brief aberration, essentially a bait-and-switch: "see how much nicer we are from the old-school competitors". Now that the competitors are gone, Netflix is doing ads, Amazon is doing ads... why wouldn't Spotify?
I hate it, but the reality is that we groan on online forums but don't actually leave.
It's very much easier to ignore ads in a newspaper. It's not so easy when you are forced to listen to them before the thing you paid for. It's not the same.
"But you don't leave" is an excuse the human-hating producers tell themselves. But it's normal for consumers to be protected from the petty misery of natural market forces by legislation. We have enacted many such controls.
maybe you did, but I did not. my attention is valuable to me and I do not pay people to waste it. i had tapes, CDs, mp3s, and paid spotify. this is a well-proven market.
So you're advocating for stronger and more invasive controls?...
I think this is a sensible compromise. It gives parents more control than before without relying on shady third-party software or without turning every platform into a cop. Yeah, it also aligns with Meta's interests, but so what?
The age attestation solutions pursued by the EU are far more invasive in this respect, even though they notionally protect identity. They mean that the "default" internet experience is going to be nerfed until you can present a cryptographic proof that you're worthy.
> I think this is a sensible compromise. It gives parents more control than before without relying on shady third-party software or without turning every platform into a cop.
It doesn't give parents any control whatsoever. It just forces the OS to tell every website your child goes to how old they are. It doesn't require those websites to hide certain content for certain age groups. It doesn't define what types of content are appropriate for which age groups, it just makes sure that every advertiser bidding on your child's eyes knows what age range they fall into to.
If anything this takes control away from parents because even the cases where a website does their best to restrict content based on which age the OS tells them your kid is, it's the website setting the rules and not the parents. You might think that your 16 year old can read an article about STDs, but if the website your kid visits doesn't think so you as the parent don't get any choice.
With 3rd party software parents are controlling what software is used, they have the ability to decide which kinds of content are appropriate for their children and can be allowed and which types of content should be blocked. They can black/whitelist as they see fit. All of the power is in the parent's hands. This law gives parents one choice only: "Do I honestly tell my OS how old my child is". That's the end of the parent's involvement and the end of their power.
I think it shows the limits of hand curation. It's a tiny, human-reviewed slice of the "small web", only allowing a subset of blogs... but if you select the "programming" category and click around for a short while, you get a fair amount of obvious AI slop.
I don't think it's Kagi's fault, but I guess it's depressing in a way. A lot of "small web" bloggers dream of being a part of the "big web", and when they get a cheat button, they have no second thoughts about mashing it.
This is not how corporate fraud usually happens. You don't tamper with the quarterly report, especially since it gets audited. You tamper with the input data close to the source. For example, you record revenue that hasn't happened yet or you delay the recording of losses.
It's easy to hand-curate a list of 5,000 "small web" URLs. The problem is scaling. For example, Kagi has a hand-curated "small web" filter, but I never use it because far more interesting and relevant "small web" websites are outside the filter than in it. The same is true for most other lists curated by individual folks. They're neat, but also sort of useless because they are too small: 95% of the things you're looking for are not there.
The question is how do you take it to a million? There probably are at least that many good personal and non-commercial websites out there, but if you open it up, you invite spam & slop.
I mainly use Kagi Small Web as a starting point of my day, with my morning coffee. Especially now when categories are added, always find something worth reading. The size here does not present a problem as I would usually browse 20-30 sites this way.
Right, but that basically works as a retro alternative to scrolling through social media. If you're looking for something specific, it's simultaneously true that there's a small web page that answers your question and that it's not on any "small web" list because the owner of the webpage never submitted it there, or didn't meet the criteria for inclusion.
For example, I have several non-commercial, personal websites that I think anyone would agree are "small web", but each of them fails the Kagi inclusion criteria for a different reason. One is not a blog, another is a blog but with the wrong cadence of posts, etc.
The (artificial) barrier to entry is there for a reason - one person maintains the entire project and because it is fairly technical to submit the acceptance rate has been close to 99%.
Looking at the criteria again, I can think of at least three things that arbitrarily exclude large swathes of the small web:
1) The requirement that it needs to be a blog. There's plenty of small-web sites of people who obsess over really wonderful and wacky stuff (e.g., https://www.fleacircus.co.uk/History.htm) but don't qualify here.
2) The requirement that it needs to be updated regularly. Same as above - I get that infrequently updated websites don't generate a "daily morning" feed, but admitting them wouldn't harm in any way.
3) Blanket ban on Substack-like platforms while allowing Blogspot, Wordpress.com, YouTube, etc. Bloggers follow trends, so you're effectively excluding a significant proportion of personal blogs created in the last six years, including the stuff that isn't monetized or behind interstitials. The outcomes are pretty weird: for example, noahpinionblog.blogspot.com is on your list, but noahpinion.blog is apparently no longer small web.
> Why not have 2000 hand curated directories instead?
It depends on what you're trying to achieve. If you want to have a personal feed of stories from interesting people, 50 is probably enough to give you some interesting daily reading. But if you want to build a "small web" search lens, you absolutely need to cover. For example, Kagi is billing a "small web" search filter, but it excludes a lot of the small web because they only allow actively-maintained blogs and only a subset of them.
My approach operates under the assumption that good, non-commercial webpages will be similar to other good webpages. Slop, SEO spam, and affiliate content will resemble other such content.
So a similarity-based graph/network of webpages should cluster good with good, bad with bad. That is what I've seen so far, anyway.
With that, you just need to enter the graph in the right place, something that is fairly trivial.
> Google that now optimizes their algorithm for monetization and not usefulness.
I don't think they do that. Instead, "usefulness" is mostly synonymous with commercial intent: searching for <x> often means "I want to buy <x>".
Even for non-commercial queries, I think the sad reality is that most people subconsciously prefer LLM-generated or content-farmed stuff too. It looks more professional, has nice images (never mind that they're stock photos or AI-generated), etc. Your average student looking for an explanation of why the sky is blue is more interested in a TikTok-style short than some white-on-black or black-on-gray webpage that gives them 1990s vibes.
TL;DR: I think that Google gives the average person exactly the results they want. It might be not what a small minority on HN wants.
Google and most search engines optimize for what is most likely to be clicked on. This works poorly and creates a huge popularity bias at scale because it starts feeding on its own tail: What major search engines show you is after all a large contributor to what's most likely to be clicked on.
The reason Marginalia (for some queries) feels like it shows such refreshing results is that it simply does not take popularity into account.
> I think that Google gives the average person exactly the results they want.
There is some truth in this, but to me it's similar to saying that a drug dealer gives their customers exactly what they want. People "want" those things because Google and its ilk have conditioned them to want those things.
Ah, the classic, forever-untestable "it's just around the corner" hypothesis.
I've lived through multiple "it's gonna be over in 12-18 months" arguments since November 2022. It's a truism for any technology to say that it's going to get better over time. But if you're convinced that "AI has already won", why not make a specific prediction? What jobs are going to be obsolete by when?
> Cities have a lower carbon footprint per capita, lower land use per capita, people own fewer cars, use public transportation more often. If everyone lived in a city, nature would be better off.
I think that's apples to oranges. If we didn't have cities, we also wouldn't have eight billion people in the world.
A better question for the parent is how do you enforce that vision of everyone living on their 20 acres in harmony with nature? This is not something that capitalism or some other -ism does to us. Your neighbor will have children, these children will have children, and before long, you have a settlement of 50 people on these 20 acres, most certainly no longer living in harmony with nature. At that point, they must build infrastructure. That infrastructure may be feasible to build if they pool their resources with the neighbors. Boom, you have a village, then a town, then a city.
So what's the solution here? Do we forcibly sterilize people? Lock them up if they have children? What's the anti-growth strategy we're actually advocating for?
Because the habitable surface of the planet is less than 100 million square kilometers and only a fraction of that is suitable for subsistence farming. The only reason we can accommodate 8 billion is that the majority of them live in high-density settlements and that food is grown on an industrial scale elsewhere.
This is obviously not a reversible trend. People having close proximity to one another, creating economies of scale where everyone does what they are best at instead of everyone doing everything for themselves is what allows big cities to be possible.
I'm sure all of this was inevitable as there likely hasn't ever been a time where humans were not getting together to form communities when it was beneficial to do so.
So what's the solution here? Do we forcibly sterilize people? Lock them up if they have children? What's the anti-growth strategy we're actually advocating for?
You don't need to. Fertility rate per woman in wealthy countries decreases every year.
> Do we forcibly sterilize people? Lock them up if they have children?
Oh give me a break. Developed countries have had below-replacement fertility rates for decades.
If your goal is to reduce birth rates, we do the things that we already know do that naturally: comprehensive, fact-based sex ed; cheap and easy access to contraceptives; social safety nets and support; etc etc etc.
It's not a mystery and it's not even difficult. You don't need to jump to straight to abhorrent crimes against humanity.
Yeah, most charitably, it seems to be some sort of an LLM art project. And it's another day when we engage with slop because it happens to say something we like.
reply