Plus, they're getting real world training data from everyone who either hasn't or doesn't have the ability to opt out of their stuff being used.
For my personal stuff, I don't opt out of training for this very reason. What's more, I resent Stack Overflow and Reddit etc. trying to gate-keep the content that I wanted to give to the community and charge rent for it.
I used to intentionally post question-answer style posts where I would both ask the question,wait for a while, then answer the question on both Reddit and Stack Overflow. I don't do that anymore because I'm not giving them free money if they're not passing some of the benefit on to the community
"... resent SO and Reddit trying to gatekeep": Am curious why you felt they were gatekeeping your content. They are free websites, and anybody who wants/needs to read your content, can.
"... not giving them free money if they're notnpassing some of the benefits ..." - Could you expand on the specific benefits you wanted them to pass on to the community? As a user, being able to find other people's content that is relevant to my current need is already a pretty solid benefit.
> For my personal stuff, I don't opt out of training for this very reason. What's more, I resent Stack Overflow and Reddit etc. trying to gate-keep the content that I wanted to give to the community and charge rent for it.
And AI companies don't charge for their stuff and charge rent?
Too bad library documentation has taken a nosedive for a lot of modern tools. I blame all those automagic documentation/packaging tools that take some 2 line string and your args for each function and turn it into a boilerplate documentation page. You truly learn nothing you wouldn't by just looking at the source code of the function for pages generated like this. The author spends no additional time adding potentially useful information. A far cry from what you get when you man bash for example, where they even teach you how to file a bug report.