hey,
I've recently been going down a rabbit hole of 4chan history. I'm wondering whether there are any archives of 4chan out there, regardless of image availability.
"This is a text-only compilation of 4chan threads, primarily from the years 2006-2008. The total unique threads in this collection is roughly 10 million."
It seems like it wouldn't be hard to train a model on this era of 4chan and create a simulacrum. Posts would happen at a certain frequency with all the users being simulated. I wonder if you could even allow user posts and have the model reply in a realistic way (bullying and all)?
The remarkable thing is how effective bullying is at moderating idiots. Astroturfing state actors aside, the site maintains a high level of on-topic civility.
There's something inherently wrong with ordering posts by vote count, both because a confident/deceptive idiot can spread his lies like a virus and because it's too easy to game.
I suppose you need to be deep in the trenches to find anyone still willing to have discourse there. But as another commenter said, some of the niche boards, like subreddits, maintain some pretty high quality conversation.
Slashdot solved this ages ago with its moderation system. Vote-based moderation has been flawed since its inception, because people downvote when they disagree, not based on the quality of the post.
Since governments started getting more involved, more and more data has been lost forever. It’s good to see that at least projects like archive keep it. But I’m afraid that will be lost too.