- Have had an email account for at least 5 years,
- Have checked their spam folder at least once every month over the course of the past two years, and
- Have marked a set of emails as spam on at least 1 day every three months over the course of the past two years.
I'm very confident that, within the next 2 years, we will not have a classifier accurate enough to cause > 80% of these people to simultaneously (a) check their spam folders on less than 1 day per year, and (b) mark emails as spam on less than 1 day per year.
Oh, and to help prevent cheating:
- Let's say the classifier must not have been trained on any emails, anywhere in the world, sent more recently than 1 month prior to the beginning of its trial.
I'm also very tempted to say we don't need to worry about AGI before we can achieve the above for 30% of these people, but I'm less confident in this one.
[1] I suppose, as a practical matter, even if we had such a classifier, this would be untestable without access to everyone's email accounts. So, for the sake of argument, let's say our population under study is itself a simple random sample of 100M accounts from the set of providers who each service > 50M accounts and who are willing to run such a test at the time of the trial.
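To make the trial concrete, here's a minimal sketch in Python of how the eligibility filter, the success condition, and the training cutoff might be scored. Everything here (the `AccountHistory` fields, the function names) is invented for illustration, not part of the original proposal; real scoring would need the participating providers' actual telemetry.

```python
from dataclasses import dataclass
from datetime import date, timedelta

# Hypothetical per-account record. Every field name is an assumption
# invented for this sketch, not from any real dataset.
@dataclass
class AccountHistory:
    account_age_years: float
    months_checked_spam_folder: int      # out of the past 24 months
    quarters_marked_spam: int            # out of the past 8 quarters
    days_checked_spam_in_trial_year: int
    days_marked_spam_in_trial_year: int

def eligible(h: AccountHistory) -> bool:
    """Population filter: the three bullet-point criteria above."""
    return (h.account_age_years >= 5
            and h.months_checked_spam_folder >= 24   # once a month for 2 years
            and h.quarters_marked_spam >= 8)         # 1 day per quarter for 2 years

def stopped_caring(h: AccountHistory) -> bool:
    """Conditions (a) and (b) for one person: 'less than 1 day per year'
    over a one-year trial means 0 days."""
    return (h.days_checked_spam_in_trial_year == 0
            and h.days_marked_spam_in_trial_year == 0)

def classifier_succeeds(population: list[AccountHistory],
                        threshold: float = 0.80) -> bool:
    """The prediction is falsified only if more than `threshold` of the
    eligible people simultaneously satisfy both conditions."""
    pool = [h for h in population if eligible(h)]
    return sum(stopped_caring(h) for h in pool) > threshold * len(pool)

def latest_allowed_training_email(trial_start: date) -> date:
    """Anti-cheating rule: no training email sent after this date."""
    return trial_start - timedelta(days=30)
```

The weaker 30% variant mentioned above would just be `classifier_succeeds(population, threshold=0.30)`.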
That's cheating, because improving AI works on both sides - it makes better classifiers but also better spam generators :)
In the future, with general AI, classifying spam will be a hard problem even for people.
Imagine you get a message that looks like it's from your Facebook friend, written in his catchphrases, telling you about his last trip and how great that travel agency was :) Spam or not?
Quite an interesting point I hadn't considered at all. On the one hand, I'm wondering: what's your suggestion for addressing this with minimal changes to my criteria? On the other hand, I'm wondering: well, if the analog of this is that the spam problem might only get solved with more AGI, then that's only going to make me less likely to be worried in the first place!
I had issues with just taking "current spam data, not trained on that particular data":
- It allows trainers to hard-code future rules based on their experience of what has passed through past filters, even if their model isn't technically trained on this dataset
- You might get similar emails sent to different mailboxes, and the instances not included in the dataset would still be allowed (and I don't really want to go down the rabbit hole of defining a similarity metric between emails)
- I think I want to allow spammers to evolve their capabilities at least using current techniques, which we all presumably agree are "less than AGI". After all, intelligence implies adapting to a dynamic environment. It's not really going to feel like AGI (and certainly not going to make me worry) if AGI looks trivial to outsmart by humans or by less-than-AGI techniques.