Your example is about saliency and perception. Modeling these to guide lossy compression is an important feature of high-end encoders, but that is largely independent of compression techniques used.
It's possible to do optimal-ish highly compressible dither (it's been done for LZW), but the results are still pretty disappointing compared to even old JPEG.
It's possible to do optimal-ish highly compressible dither (it's been done for LZW), but the results are still pretty disappointing compared to even old JPEG.