I see a possible solution that may work in a short term: Instead of displaying a...

adrianN · on July 29, 2012

Do OCR on every frame, perform majority voting on the result. You just made the spammer's task easier.

dchichkov · on July 29, 2012

Here is an example of a problem that would be hard for computer to solve:

http://www.youtube.com/watch?v=4G4y79ZbaBs

muyuu · on July 29, 2012

How so? the fixed part can be easily extracted. If it also moved (while morphing) then I guess it would be hard, but fixed dots in a moving background would take just a few frames for a computer to solve.

Raticide · on July 29, 2012

You just diff each frame and keep the parts that don't change much. Very simple to solve.

dchichkov · on July 29, 2012

Just move letters slightly. And make them morph a bit. Would still be obvious to a human, but a computer trying to average anything would fail miserably.

This is an unsolved computer vision problem.

odin1415 · on July 29, 2012

which pixels stay black in all frames?

mgurlitz · on July 29, 2012

Captchas are hard because there's only so much an algorithm can extract from spatial information. Computers are excellent with temporal data, given essentially unlimited memory for past video frames. Computers benefit more from increased information than humans do.

dchichkov · on July 29, 2012

This is computer vision, an I'm somewhat an expert in the area. I can tell that video sequence recognition is _much_ harder problem than image recognition.

For example, if you would show an letter made out of random noise moving through random noise, current computer vision algorithms would not be able to recognize anything. And you would pick out that letter immediately. Human visual subsystem is really amazing in that sense.

donatzsky · on July 29, 2012

It should be possible to do this with an animated GIF. Do you have any references/examples I could use as a starting point?

dchichkov · on July 29, 2012

Oh. I remember reading some vision paper and in the supplement materials there've been a couple of videos with letters moving. Doubt, I'll be able to find it that easily.

Should be relatively easy to code with any library that can draw a text on a bitmap. Like PIL, matplotlib, etc. Use ffmpeg to make a video out of frames.

1. draw letters (just black/white) masks; 2. fill letters with noise; 4. fill background with noise; 5. copy letters using a mask onto background, using X,Y as loc; 6. add a little bit of new noise to letters; 8. modify X,Y coordinates (move letters SLIGHTLY); 9. go to step 4.

donatzsky · on July 31, 2012

I made a proof-of-concept implementation and, well, it works, but I suspect they're too difficult for many people. They can be made easier by emphasizing one of the colors in the letters, but that opens them up to more traditional attacks.

http://www.anaerob.dk/animcap/k,w.gif

http://www.anaerob.dk/animcap/r,g,b,c,m,w.gif

http://www.anaerob.dk/animcap/r,g,b,c,m,y,k,w.gif

http://www.anaerob.dk/animcap/r,g,b,c,m,y,w.gif

http://www.anaerob.dk/animcap/r,g,b,c,m,y.gif

http://www.anaerob.dk/animcap/r,g,b,c,m.gif

http://www.anaerob.dk/animcap/r,g,b,m,y.gif

http://www.anaerob.dk/animcap/r,g,b,m.gif

http://www.anaerob.dk/animcap/r,g,b,y,k,w.gif

http://www.anaerob.dk/animcap/r,g,b,y,w.gif

http://www.anaerob.dk/animcap/r,g,b,y.gif

http://www.anaerob.dk/animcap/r,g,b.gif

realo · on July 29, 2012

This is simple, brillant. The best solution I have seen ever.

Do you have a patent already? :)