I think you misunderstood. I am referring to the fact that in 1966 Marvin Minsky asked Gerald Sussman to "spend the summer linking a camera to a computer and getting the computer to describe what it saw".
We did get close, but it took nearly 50 years, not 3 months.
Similarly, something that looks fairly simple to us right now might turn out to be a lot more difficult.
And they're pretty much there. Have you seen some of the latest results in that field?