Black boxness isn't a property of the thing, it's a property of our understanding of the thing. Just because we don't understand the how pattern recognition in humans works, doesn't mean the process has no internal structure, or that we couldn't potentially understand it in the future.
Right, and currently we understand neither human vision, nor CNN vision - they are both black boxes. I don't think anyone said they must remain black boxes forever.
Edit: I guess you were referring to this:
> Robust pattern recognition based on 100+'s of factors is just inherently black box.
I think he meant " ... given our current knowledge."