They are also all trained to do well on the same evals, right? So doesn't it just boil down to neural nets being universal function approximators?
They are also all trained to do well on the same evals, right? So doesn't it just boil down to neural nets being universal function approximators?