Both. Companies are certainly building bigger and bigger clusters for training.
At the same time though, consumer GPUs have gotten significantly faster (compare e.g. an Nvidia RTX 2080 Ti to a GTX 980 Ti), and training algorithms keep improving, with better optimizers becoming widely used (e.g. Adam instead of plain stochastic gradient descent).
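To make the optimizer point concrete, here is a minimal PyTorch-style sketch of that swap; the model, data, and learning rates are purely illustrative placeholders, not anything from a real training setup:

```python
import torch
import torch.nn as nn

# Toy model and data, purely for illustration.
model = nn.Linear(10, 1)
x, y = torch.randn(64, 10), torch.randn(64, 1)
loss_fn = nn.MSELoss()

# Plain SGD: one global learning rate for every parameter.
# optimizer = torch.optim.SGD(model.parameters(), lr=0.01)

# Adam: per-parameter adaptive step sizes (first/second moment estimates),
# which typically needs less tuning and converges in fewer steps.
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

for step in range(100):
    optimizer.zero_grad()
    loss = loss_fn(model(x), y)
    loss.backward()
    optimizer.step()
```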
Architecture search has also produced more efficient building blocks, so networks can reach the same accuracy with far fewer parameters and smaller models (which lowers training cost).
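As one example of such an efficient block (the kind popularized by MobileNet-style architectures, used here just to illustrate the parameter savings, not as the specific result the comment refers to), a depthwise-separable convolution replaces a standard convolution at a fraction of the parameter count:

```python
import torch.nn as nn

def count_params(m):
    return sum(p.numel() for p in m.parameters())

# Standard 3x3 convolution, 256 -> 256 channels.
standard = nn.Conv2d(256, 256, kernel_size=3, padding=1)

# Depthwise-separable version: a per-channel 3x3 depthwise conv
# followed by a 1x1 pointwise conv.
separable = nn.Sequential(
    nn.Conv2d(256, 256, kernel_size=3, padding=1, groups=256),  # depthwise
    nn.Conv2d(256, 256, kernel_size=1),                         # pointwise
)

print(count_params(standard))   # 590,080 parameters
print(count_params(separable))  # 68,352 parameters (~8.6x fewer)
```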