I don't see why they don't throw away the model and train it again after making ... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		JoeyBananas on May 5, 2022 \| parent \| context \| favorite \| on: 100 Pages of raw notes released with the language ... I don't see why they don't throw away the model and train it again after making changes. It only took them 1 month.

charcircuit on May 5, 2022 [–]

Because it's faster to start training from a relevant existing model than from random weights

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact