Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The problem is typically day 1000 problems: The database broke, nobody really understands all the stuff and dependencies by the kubernetes helm chart and still you have to fix it.

Downtime is now calculated in days and not hours.



Recover to a snapshot in one to two hours, then debug

Dump the snapshot into a managed DB short-term if you have to if the team can’t wrangle the controller




Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: