Really? That hasn't been my experience. Out of 20 Python scripts, at least half have some error, and some of those look right until you run them. A few involve hallucinations, like calling non-existent methods.
Actually, yes, that’s true, but within a few iterations it usually repairs those flaws when given the errors and the requested changes. I always run the draft script immediately and iterate on it. Sometimes I stay in the same context, but often I’ll take its script, change it a bit, then start a new context with that version to keep adding features.
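
The run-and-capture step of that loop can be sketched roughly as follows. This is a minimal sketch, assuming the draft is saved to a file; `run_draft` is a hypothetical helper name, and pasting the captured traceback back into the chat is left as a manual step.

```python
import subprocess
import sys


def run_draft(path, timeout=60):
    """Run a draft script in a fresh interpreter.

    Returns (succeeded, error_text). `run_draft` is a hypothetical helper,
    not part of any tool mentioned in this thread.
    """
    try:
        result = subprocess.run(
            [sys.executable, path],  # same Python that runs this helper
            capture_output=True,     # collect stdout and stderr
            text=True,               # decode output as str
            timeout=timeout,         # stop drafts that hang
        )
    except subprocess.TimeoutExpired:
        return False, f"Timed out after {timeout} seconds"
    # A non-zero exit code means the script raised or exited with an error;
    # the traceback, if any, is in stderr.
    return result.returncode == 0, result.stderr
```

Calling `ok, errors = run_draft("draft.py")` (with `draft.py` as a placeholder file name) gives a traceback in `errors` whenever `ok` is false, and that text is what gets fed back for the next iteration.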