This is because you haven't given it a tool to verify the task is done. TDD work...

rohansood15 · 2025-08-10T06:34:45 1754807685

What do you think it is 'mocking'? It is exactly the behavior that would make the tests work. And unless I give it access to production, it has no way to verify tasks like how values (in this case secrets/envs) are being passed.

Plus, this is all besides the point. Simon argued that the model hallucinates less, not a specific product.