You want stability in your code base, you can't generate piles of code every time a bug comes in or a new features is being requested.As you said, this problem has to do with the stakeholders, nothing to do with AI's ability.
No need to fix code. By then, it would just be cheaper and easier to ask AI to generate new code every time.
These two issues are easier to solve than you might think. I saw a demo that does exactly these — project management. This is not the ChatGPT that you would use via a browser, but a dedicated software interfaces with ChatGPT via API. It also uses up to a million tokens in an hour. The problem lies in translating those requirements into codes. I would actually go as far as saying that project management would get automated first before coding would, because code must follow a very strict formatting standard spanning across multiple files.
What if the newly generated code has new bugs or has business killing performance issues. How will you fix that, just reprompt and pray to rngjesus that everything will be hallucinated well this time. How will users of the end product react when the AI generates completely random user interface after each prompt and deployment. There's a reason why most (semi)professional software teams write almost equal amount of unit, integration, e2e, ui tests code compared to feature code just to make sure the new code doesn't break existing code.