This is the primary reason I've not given agents more power than something extremely controlled (i.e. only a function to turn the lights on or off). I was always concerned that these generative models might accidentally do something dumb or annoying, let alone something illegal or harmful.
I don't really see how anyone would ask an AI to do something fully autonomously at this stage, without oversight.
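
For what it's worth, here is a rough sketch of what I mean by "extremely controlled": the agent can only call functions on an explicit allowlist, and anything else is refused. All the names here (`Lights`, `set_lights`, `make_dispatcher`) are made up for illustration and are not from any particular agent framework; how the model's tool request actually reaches `dispatch` is assumed to be handled elsewhere.

```python
from typing import Any, Callable, Dict


class Lights:
    """Hypothetical stand-in for a real light controller."""

    def __init__(self) -> None:
        self.on = False

    def set_power(self, on: bool) -> str:
        # Reject anything that is not a plain boolean.
        if not isinstance(on, bool):
            raise ValueError("'on' must be a boolean")
        self.on = on
        return "lights on" if on else "lights off"


def make_dispatcher(
    tools: Dict[str, Callable[..., str]]
) -> Callable[[str, Dict[str, Any]], str]:
    """Build a dispatcher that only runs tools from the given allowlist."""

    def dispatch(name: str, args: Dict[str, Any]) -> str:
        if name not in tools:
            return f"refused: '{name}' is not an allowed tool"
        try:
            return tools[name](**args)
        except (TypeError, ValueError) as exc:
            # Bad or unexpected arguments are refused, not executed.
            return f"refused: invalid arguments ({exc})"

    return dispatch


lights = Lights()
# The agent gets exactly one capability: switching the lights.
dispatch = make_dispatcher({"set_lights": lights.set_power})

print(dispatch("set_lights", {"on": True}))  # lights on
print(dispatch("unlock_door", {}))           # refused: 'unlock_door' is not an allowed tool
```

The point is that the worst case is bounded by what is on the allowlist, not by what the model decides to attempt.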