I agree with your point in general - there's no use in being skeptical if you call stuff fake at the first sign. Innocent until proven guilty or whatever.
You're right that the image could be clipart, and a human could have made those typos, but there are some non-human tells in here other than that.
The P in protection is a lowercase capital, and one of the r's in irŕritation is accented. The latter could've been mistyped on a phone keyboard, but in that case it would've been autocorrected, and that still doesn't explain the P.
There's also some weird GPT-like descriptions that are thrown in for no reason. Why does the bandana being "fashionable" matter? How are arm warmers "subtle", exactly?
As another commentor pointed out, a T-shirt with a graphic is more likely to make you recognisable, so that part's just wrong. I also don't get why "tactical/military bags" should be "avoided"? Seems like the kinda misinformation AI would spout.
What I really don't get here is why someone would use AI to generate the entire thing, rather than only using it for the graphic (still morally questionable but excusable) and adding the text in manually (something you can do in MS paint).
Person who took this video is an actual hero. I don't want to imagine what could have happened to that woman without them.
Also: don't make the mistake of looking at the replies to this on Twitter. I forgot just how bad it was, but nearly every top reply is by verified accounts trying to justify this somehow. It's actually insane to me the complete lack of empathy these people show - past party lines and political differences, isn't this obviously a bad thing that shouldn't have happened?