Available image generators can already produce those images, even though they were never trained on them. Once a neural network can detect or generate two separate concepts, it can detect or generate their overlap. The result obviously won’t be as fine-tuned, but it can still turn out scarily accurate.
I can’t imagine that having someone watch 3 months x 13 hours of real-time security footage is worth the 10k, unless the insurance pays his salary.
But now I know why stores sometimes have their most expensive stuff just sitting there in full view. It’s not just for the customers’ viewing.