A test suite for test suites, is it?
How can I tell that a mutation is
- indeed of undesired behaviour,
- sufficiently distinct from the original that it deserves its own test case?
Especially with AI-generated code, I venture that mutation might actually have a chance of improving the code. Isn’t that what genetic algorithms were about?