
Anthropic Opus 4.6's Limited Effectiveness in Detecting Vulnerabilities
CybersecurityVulnerabilitiesAISoftwareTesting
A benchmark test evaluated Anthropic Opus 4.6’s ability to detect simple vulnerabilities in C code, revealing it identified roughly 25% of flaws. The model exhibited a high false positive rate and inconsistent results across multiple runs. Techniques such as judge agents and requiring justifications for findings improved performance slightly, but overall effectiveness remained limited.