
AgentRE-Bench: Can LLM Agents Reverse Engineer Malware
CybersecurityMalwareReverse EngineeringAI Agents
A team has developed an agentic benchmark named AgentRE-Bench to evaluate the ability of LLM-based agents to perform reverse engineering of threats. The project aims to measure the performance of these agents in this field. The results obtained are presented as interesting. The authors are soliciting feedback from the community on this work.