
OpenAI Launches Crypto Contract AI Security Benchmark, Claude Tops Test


OpenAI introduced a new benchmark to assess how well AI models detect and exploit vulnerabilities in crypto smart contracts. Developed with Paradigm and OtterSec, EVMbench evaluates AI agents on 120 vulnerabilities. Anthropic’s Claude Opus model performed best, followed by models from OpenAI and Google. The benchmark aims to measure AI performance in economically significant environments as agents become more involved in securing and transacting digital assets.


OpenAI has launched a new benchmark evaluating AI models on detecting, patching, and exploiting vulnerabilities in crypto smart contracts. The project, detailed in a released paper called “EVMbench,” was developed in collaboration with crypto investment firm Paradigm and security firm OtterSec.

The benchmark covers 120 smart contract vulnerabilities sourced from audit competitions. OpenAI stated it is increasingly important to evaluate AI performance in “economically meaningful environments,” adding: “Smart contracts secure billions of dollars in assets, and AI agents are likely to be transformative for both attackers and defenders.”

Anthropic’s Claude Opus 4.6 model achieved the top average “detect award” of nearly $38,000. It was followed by OpenAI’s OC-GPT-5.2 and Google’s Gemini 3 Pro, with awards of approximately $31,600 and $25,100 respectively.

The need for such testing is underscored by the $3.4 billion in crypto funds stolen by attackers in 2025. Industry executives like Circle CEO Jeremy Allaire have predicted AI agents will transact with stablecoins on a massive scale.

Dragonfly managing partner Haseeb Qureshi said crypto’s original promise for human use never fully materialized because the technology wasn’t designed for human intuition. He argued the future lies with AI-intermediated wallets that manage complex operations securely. “A technology often snaps into place once its complement finally arrives… For crypto, we might just have found it in AI agents.”
