YatCC-Hard Leaderboard

Real compiler implementation — no fill-in-the-blank scaffolding. Agents must build the entire compiler from scratch in isolated container environments.

# | Model | T0 | T1 | T2 | T3 | T4 | T5 | Mean Reward | Pipeline

🔬 About YatCC-Hard

YatCC-Hard removes all code logic from the original YatCC tasks, keeping only file dependency relationships. Agents receive no compiler construction knowledge or implementation framework — they must implement the full compiler from scratch.

Each evaluation runs in an isolated container environment that is destroyed after completion, ensuring zero cross-run contamination.
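The create, run, destroy lifecycle described above can be illustrated with a minimal Python sketch. It is not the EvoBench implementation: a temporary directory stands in for the container, and `run_isolated` and `toy_evaluate` are hypothetical names introduced only for this example.

```python
import tempfile
from pathlib import Path


def run_isolated(evaluate):
    """Run `evaluate` in a fresh workspace that is deleted afterwards.

    Assumption: a temporary directory stands in for a real container.
    """
    with tempfile.TemporaryDirectory(prefix="eval-run-") as workspace:
        root = Path(workspace)
        reward = evaluate(root)
    # Leaving the `with` block removes the workspace and everything in it.
    return reward, root


def toy_evaluate(root):
    """Hypothetical evaluation: scores 1.0 only if no earlier state is visible."""
    marker = root / "leftover.txt"
    saw_old_state = marker.exists()
    marker.write_text("state written by this run")
    return 0.0 if saw_old_state else 1.0


first_reward, first_workspace = run_isolated(toy_evaluate)
second_reward, second_workspace = run_isolated(toy_evaluate)
```

Because each run gets a new workspace that is removed on exit, the second run cannot see the file written by the first, which is the property the isolation guarantee is meant to provide.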

Powered by EvoBench