s2n-bignum-bench Leaderboard
A Practical Benchmark for Evaluating Low-Level Code Reasoning of LLMs
Metric: Pass@1
Category: All, ARM FC, x86 FC, Prog State, Bit Vector, Generic