🔎 CRUXEval Sample Explorer 🔎
CRUXEval is a benchmark that complements HumanEval and MBPP by measuring code reasoning, understanding, and execution capabilities.
CRUXEval-I
CRUXEval-O