mmlutqasafimhellaswaggsm8knqagi_englishCRUXEval-outputDS1000CRUXEval-inputarc_challengembpplcb_codegenmbpp+piqasiqahumanevalhumaneval+0510152025
model_familydeepseek-coderopencodeinterpreter-dsQwen1.5llamadeepseek-llmllama2Mixtral-8Meta-Llama-3deepseek-basedeepseek-instructcodellamawizardcoderLLama3DSCodermeta-llama-Llama-3Qwen-Qwen1.5deepseek-ai-deepseek-codercodegenbenchmark_idsignal to noise