Measuring Massive Multitask Language Understanding (MMLU)

From wikieduonline


Latest revision as of 09:58, 3 October 2024

HellaSwag (10-shot)
WinoGrande (5-shot)
ARC Challenge (25-shot)
TriviaQA (5-shot)
TruthfulQA
GSM8K
MATH
HumanEval
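
The "k-shot" labels in the list above give the number of solved examples placed in the prompt before the question being evaluated. A minimal sketch of that idea follows. The function name `build_k_shot_prompt`, the `Q:`/`A:` layout, and the demonstration data are all hypothetical and chosen only for illustration; actual evaluation harnesses use their own per-benchmark prompt templates.

```python
def build_k_shot_prompt(examples, question, k):
    """Prepend the first k solved examples to an unanswered question.

    `examples` is a list of (question, answer) pairs. The "Q:/A:" layout
    is an assumption made for this sketch, not a benchmark's real template.
    """
    shots = examples[:k]
    parts = [f"Q: {q}\nA: {a}" for q, a in shots]
    # The final question is left unanswered for the model to complete.
    parts.append(f"Q: {question}\nA:")
    return "\n\n".join(parts)


# Hypothetical demonstration data, not drawn from any benchmark.
demo = [("2 + 2 = ?", "4"), ("3 + 5 = ?", "8"), ("7 - 1 = ?", "6")]
prompt = build_k_shot_prompt(demo, "6 + 3 = ?", k=2)
print(prompt)
```

With `k=2` only the first two demonstration pairs are included, so a "5-shot" or "25-shot" setting would differ only in how many solved examples precede the final question.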

See also

MATH
LLM, MLLM, LoRA, LLaMA, LLaMA3, QLoRA, Falcon, PaLM 2, Gemini, Mixtral 8x7B, BitNet, Measuring Massive Multitask Language Understanding (MMLU), NVLM


