Popular repositories Loading
-
alpaca_eval
alpaca_eval PublicForked from tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Jupyter Notebook
-
FastChat
FastChat PublicForked from lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Python
-
lm-evaluation-harness
lm-evaluation-harness PublicForked from EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Python
-
portuguese-llm-bench
portuguese-llm-bench PublicUnified evaluation suite for Portuguese LLMs — covering language understanding, reasoning, safety, and toxicity benchmarks.
Python
If the problem persists, check the GitHub status page or contact support.


