A Streamlit tool to evaluate and compare LLM responses across 5 RLHF-inspired quality dimensions — Instruction Following, Truthfulness, Prompt Correctness, Writing Quality, and Verbosity.
-
Updated
Mar 21, 2026 - Python
A Streamlit tool to evaluate and compare LLM responses across 5 RLHF-inspired quality dimensions — Instruction Following, Truthfulness, Prompt Correctness, Writing Quality, and Verbosity.
Predicting student placement based on CGPA and IQ using SVM (Support Vector Machine kernel="rbf").
Add a description, image, and links to the model-eval topic page so that developers can more easily learn about it.
To associate your repository with the model-eval topic, visit your repo's landing page and select "manage topics."