⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".
-
Updated
Feb 23, 2026 - Jupyter Notebook
⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".
[KDD 2026] An automatic, extensible Framework to Evaluate User-Proxy Agents for Human-Likeness. 🌟 Star if you like it!
📊 Daily auto-updated snapshots of all Arena AI (LMSYS Chatbot Arena) leaderboards — LLM, Vision, Code, Video, Image & more. Structured JSON with historical tracking.
A simple script to export LMArena.ai conversations to Markdown when the session hits the token limit.
A technical guide and live-tracking repository for the world's top AI models, specialized by coding, reasoning, and multimodal performance.
A simple JavaScript utility to force-unlock disabled textareas and buttons on LMArena.ai when the UI freezes or the AI gets stuck in an infinite loop.
🔍 Detect anonymous LLM models with this Python tool, designed to automate the identification of the "riftrunner" model on lmarena.ai.
Export Chatbot Arena conversations to Markdown files to bypass session limits and preserve full chat context including hidden thinking blocks.
30 conversational LLM datasets (~7.7M rows) normalized to one unified schema and published as a single HuggingFace dataset with per-source configs.
ML ensemble for automated AI prompt scoring — 66.7% accuracy across 17,393 prompts, validated via GPT-4 at 77.3% accuracy and 83.0% F1
A 2026 historical tracker and data analysis tool for the LMSYS Chatbot Arena ELO ratings, including coding and reasoning sub-categories.
Fine-tuning and comparing DeBERTa-v3, RoBERTa, and ELECTRA on the Chatbot Arena human-preference task with a shared Siamese architecture.
Rank LLM APIs by cost-effectiveness against Arena ELO scores · 按 Arena ELO 性价比对 LLM API 实时定价排名
LLM evaluation arena with blind side-by-side comparison, ELO ratings, and category leaderboards. Like LMSYS Chatbot Arena, but self-hosted.
Add a description, image, and links to the chatbot-arena topic page so that developers can more easily learn about it.
To associate your repository with the chatbot-arena topic, visit your repo's landing page and select "manage topics."