Popular repositories Loading
-
-
RPBench-Auto
RPBench-Auto PublicAn automated pipeline for evaluating LLMs for role-playing.
-
EmergentTTS-Eval-public
EmergentTTS-Eval-public Public[NeurIPS' 25] Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges.
-
-
-
Repositories
Showing 9 of 9 repositories
- tau2-bench Public Forked from sierra-research/tau2-bench
τ-Bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains
boson-ai/tau2-bench’s past year of commit activity - hackathon-m3-api-card-public Public
boson-ai/hackathon-m3-api-card-public’s past year of commit activity - EmergentTTS-Eval-public Public
[NeurIPS' 25] Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges.
boson-ai/EmergentTTS-Eval-public’s past year of commit activity - hackathon-msac-public Public
boson-ai/hackathon-msac-public’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…