LLM-as-a-Judge evaluation platform for ecommerce search. Scores relevance, computes IR metrics, and flags quality issues across multiple retail verticals.
python search autocomplete information-retrieval ecommerce autosuggest ndcg ecommerce-search relevance-evaluation llm-eval llm-evaluation llm-as-a-judge search-eval search-quality
Updated Mar 15, 2026 - Python