geval

Star

Here are 3 public repositories matching this topic...

geval-labs / geval

Star

Eval-driven release gates for AI applications

open-source evaluation ai-agents llms evals llm-evaluation aievals geval

Updated Feb 11, 2026
TypeScript

Pavansomisetty21 / GEval-Metrics-Analyzing-the-Reliability-of-LLM-Responses

Sponsor

Star

In this we evaluate the LLM responses and find accuracy

llm-evaluation-metrics llm-evals geval

Updated Jul 8, 2025
Python

Wariowaqo / copilotstudio-testing-gevals

Star

Semantic testing for Microsoft Copilot Studio agents using Pytest and DeepEval G-Eval metrics (LLM-as-a-Judge). Generates interactive HTML reports for agent response quality.

pytest llm-evaluation copilot-studio geval

Updated Feb 17, 2026
Python

Improve this page

Add a description, image, and links to the geval topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the geval topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

geval

Here are 3 public repositories matching this topic...

geval-labs / geval

Pavansomisetty21 / GEval-Metrics-Analyzing-the-Reliability-of-LLM-Responses

Wariowaqo / copilotstudio-testing-gevals

Improve this page

Add this topic to your repo