
Phase 2: Add Code Interpreter for Data Analysis #5

Merged
san360 merged 1 commit into main from feature/phase2-code-interpreter
May 18, 2026

@san360
Owner

@san360 san360 commented May 18, 2026

Summary

  • Adds code_interpreter tool alongside existing web_search
  • Extends system prompt with ## Data Analysis section
  • Evaluation now runs all 8 queries (Phase 1 + Phase 2)
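A minimal sketch of how a null `phase_filter` could translate into "run every case", assuming each evaluation case carries a `phase` field. The case ids, the `phase` field, and the `select_cases` helper are hypothetical illustrations, not taken from the repository.

```python
from typing import Optional

# Hypothetical evaluation cases; ids and the "phase" field are assumptions.
CASES = [
    {"id": "q1", "phase": 1},
    {"id": "q2", "phase": 1},
    {"id": "q3", "phase": 2},
]


def select_cases(cases: list[dict], phase_filter: Optional[int]) -> list[dict]:
    """Return every case when phase_filter is None, else only the matching phase."""
    if phase_filter is None:
        return list(cases)
    return [case for case in cases if case["phase"] == phase_filter]


if __name__ == "__main__":
    # With no filter, Phase 1 and Phase 2 cases are evaluated together.
    print([case["id"] for case in select_cases(CASES, None)])
```

Treating `None` as "no filter" keeps the Phase 1 cases in every run, which is what makes a regression in the older queries visible.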

Changes

| File | Change |
|---|---|
| agents/tech-trends-agent.json | Added code_interpreter alongside web_search, phase → "2" |
| prompts/tech-trends-agent.md | Added ## Data Analysis (Phase 2) section |
| evals/eval-config.json | phase_filter → null (run all cases) |
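The agent definition change can be pictured roughly as below. This is a sketch under assumptions: the actual schema of `agents/tech-trends-agent.json` is not shown in this PR, so the `name`, `phase`, and `tools` keys and the `add_tool` helper are hypothetical.

```python
import json


def add_tool(agent: dict, tool_type: str) -> dict:
    """Return a copy of the agent definition with tool_type enabled exactly once."""
    tools = list(agent.get("tools", []))
    if not any(tool.get("type") == tool_type for tool in tools):
        tools.append({"type": tool_type})
    return {**agent, "tools": tools}


# Assumed shape of the Phase 1 definition (web_search only).
phase1 = {
    "name": "tech-trends-agent",
    "phase": "1",
    "tools": [{"type": "web_search"}],
}

# Phase 2 keeps web_search, adds code_interpreter, and bumps the phase marker.
phase2 = {**add_tool(phase1, "code_interpreter"), "phase": "2"}

if __name__ == "__main__":
    print(json.dumps(phase2, indent=2))
```

Building the Phase 2 definition from a copy leaves the Phase 1 dictionary unchanged, so the two can be diffed when reviewing.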

What to check

  • Phase 1 queries still score at or above threshold (no regression)
  • Phase 2 data analysis queries score acceptably
  • Agent correctly uses code interpreter for calculation queries
  • After merge, deploy-prod.yml commits updated artifact
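The no-regression check in the list above could be expressed as a simple gate. The sketch below assumes per-query scores on a numeric scale and a single threshold; the score values, the query ids, and the `failing_cases` helper are made up for illustration and are not results from this PR.

```python
def failing_cases(scores: dict[str, float], threshold: float) -> list[str]:
    """Return the ids of cases whose score falls below the threshold, sorted."""
    return sorted(case_id for case_id, score in scores.items() if score < threshold)


if __name__ == "__main__":
    # Illustrative numbers only; not taken from the evaluation run.
    example_scores = {"phase1-q1": 4.5, "phase1-q2": 3.0, "phase2-q1": 4.0}
    failures = failing_cases(example_scores, threshold=3.5)
    if failures:
        print("Below threshold:", ", ".join(failures))
    else:
        print("All cases at or above threshold")
```

A score equal to the threshold passes here, matching the "at or above threshold" wording of the checklist.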

Phase

Phase 2 of 3: web search plus code interpreter. Phase 3 is the model upgrade.

@github-actions
Contributor

✅ Agent Deployment & Evaluation Report

🤖 Agent Details

| Property | Value |
|---|---|
| Agent | tech-trends-agent |
| Version | 22 |
| Semver | 0.0.0-pr.5 |
| Phase | 2 |
| Model | gpt-4.1 |
| Commit | ecf6352 |
| Timestamp | 2026-05-18 14:36:55 UTC |

📊 Pipeline Results

| Step | Status | Details |
|---|---|---|
| Deploy to TEST | PASSED | Agent version 22 deployed |
| Smoke Test | PASSED | Invoked agent via Responses API |
| Foundry Evaluation | PASSED | Evaluated with golden dataset |

🛠️ Tools Configuration

Tool Enabled
web_search
code_interpreter

🤖 Updated automatically by the CI pipeline · 2026-05-18 14:36:55 UTC

@san360 san360 merged commit 7315a50 into main May 18, 2026
1 check passed
@san360 san360 deleted the feature/phase2-code-interpreter branch May 18, 2026 14:44