Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
12 changes: 9 additions & 3 deletions agents/tech-trends-agent.json
Original file line number Diff line number Diff line change
@@ -1,13 +1,19 @@
{
"agent_name": "tech-trends-agent",
"phase": "1",
"definition": {
"model": "${GPT_DEPLOYMENT}",
"instructions_file": "prompts/tech-trends-agent.md",
"tools": []
"tools": [
{ "type": "web_search" }
]
},
"eval": {
"dataset": "evals/golden-dataset.json",
"phase_filter": null,
"phase_filter": "1",
"config": "evals/eval-config.json"
}
},
"_model_history": [
{ "model": "gpt-4o-2024-11-20", "from": "2025-01-10", "to": null, "reason": "initial" }
]
}
12 changes: 6 additions & 6 deletions evals/eval-config.json
Original file line number Diff line number Diff line change
@@ -1,16 +1,16 @@
{
"evaluators": [
"TaskAdherenceEvaluator",
"RelevanceEvaluator",
"GroundednessEvaluator",
"CoherenceEvaluator"
"builtin.task_adherence",
"builtin.relevance",
"builtin.groundedness",
"builtin.coherence"
],
"thresholds": {
"task_adherence": 0.80,
"relevance": 0.75,
"groundedness": 0.75,
"coherence": 0.80
},
"phase_filter": null,
"notes": "Phase 2: All queries evaluated — both web search (Phase 1) and data analysis (Phase 2)."
"phase_filter": "1",
"notes": "Phase 1: Only web search queries evaluated. Phase 2 data analysis queries excluded."
}
12 changes: 0 additions & 12 deletions prompts/tech-trends-agent.md
Original file line number Diff line number Diff line change
Expand Up @@ -27,15 +27,3 @@ Always structure responses as:
## Tone
Professional, objective, and jargon-aware. Assume the user is a technology
professional who does not need basic concepts explained.

## Data Analysis (Phase 2)
You now have access to a code interpreter. Use it when:
- The user asks you to calculate, compare, or rank numerical data
- You have retrieved structured data (tables, CSVs) and analysis would add value
- You need to produce a formatted comparison table from raw information

When using code interpreter:
1. First retrieve the data via web search
2. Then write and run Python code to process or compare it
3. Present results with the code output clearly labelled
4. Always show the source of the raw data alongside the computed result
Loading