tester.ts required fields for open ai models #6

IovetsNikolay · 2026-02-02T14:04:47Z

User description

CodeAnt-AI Description

Require recommendation fields in tester outputs

What Changed

The tester now requires recommendReset and recommendStop booleans in its intermediate review output instead of allowing them to be omitted.
The final review now always requires a recommendation string instead of allowing it to be empty.
Model responses that omit these fields will fail validation, making missing guidance visible immediately.

Impact

✅ Clearer reset/stop guidance in test reports
✅ Fewer missing follow-up recommendations
✅ Earlier detection of incomplete model responses

💡 Usage Guide

Checking Your Pull Request

Every time you make a pull request, our system automatically looks through it. We check for security issues, mistakes in how you're setting up your infrastructure, and common code problems. We do this to make sure your changes are solid and won't cause any trouble later.

Talking to CodeAnt AI

Got a question or need a hand with something in your pull request? You can easily get in touch with CodeAnt AI right here. Just type the following in a comment on your pull request, and replace "Your question here" with whatever you want to ask:

@codeant-ai ask: Your question here

This lets you have a chat with CodeAnt AI about your pull request, making it easier to understand and improve your code.

Example

@codeant-ai ask: Can you suggest a safer alternative to storing this secret?

Preserve Org Learnings with CodeAnt

You can record team preferences so CodeAnt AI applies them in future reviews. Reply directly to the specific CodeAnt AI suggestion (in the same thread) and replace "Your feedback here" with your input:

@codeant-ai: Your feedback here

This helps CodeAnt AI learn and adapt to your team's coding style and standards.

Example

@codeant-ai: Do not flag unused imports.

Retrigger review

Ask CodeAnt AI to review the PR again, by typing:

@codeant-ai: review

Check Your Repository Health

To analyze the health of your code repository, visit our dashboard at https://app.codeant.ai. This tool helps you identify potential issues and areas for improvement in your codebase, ensuring your repository maintains high standards of code health.

codeant-ai · 2026-02-02T14:04:53Z

CodeAnt AI is reviewing your PR.

codeant-ai · 2026-02-02T14:06:57Z

Nitpicks 🔍

🔒 No security issues identified
⚡ Recommended areas for review Schema strictness The zod schemas for the tester agent were changed to require fields that were optional before. If the LLM / provider doesn't return these keys, validation may fail or throw, causing the checkpoint/final-review flows to break. Consider making these fields optional with sensible defaults or adding defensive fallbacks after parse. Final-review assumption The final review schema now requires `recommendation` to be present. If the model omits this field, validation could fail. Also, downstream code already treats `recommendation` as optional; ensure parity between schema and runtime checks.

codeant-ai · 2026-02-02T14:07:28Z

src/ai/tester.ts

      suggestion: z.string().describe('Specific next action recommendation'),
-      recommendReset: z.boolean().optional().describe('Recommend reset() if persistent failures suggest navigation issues'),
-      recommendStop: z.boolean().optional().describe('Recommend stop() if test is fundamentally incompatible or cannot proceed'),
+      recommendReset: z.boolean().describe('Recommend reset() if persistent failures suggest navigation issues'),


Suggestion: The recommendReset field is required in the Zod schema, but the model prompt does not explicitly guarantee it will always be returned, so generateObject can fail validation if the model omits it, causing response to be undefined and silently skipping the progress report logic. [possible bug]

Severity Level: Major ⚠️

- ❌ Progress checkpoint report skipped during test run. - ⚠️ analyzeProgress() returns empty string instead of report. - ⚠️ Test guidance (reset suggestion) may be lost.

Suggested change

recommendReset: z.boolean().describe('Recommend reset() if persistent failures suggest navigation issues'),

recommendReset: z.boolean().optional().describe('Recommend reset() if persistent failures suggest navigation issues'),

Steps of Reproduction ✅

1. Run a test that reaches the progress analysis checkpoint so Tester.analyzeProgress() is executed. The call site is in src/ai/tester.ts inside test() where the loop calls this.analyzeProgress(task, currentState) (see the loop block that conditionally calls analyzeProgress). 2. Inside src/ai/tester.ts, open the analyzeProgress function where the Zod schema is defined (schema declaration starts at line 546 and includes the recommendReset field at line 549). 3. The function builds a prompt and then calls the provider to parse the model output: const model = this.provider.getModelForAgent('tester'); const response = await this.provider.generateObject(..., schema, model); (the generateObject invocation is the provider.parse call that immediately follows the schema definition in the same function). 4. If the LLM response omits the boolean field recommendReset, Zod validation will fail because recommendReset is required (line 549). generateObject will therefore not return a valid response.object, and analyzeProgress will receive response as undefined or a failed parse, causing: const result = response?.object; if (!result) return ''; — the progress checkpoint report is skipped. Reproducing this is achieved by running a test scenario that triggers analyzeProgress and forcing the model to reply without the recommendReset boolean (e.g., with a concise textual assessment that lacks that field).

Prompt for AI Agent 🤖

This is a comment left during a code review. **Path:** src/ai/tester.ts **Line:** 549:549 **Comment:** *Possible Bug: The `recommendReset` field is required in the Zod schema, but the model prompt does not explicitly guarantee it will always be returned, so `generateObject` can fail validation if the model omits it, causing `response` to be undefined and silently skipping the progress report logic. Validate the correctness of the flagged issue. If correct, How can I resolve this? If you propose a fix, implement it and please make it concise.

codeant-ai · 2026-02-02T14:07:28Z

src/ai/tester.ts

-      recommendReset: z.boolean().optional().describe('Recommend reset() if persistent failures suggest navigation issues'),
-      recommendStop: z.boolean().optional().describe('Recommend stop() if test is fundamentally incompatible or cannot proceed'),
+      recommendReset: z.boolean().describe('Recommend reset() if persistent failures suggest navigation issues'),
+      recommendStop: z.boolean().describe('Recommend stop() if test is fundamentally incompatible or cannot proceed'),


Suggestion: The recommendStop field is also required in the Zod schema without the prompt explicitly enforcing its presence, so if the model does not include it, generateObject may fail validation and result handling will be skipped. [possible bug]

Severity Level: Major ⚠️

- ❌ Stop recommendation lost when model omits the flag. - ⚠️ analyzeProgress() may not record persistent failure actions. - ⚠️ Tester misses critical stop/reset guidance.

Suggested change

recommendStop: z.boolean().describe('Recommend stop() if test is fundamentally incompatible or cannot proceed'),

recommendStop: z.boolean().optional().describe('Recommend stop() if test is fundamentally incompatible or cannot proceed'),

Steps of Reproduction ✅

1. Trigger the same code path as previous step: execute a test that leads to Tester.analyzeProgress() being called from the main test loop (the conditional call to analyzeProgress is in src/ai/tester.ts within the main loop). 2. Inspect the schema in analyzeProgress where recommendStop is declared (schema block at lines 546-551, with recommendStop at line 550). 3. The function calls the provider to parse the LLM output via this.provider.generateObject(..., schema, model). If the LLM response includes assessment and suggestion but omits recommendStop, Zod will reject the object because recommendStop is required (line 550). 4. When validation fails, generateObject yields no usable response.object, analyzeProgress treats that as no result (const result = response?.object; if (!result) return ''), and the function exits without adding the expected progress note or advising stop/reset. Reproduce by running any test that causes analyzeProgress to run and configuring the LLM (or using a prompt/seed) so it omits the recommendStop boolean.

Prompt for AI Agent 🤖

This is a comment left during a code review. **Path:** src/ai/tester.ts **Line:** 550:550 **Comment:** *Possible Bug: The `recommendStop` field is also required in the Zod schema without the prompt explicitly enforcing its presence, so if the model does not include it, `generateObject` may fail validation and result handling will be skipped. Validate the correctness of the flagged issue. If correct, How can I resolve this? If you propose a fix, implement it and please make it concise.

codeant-ai · 2026-02-02T14:07:28Z

src/ai/tester.ts

      summary: z.string().describe('Concise overview of the test findings'),
      scenarioAchieved: z.boolean().describe('Indicates if the scenario goal appears satisfied'),
-      recommendation: z.string().optional().describe('Follow-up suggestion if needed'),
+      recommendation: z.string().describe('Follow-up suggestion if needed'),


Suggestion: In finalReview, the new recommendation field is required in the Zod schema but the prompt never asks the model to provide it, so generateObject can fail schema validation if the field is omitted, preventing summaries from being processed. [possible bug]

Severity Level: Major ⚠️

- ❌ Final summary not recorded in task.summary. - ❌ finalReview may not call task.finish(). - ⚠️ Test session ends without final notes.

Suggested change

recommendation: z.string().describe('Follow-up suggestion if needed'),

recommendation: z.string().optional().describe('Follow-up suggestion if needed'),

Steps of Reproduction ✅

1. Execute a test that reaches the end-of-test cleanup so Tester.finalReview(task) is invoked. finalReview is defined in src/ai/tester.ts (the function signature is private async finalReview(task: Test): Promise<void; see the finalReview function block). 2. In finalReview, inspect the Zod schema declared for the final summary (schema block at lines 664-668, where recommendation is currently required at line 667). 3. finalReview then calls the provider to parse the LLM output: const model = this.provider.getModelForAgent('tester'); const response = await this.provider.generateObject(..., schema, model); (the generateObject invocation follows the schema definition in the same function). 4. If the LLM provides summary and scenarioAchieved but does not emit a recommendation string, Zod validation will fail because recommendation is required (line 667). As a result response?.object is undefined and finalReview returns early (if (!result) return;), preventing task.summary assignment and the task.finish() logic from executing. Reproduce by running a complete test and using an LLM reply that contains summary and scenarioAchieved but omits recommendation.

Prompt for AI Agent 🤖

This is a comment left during a code review. **Path:** src/ai/tester.ts **Line:** 667:667 **Comment:** *Possible Bug: In `finalReview`, the new `recommendation` field is required in the Zod schema but the prompt never asks the model to provide it, so `generateObject` can fail schema validation if the field is omitted, preventing summaries from being processed. Validate the correctness of the flagged issue. If correct, How can I resolve this? If you propose a fix, implement it and please make it concise.

codeant-ai · 2026-02-02T14:07:35Z

CodeAnt AI finished reviewing your PR.

DavertMik · 2026-02-02T23:00:41Z

src/ai/tester.ts

      suggestion: z.string().describe('Specific next action recommendation'),
-      recommendReset: z.boolean().optional().describe('Recommend reset() if persistent failures suggest navigation issues'),
-      recommendStop: z.boolean().optional().describe('Recommend stop() if test is fundamentally incompatible or cannot proceed'),
+      recommendReset: z.boolean().describe('Recommend reset() if persistent failures suggest navigation issues'),


This can be confusing as it requires LLM to fill all fields and it would provide recommendation which it was previously about to avoid

Probably best way to remove all that recommended* fields alltogether

And keep only suggestion

Also probably we can use enum if this allowed:

recommendedTool: 'reset|stop|continue'

more required fields for open ai models

42e405d

codeant-ai bot added the size:XS This PR changes 0-9 lines, ignoring generated files label Feb 2, 2026

codeant-ai bot reviewed Feb 2, 2026

View reviewed changes

DavertMik requested changes Feb 2, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tester.ts required fields for open ai models #6

tester.ts required fields for open ai models #6

Uh oh!

IovetsNikolay commented Feb 2, 2026 •

edited by codeant-ai bot

Loading

Uh oh!

codeant-ai bot commented Feb 2, 2026

Uh oh!

codeant-ai bot commented Feb 2, 2026

Uh oh!

codeant-ai bot Feb 2, 2026

Uh oh!

codeant-ai bot Feb 2, 2026

Uh oh!

codeant-ai bot Feb 2, 2026

Uh oh!

codeant-ai bot commented Feb 2, 2026

Uh oh!

DavertMik Feb 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	recommendReset: z.boolean().describe('Recommend reset() if persistent failures suggest navigation issues'),
	recommendReset: z.boolean().optional().describe('Recommend reset() if persistent failures suggest navigation issues'),

	recommendStop: z.boolean().describe('Recommend stop() if test is fundamentally incompatible or cannot proceed'),
	recommendStop: z.boolean().optional().describe('Recommend stop() if test is fundamentally incompatible or cannot proceed'),

	recommendation: z.string().describe('Follow-up suggestion if needed'),
	recommendation: z.string().optional().describe('Follow-up suggestion if needed'),

tester.ts required fields for open ai models #6

Are you sure you want to change the base?

tester.ts required fields for open ai models #6

Uh oh!

Conversation

IovetsNikolay commented Feb 2, 2026 • edited by codeant-ai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

User description

CodeAnt-AI Description

What Changed

Impact

Checking Your Pull Request

Talking to CodeAnt AI

Example

Preserve Org Learnings with CodeAnt

Example

Retrigger review

Check Your Repository Health

Uh oh!

codeant-ai bot commented Feb 2, 2026

Uh oh!

codeant-ai bot commented Feb 2, 2026

Nitpicks 🔍

Uh oh!

codeant-ai bot Feb 2, 2026

Choose a reason for hiding this comment

Uh oh!

codeant-ai bot Feb 2, 2026

Choose a reason for hiding this comment

Uh oh!

codeant-ai bot Feb 2, 2026

Choose a reason for hiding this comment

Uh oh!

codeant-ai bot commented Feb 2, 2026

Uh oh!

DavertMik Feb 2, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

IovetsNikolay commented Feb 2, 2026 •

edited by codeant-ai bot

Loading