Skip to content

Commit 9c65ed1

Browse files
committed
base-deep => gpt-5.4
1 parent bca3aa6 commit 9c65ed1

4 files changed

Lines changed: 27 additions & 27 deletions

File tree

agents/base2/base-deep.ts

Lines changed: 21 additions & 21 deletions
Original file line numberDiff line numberDiff line change
@@ -29,7 +29,7 @@ Use the spawn_agents tool to spawn specialized agents to help you complete the u
2929
- **Spawn multiple agents in parallel:** This increases the speed of your response **and** allows you to be more comprehensive by spawning more total agents to synthesize the best response.
3030
- **Sequence agents properly:** Keep in mind dependencies when spawning different agents. Don't spawn agents in parallel that depend on each other.
3131
- Spawn context-gathering agents (file pickers, code-searcher, directory-lister, glob-matcher, and web/docs researchers) before making edits.
32-
- Spawn the thinker-codex after gathering context to solve complex problems or when the user asks you to think about a problem. (gpt-5-agent is a last resort for complex problems)
32+
- Spawn the thinker-gpt after gathering context to solve complex problems or when the user asks you to think about a problem. (gpt-5-agent is a last resort for complex problems)
3333
- Implement code changes using direct file editing tools.
3434
- Prefer apply_patch for existing-file edits. Use write_file only for creating or replacing entire files when that is simpler.
3535
- Spawn commanders sequentially if the second command depends on the the first.
@@ -62,13 +62,13 @@ For other questions, you can direct them to codebuff.com, or especially codebuff
6262
6363
[ Phase 1 — Codebase Context & Research: You spawn file-pickers, code-searchers, and researchers (web/docs) in parallel to find relevant files and research external libraries/APIs, then read the results to build understanding ]
6464
65-
[ Phase 2 — Spec: You draft an initial SPEC.md, then use ask_user iteratively to refine it, then run thinker-codex critique loop until clean ]
65+
[ Phase 2 — Spec: You draft an initial SPEC.md, then use ask_user iteratively to refine it, then run thinker-gpt critique loop until clean ]
6666
67-
[ Phase 3 — Plan: You write a detailed PLAN.md with all implementation steps, run thinker-codex critique loop, then write implementation todos ]
67+
[ Phase 3 — Plan: You write a detailed PLAN.md with all implementation steps, run thinker-gpt critique loop, then write implementation todos ]
6868
6969
[ Phase 4 — Implement: You fully implement the spec using direct file editing tools ]
7070
71-
[ Phase 5 — Review Loop: You spawn code-reviewer-codex, fix any issues found, and re-run the reviewer until no new issues are found ]
71+
[ Phase 5 — Review Loop: You spawn code-reviewer-gpt, fix any issues found, and re-run the reviewer until no new issues are found ]
7272
7373
[ Phase 6 — Validate: You run unit tests, add new tests, fix failures, and attempt E2E verification by running the application ]${noLearning ? '' : `
7474
@@ -156,9 +156,9 @@ Draft a spec first, then refine it with the user:
156156
5. **Do NOT ask obvious questions.** If you are >80% confident you know what the user would choose, just make that choice and move on. Only ask questions where the user's input would genuinely change the outcome.
157157
6. As the LAST question before finishing this phase, ask one open-ended question giving the user a chance to share any final feedback, concerns, or changes to the spec. For example: "Before I finalize the spec, is there anything else you'd like to add, change, or flag about the requirements?"`}
158158
${noAskUser ? '3' : '7'}. Iteratively critique the spec:
159-
a. Spawn thinker-codex to critique the spec — ask it to identify missing requirements, ambiguities, contradictions, overlooked edge cases, or technical approach issues.
159+
a. Spawn thinker-gpt to critique the spec — ask it to identify missing requirements, ambiguities, contradictions, overlooked edge cases, or technical approach issues.
160160
b. If the thinker raises valid critiques, update SPEC.md to address them.
161-
c. After updating, you MUST spawn thinker-codex again to re-critique the revised spec.
161+
c. After updating, you MUST spawn thinker-gpt again to re-critique the revised spec.
162162
d. Repeat until the thinker finds no new substantive critiques. Do NOT skip the re-critique — every revision must be verified.
163163
${noAskUser ? '4' : '8'}. Do NOT proceed until you are confident the spec captures the full picture.
164164
@@ -171,17 +171,17 @@ Create a detailed implementation plan, iteratively critique it, and save it alon
171171
- **Dependencies / Ordering**: Note which steps depend on others and the recommended order of implementation.
172172
- **Risk Areas**: Flag any steps that are tricky, uncertain, or likely to need iteration.
173173
2. Iteratively critique the plan:
174-
a. Spawn thinker-codex to critique the plan — ask it to identify gaps, missed edge cases, better approaches, ordering issues, or unnecessary steps.
174+
a. Spawn thinker-gpt to critique the plan — ask it to identify gaps, missed edge cases, better approaches, ordering issues, or unnecessary steps.
175175
b. If the thinker raises valid critiques, update PLAN.md to address them.
176-
c. After updating, you MUST spawn thinker-codex again to re-critique the revised plan.
176+
c. After updating, you MUST spawn thinker-gpt again to re-critique the revised plan.
177177
d. Repeat until the thinker finds no new substantive critiques. Do NOT skip the re-critique — every revision must be verified.
178178
3. Write implementation todos (the second phase of todos) — one todo per plan step, plus todos for phases 5-${noLearning ? '6' : '7'}.
179179
180180
## Phase 4 — Implement
181181
182182
Fully implement the spec:
183183
184-
1. For complex problems, spawn the thinker-codex agent to help find the best solution.
184+
1. For complex problems, spawn the thinker-gpt agent to help find the best solution.
185185
2. Implement all changes using direct file editing tools. Prefer apply_patch for edits.
186186
3. Implement ALL requirements from the spec — do not leave anything partially done.
187187
4. Narrate what you are doing as you go.
@@ -190,9 +190,9 @@ Fully implement the spec:
190190
191191
Iteratively review until the code is clean:
192192
193-
1. Spawn code-reviewer-codex to review all changes.
193+
1. Spawn code-reviewer-gpt to review all changes.
194194
2. If the reviewer finds ANY issues, fix them.
195-
3. After fixing, you MUST spawn code-reviewer-codex again to re-review.
195+
3. After fixing, you MUST spawn code-reviewer-gpt again to re-review.
196196
4. Repeat steps 1-3 until the reviewer finds no new issues. Do NOT skip the re-review — every fix must be verified.
197197
198198
## Phase 6 — Validate
@@ -234,9 +234,9 @@ Capture learnings for future sessions:
234234
- Reference the specific session directory where each piece of knowledge was learned (e.g. "(from .agents/sessions/2025-01-15-add-auth/)")
235235
- Only include insights that are genuinely useful for future work — not generic advice
236236
3. Iteratively improve lessons and skills:
237-
a. Spawn thinker-codex to critique your LESSONS.md and skill file edits — ask it to identify missing insights, improvements to existing entries, and brainstorm additional skills that could be created or updated based on the work done in this session.
237+
a. Spawn thinker-gpt to critique your LESSONS.md and skill file edits — ask it to identify missing insights, improvements to existing entries, and brainstorm additional skills that could be created or updated based on the work done in this session.
238238
b. If the thinker suggests valid improvements or new skill ideas, update the relevant files accordingly.
239-
c. After updating, you MUST spawn thinker-codex again to re-critique and brainstorm further.
239+
c. After updating, you MUST spawn thinker-gpt again to re-critique and brainstorm further.
240240
d. Repeat until the thinker finds no new substantive improvements or skill ideas. Do NOT skip the re-critique — every revision must be verified.`}${noAskUser ? '' : `
241241
${noLearning ? '1' : '4'}. Use suggest_followups to suggest ~3 next steps the user might want to take.`}
242242
@@ -255,8 +255,8 @@ export function createBaseDeep(options?: {
255255
const { noAskUser = false, noLearning = false } = options ?? {}
256256
return {
257257
publisher,
258-
model: 'openai/gpt-5.3-codex',
259-
displayName: 'Buffy the Codex Orchestrator',
258+
model: 'openai/gpt-5.4',
259+
displayName: 'Buffy the GPT Orchestrator',
260260
spawnerPrompt:
261261
'Advanced base agent that orchestrates planning, editing, and reviewing for complex coding tasks',
262262
inputSchema: {
@@ -296,8 +296,8 @@ export function createBaseDeep(options?: {
296296
'researcher-web',
297297
'researcher-docs',
298298
'commander',
299-
'thinker-codex',
300-
'code-reviewer-codex',
299+
'thinker-gpt',
300+
'code-reviewer-gpt',
301301
'gpt-5-agent',
302302
'context-pruner',
303303
],
@@ -307,14 +307,14 @@ export function createBaseDeep(options?: {
307307
308308
**Planning todos** (write at start): Phase 1 → Phase 2 → Phase 3
309309
1. Context & Research — file-pickers + code-searchers + researchers in parallel, read results
310-
2. Spec — draft SPEC.md, ${noAskUser ? '' : 'iterative ask_user to refine (skip obvious Qs), open-ended final Q, '}thinker-codex critique loop
311-
3. Plan — write PLAN.md, thinker-codex critique loop
310+
2. Spec — draft SPEC.md, ${noAskUser ? '' : 'iterative ask_user to refine (skip obvious Qs), open-ended final Q, '}thinker-gpt critique loop
311+
3. Plan — write PLAN.md, thinker-gpt critique loop
312312
313313
**Implementation todos** (write after Plan): one todo per plan step + phases 5-${noLearning ? '6' : '7'}
314314
4. Implement — fully build the spec using file editing tools
315-
5. Review Loop — code-reviewer-codex → fix → re-review until clean
315+
5. Review Loop — code-reviewer-gpt → fix → re-review until clean
316316
6. Validate — run tests + typechecks, add new tests, do E2E verification${noLearning ? '' : `
317-
7. Lessons — write LESSONS.md, update/create skills, iterative thinker-codex brainstorm loop`}`,
317+
7. Lessons — write LESSONS.md, update/create skills, iterative thinker-gpt brainstorm loop`}`,
318318
handleSteps: function* ({ params }) {
319319
while (true) {
320320
// Run context-pruner before each step.

agents/e2e/base-deep.e2e.test.ts

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ import { beforeAll, describe, expect, it } from 'bun:test'
99
import { $ } from 'bun'
1010

1111
import baseDeep from '../base2/base-deep'
12-
import thinkerCodex from '../thinker/thinker-codex'
12+
import thinkerCodex from '../thinker/thinker-gpt'
1313

1414
import type { PrintModeEvent } from '@codebuff/common/types/print-mode'
1515

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -3,9 +3,9 @@ import type { SecretAgentDefinition } from '../types/secret-agent-definition'
33
import { createReviewer } from './code-reviewer'
44

55
const definition: SecretAgentDefinition = {
6-
id: 'code-reviewer-codex',
6+
id: 'code-reviewer-gpt',
77
publisher,
8-
...createReviewer('openai/gpt-5.3-codex'),
8+
...createReviewer('openai/gpt-5.4'),
99
}
1010

1111
export default definition
Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -4,11 +4,11 @@ import type { SecretAgentDefinition } from '../types/secret-agent-definition'
44

55
const definition: SecretAgentDefinition = {
66
...thinker,
7-
id: 'thinker-codex',
8-
model: 'openai/gpt-5.3-codex',
7+
id: 'thinker-gpt',
8+
model: 'openai/gpt-5.4',
99
outputSchema: undefined,
1010
outputMode: 'last_message',
11-
instructionsPrompt: `You are the thinker-codex agent. Think deeply about the user request and when satisfied, write out your response.
11+
instructionsPrompt: `You are the thinker-gpt agent. Think deeply about the user request and when satisfied, write out your response.
1212
1313
The parent agent will see your response. DO NOT call any tools. No need to spawn the thinker agent, because you are already the thinker agent. Just do the thinking work now.`,
1414
handleSteps: function* () {

0 commit comments

Comments
 (0)