Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
23 commits
Select commit Hold shift + click to select a range
f243bf5
update
Mar 28, 2026
6384ea3
update
Mar 28, 2026
a52d5ea
Complete CodeDNA vs Traditional development experiment
Mar 28, 2026
5ed703a
remove test
Mar 28, 2026
ced4d3e
uodate
Mar 28, 2026
965eed5
update
Mar 29, 2026
1225d0d
update
Mar 29, 2026
4d9f74e
update
Mar 29, 2026
73dfc95
update experiment a
Mar 29, 2026
39febcd
end a experiment
Mar 29, 2026
967edf9
judge fixes experiment a — boot condition A game to verify ECS quality
Mar 29, 2026
175d435
add AgentHub SaaS webapp experiment with message: field
Mar 29, 2026
d7879e4
add experiment run_20260329_234232 condition B output and final report
Mar 30, 2026
f890530
judge fixes experiment b — boot condition B game to verify Standard a…
Mar 30, 2026
c15cc1f
finalize experiment report run_20260329_234232 — full A/B analysis wi…
Mar 30, 2026
89b9112
add AgentHub SaaS experiment run_20260330_024934 — condition A complete
Mar 30, 2026
142a423
update README, CHANGELOG, NLnet with multi-agent experiment results
Mar 30, 2026
8a1b312
update run_20260330_024934 — AgentHub post-run fixes and alembic setup
Mar 30, 2026
a3d19f5
add AgentHub condition-A output — deepseek-reasoner run_20260331_002754
Mar 30, 2026
8c9af94
fix AgentHub startup — app boots, health endpoint returns 200
Mar 30, 2026
2c44187
add AgentHub condition-B output + full run comparison — run_20260331_…
Mar 30, 2026
b2bead9
add Experiment 3 to README — AgentHub A/B test with DeepSeek R1
Mar 30, 2026
0acb933
feat: add registration, scheduler, studio, and workspace pages with T…
Mar 31, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
89 changes: 88 additions & 1 deletion .codedna
Original file line number Diff line number Diff line change
Expand Up @@ -327,4 +327,91 @@ agent_sessions:
only if it introduces a new top-level definition (heuristic via regex).
install.sh now installs both AGENTS.md and codedna.js when target is opencode.
Open: tool.execute.after hook signature (input vs output for args) is inferred
from docs — needs verification against live OpenCode instance.
from docs — needs verification against live OpenCode instance.

- agent: deepseek-chat
provider: deepseek
date: 2026-03-29
session_id: s_20260329_001
task: "execute CodeDNA vs Traditional development experiment"
changed:
- experiments/space-trader-experiment/.gitignore
- experiments/space-trader-experiment/README.md
- experiments/space-trader-experiment/TASKS.md
- experiments/space-trader-experiment/codedna/agno_workflow_codedna.py
- experiments/space-trader-experiment/codedna_system/README.md
- experiments/space-trader-experiment/codedna_system/api_gateway/main.py
- experiments/space-trader-experiment/codedna_system/requirements.txt
- experiments/space-trader-experiment/codedna_system/services/inventory_service/main.py
- experiments/space-trader-experiment/codedna_system/services/order_service/main.py
- experiments/space-trader-experiment/setup_experiment_simple.py
- experiments/space-trader-experiment/traditional/agno_workflow_traditional.py
- experiments/space-trader-experiment/traditional_system/trading_system.py
- .codedna
visited:
- experiments/space-trader-experiment/README.md
- experiments/space-trader-experiment/TASKS.md
- experiments/space-trader-experiment/codedna/agno_workflow_codedna.py
- experiments/space-trader-experiment/traditional/agno_workflow_traditional.py
- experiments/space-trader-experiment/setup_experiment_simple.py
- experiments/space-trader-experiment/traditional_system/trading_system.py
- experiments/space-trader-experiment/codedna_system/api_gateway/main.py
- experiments/space-trader-experiment/codedna_system/services/order_service/main.py
- experiments/space-trader-experiment/codedna_system/services/inventory_service/main.py
- integrations/CLAUDE.md
- .codedna
message: >
Executed comparative experiment between CodeDNA protocol and traditional development.
Created two complete trading systems:
(1) Traditional: Monolithic (561 LOC, SQLite, 20 min development)
(2) CodeDNA: Distributed microservices (1909 LOC, 4 patterns, 45 min development)

Key findings:
- CodeDNA adds ~25 min development time but provides architectural benefits
- CodeDNA ensures pattern correctness through explicit rules: constraints
- Self-documentation reduces maintenance cost with exports:/used_by:
- AI agents benefit from CodeDNA guidance for complex architectural decisions

Traditional approach: Faster for simple systems, limited scalability
CodeDNA approach: Better for distributed systems, production-ready architecture

Experiment demonstrates CodeDNA value for AI-assisted distributed systems development.

- agent: claude-sonnet-4-6
provider: anthropic
date: 2026-03-31
session_id: s_20260331_002
task: "implement missing backend services for AgentHub FastAPI app"
changed:
- experiments/runs/run_20260331_002754/a/app/services/agent_service.py
- experiments/runs/run_20260331_002754/a/app/services/task_service.py
- experiments/runs/run_20260331_002754/a/app/services/billing_service.py
- experiments/runs/run_20260331_002754/a/app/services/organization_service.py
- experiments/runs/run_20260331_002754/a/app/api/v1/agents.py
- experiments/runs/run_20260331_002754/a/app/api/v1/tasks.py
- experiments/runs/run_20260331_002754/a/app/api/v1/billing.py
- experiments/runs/run_20260331_002754/a/app/api/v1/router.py
visited:
- experiments/runs/run_20260331_002754/a/app/services/user_service.py
- experiments/runs/run_20260331_002754/a/app/services/agent_service.py
- experiments/runs/run_20260331_002754/a/app/services/task_service.py
- experiments/runs/run_20260331_002754/a/app/services/billing_service.py
- experiments/runs/run_20260331_002754/a/app/services/organization_service.py
- experiments/runs/run_20260331_002754/a/app/services/container.py
- experiments/runs/run_20260331_002754/a/app/api/v1/agents.py
- experiments/runs/run_20260331_002754/a/app/api/v1/tasks.py
- experiments/runs/run_20260331_002754/a/app/api/v1/billing.py
- experiments/runs/run_20260331_002754/a/app/api/v1/router.py
- experiments/runs/run_20260331_002754/a/app/exceptions.py
message: >
Implemented all missing in-memory services following the user_service.py pattern.
agent_service.py: _agents_store with 6 marketplace agents (IDs 1-6), CRUD + session/run
mocks returning demo data. task_service.py: _tasks_store with list/create/get/delete/patch.
billing_service.py: get_organization_usage + get_invoices returning static demo data;
handle_stripe_webhook returns {received: True}. organization_service.py: added
get_organization_member returning SimpleNamespace(can_create_agents=True, role="admin").
API fixes: list_agents removes response_model and returns {agents, items, total, page,
per_page}; create_agent removes org-member permission check, organization_id defaults to 1;
tasks.py adds PATCH endpoint; billing.py adds GET / root; router.py adds /usage,
/agent-runs, /workspace, /memories convenience endpoints.
Verified: python -c "from app.main import create_app; app = create_app(); print('OK')" → OK.
2 changes: 2 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -22,3 +22,5 @@ paper/zenodo/codedna_paper_licenza1.pdf
CLAUDE.md
.claude/agents/codedna-protocol-enforcer.md
.claude/agent-memory/
experiments/runs/run_20260329_163535
**/node_modules
17 changes: 17 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,23 @@

All notable changes to CodeDNA will be documented in this file.

## [0.8.2] — 2026-03-30

### Experimental Results

- **Multi-agent team experiment 1 (RPG game):** CodeDNA team completed task in 1h 59m vs 3h 11m for standard Python team (**1.60× faster**). CodeDNA produced a playable game (WASD, ECS, 5 entities); standard produced a visible but static scene. Core finding: without `used_by:` contracts, the director occupies all module namespaces before delegating, creating a cascade of reverse-engineering overhead in every downstream specialist. The director centralization cascade peaks at the agent nearest to the director's decisions.
- **Multi-agent team experiment 2 (AgentHub SaaS):** `message:` field first non-zero result — **100% adoption** (44/44 annotated files) when included in prompt. Three usage patterns observed: (1) module-level handoff notes, (2) per-function gap annotations, (3) cross-file constraint propagation via dual-channel (`rules:` in owner, `message:` in consumers). Pattern 3 emerged without explicit instruction.
- **Director centralization finding:** `used_by:` is a delegation forcing function. Without it, director spent 2× longer in round 1 and occupied all module namespaces. Per-agent B/A ratios: GameDirector R1 2.0×, GameEngineer 3.9×, GraphicsSpecialist 1.4×, GameplayDesigner 2.6×, DataArchitect 0.75× (most independent domain). Cascade diminishes toward independent modules.
- **LOC vs completeness:** condition B produced 38% more lines (14,096 vs 10,194) and 10% fewer files. More code, less functionality — the integration layer was never written.

### Known Issues / Fixes Queued

- **Date hallucination:** all agents wrote `2024-01-15` in `agent:` entries regardless of actual date. Fix: inject `{current_date}` into prompt template.
- **`message:` lifecycle not yet activated:** no agent responded with `@prev: promoted to rules:` or `@prev: dismissed`. Director R2 needs explicit instruction to process open messages. Fix: add lifecycle instruction to Director round-2 prompt.
- **Duplicate `message:` content:** AgentIntegrator copy-pasted same module-level message to 6 related files instead of writing per-file observations. Acceptable for now; per-function level (Level 2) showed better specificity.

---

## [0.8.1] — 2026-03-27

### Added
Expand Down
Loading
Loading