You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/autonomous-os-plan.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -79,8 +79,8 @@ speed, scale, history, and reliability.
79
79
| Product narrative | 99 | keep narrative aligned as hosted Network and Intelligence launch |
80
80
| CLI readiness | 100 | five-mode terminal with full-screen live cockpit exists; raw exchange records remain operator-owned proof |
81
81
| Engine runtime | 100 | public production-parity OODA report plus redacted live trading evidence exist; raw exchange records remain operator-owned external proof |
82
-
| Self-evolution loop | 98 | public memory, research command chain, genesis proposal classification, production-parity OODA reports, local apply, rollback, paper-first evolve gates, and agent architecture bounds exist; property-based failure-mode coverage remains |
83
-
| Safety and risk |96| autonomous-loop failure taxonomy exists; real exchange chaos drills, property-based safety-gate tests, and external review remain |
82
+
| Self-evolution loop | 98 | public memory, research command chain, genesis proposal classification, production-parity OODA reports, local apply, rollback, paper-first evolve gates, agent architecture bounds, and property-based safety coverage exist; protected live-code evolution remains human-reviewed|
83
+
| Safety and risk |98| autonomous-loop failure taxonomy and bounded property-based safety-gate tests exist; real exchange chaos drills and external review remain |
84
84
| API contracts | 100 | public runtime contracts are complete; hosted compatibility is commercial launch work |
85
85
| Deployment | 96 | live Railway proof, external production log-drain evidence |
86
86
| Observability and audit | 97 | checksum-chained runtime bus, hash-chained signable decision journal, local timestamp binding, external anchor packet, verifier, and signed evidence bundles exist; periodic external receipt operation, metrics backend, and log drains remain |
Copy file name to clipboardExpand all lines: docs/failure-modes-autonomous-loop.md
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -41,13 +41,13 @@ Unknown answers mean fail closed.
41
41
42
42
| ID | Failure mode | Detection | Blast radius | Rollback | Journal entry | Alerting | Test or evidence |
43
43
|---|---|---|---|---|---|---|---|
44
-
| FM-AUTO-001 | Agent hallucinates a strategy and burns through paper budget. | Strategy registry rejects unknown runners; paper budget breaker sees order count, notional, or drawdown drift. | Paper budget for the local session. Live capital should be zero because paper-first is enforced. | Pause evolve, disable the strategy, revert the candidate config, reset paper budget after review. |`zero.evolve.run.v1`, `zero.immune.v1`, rejected `zero.paper.decision.v1`, rollback receipt. | CLI/TUI safety banner, `/immune`, metrics counter, optional operator notification. |`engine/tests/test_evolve.py`, `engine/tests/test_safety.py`; property budget fuzzing remains required before 100/100. |
45
-
| FM-AUTO-002 |`evolve` produces a config that passes tests but fails at runtime. | Runtime health marks candidate as failed; production-parity OODA emits live-shadow mismatch or runtime exception. | Candidate branch and paper canary only. Protected paths must not auto-apply to live code. |`zero.evolve.rollback_receipt.v1`, restore original hash, mark proposal quarantined. | Apply receipt, rollback receipt, runtime cycle failure event. |`/runtime-parity`, `/evolve`, CI failure, operator terminal warning. |`engine/tests/test_evolve.py`, `engine/tests/test_runtime.py`; needs property tests for malformed candidate shapes. |
44
+
| FM-AUTO-001 | Agent hallucinates a strategy and burns through paper budget. | Strategy registry rejects unknown runners; paper budget breaker sees order count, notional, or drawdown drift. | Paper budget for the local session. Live capital should be zero because paper-first is enforced. | Pause evolve, disable the strategy, revert the candidate config, reset paper budget after review. |`zero.evolve.run.v1`, `zero.immune.v1`, rejected `zero.paper.decision.v1`, rollback receipt. | CLI/TUI safety banner, `/immune`, metrics counter, optional operator notification. |`engine/tests/test_evolve.py`, `engine/tests/test_safety.py`, `engine/tests/test_property_safety.py`. |
45
+
| FM-AUTO-002 |`evolve` produces a config that passes tests but fails at runtime. | Runtime health marks candidate as failed; production-parity OODA emits live-shadow mismatch or runtime exception. | Candidate branch and paper canary only. Protected paths must not auto-apply to live code. |`zero.evolve.rollback_receipt.v1`, restore original hash, mark proposal quarantined. | Apply receipt, rollback receipt, runtime cycle failure event. |`/runtime-parity`, `/evolve`, CI failure, operator terminal warning. |`engine/tests/test_evolve.py`, `engine/tests/test_runtime.py`, `engine/tests/test_property_safety.py`. |
46
46
| FM-AUTO-003 | Agent and human edit the journal concurrently. | Journal append detects non-monotonic offset, checksum break, lock failure, or replay mismatch. | Local audit trail for the affected runtime; execution must pause if journal integrity is unknown. | Stop writers, preserve both copies, replay from last good head, restore durable volume snapshot if needed. |`zero.journal.integrity_failure.v1` or incident audit export. | P1 journal anomaly alert, CLI refusal on live preflight, runbook escalation. |`engine/tests/test_bus.py`, `docs/runtime-bus.md`; decision-journal hash chain is the next implementation cycle. |
47
-
| FM-AUTO-004 | Hyperliquid returns malformed response and the agent retries N times. | Adapter schema validation fails; retry budget reaches zero; rate-limit breaker opens. | Read-only market/account freshness or one blocked live submission. Order submissions must not retry blindly. | Mark venue degraded, fail risk-increasing actions, keep reduce-only controls available. |`exchange_error`, reconciliation packet, immune breaker event. |`/hl/reconcile`, `/immune`, `/live-cockpit`, metrics exchange-error counter. |`engine/tests/test_hyperliquid.py`, `engine/tests/test_live.py`, `engine/tests/test_reconciliation.py`; malformed-response fuzzing remains required before 100/100. |
48
-
| FM-AUTO-005 | Stale memory promotes an outdated pattern. | Memory stats report stale source window; genesis confidence drops; proposal age exceeds policy. | Proposal quality and paper canary time, not live execution. | Retire stale memory, regenerate proposal from fresh outcomes, require a new paper canary. |`zero.memory.entry.v1`, `zero.genesis.proposal.v1`, research report. |`/memory`, `/genesis`, docs gap or safety-review issue. |`engine/tests/test_memory.py`, `engine/tests/test_genesis.py`; needs property tests for staleness thresholds. |
47
+
| FM-AUTO-004 | Hyperliquid returns malformed response and the agent retries N times. | Adapter schema validation fails; retry budget reaches zero; rate-limit breaker opens. | Read-only market/account freshness or one blocked live submission. Order submissions must not retry blindly. | Mark venue degraded, fail risk-increasing actions, keep reduce-only controls available. |`exchange_error`, reconciliation packet, immune breaker event. |`/hl/reconcile`, `/immune`, `/live-cockpit`, metrics exchange-error counter. |`engine/tests/test_hyperliquid.py`, `engine/tests/test_live.py`, `engine/tests/test_reconciliation.py`, `engine/tests/test_property_safety.py`. |
48
+
| FM-AUTO-005 | Stale memory promotes an outdated pattern. | Memory stats report stale source window; genesis confidence drops; proposal age exceeds policy. | Proposal quality and paper canary time, not live execution. | Retire stale memory, regenerate proposal from fresh outcomes, require a new paper canary. |`zero.memory.entry.v1`, `zero.genesis.proposal.v1`, research report. |`/memory`, `/genesis`, docs gap or safety-review issue. |`engine/tests/test_memory.py`, `engine/tests/test_genesis.py`, `engine/tests/test_property_safety.py`. |
49
49
| FM-AUTO-006 | Research command ingests prompt-injected or unsupported external claims. | Source classifier marks untrusted or unsupported evidence; research report carries evidence quality flags. | Paper-only research report and proposal queue. | Discard report, quarantine source, regenerate with trusted sources only. |`zero.research.report.v1` with rejected source metadata. |`/research`, safety-review issue when live policy would be affected. |`engine/tests/test_research.py`; adversarial source fixtures should be expanded. |
50
-
| FM-AUTO-007 | Model gateway produces unsafe, expensive, or unavailable output. | Gateway budget, timeout, health, and audit checks fail closed. | Evaluation quality degradation; order path must not depend on unverified model output alone. | Fall back to local/mock provider, lower confidence, or reject decision. | Model gateway audit packet and decision rejection reason. |`/model-gateway/health`, metrics, operator warning. |`engine/tests/test_model_gateway.py`; cost-limit property tests remain required. |
50
+
| FM-AUTO-007 | Model gateway produces unsafe, expensive, or unavailable output. | Gateway budget, timeout, health, and audit checks fail closed. | Evaluation quality degradation; order path must not depend on unverified model output alone. | Fall back to local/mock provider, lower confidence, or reject decision. | Model gateway audit packet and decision rejection reason. |`/model-gateway/health`, metrics, operator warning. |`engine/tests/test_model_gateway.py`, `engine/tests/test_property_safety.py`. |
51
51
| FM-AUTO-008 | Paper/live shadow diverges during production-parity OODA. |`zero.runtime.production_parity.v1` reports mismatch or live-shadow fail-closed evidence. | Live promotion blocked; paper session continues. | Disable promotion, capture audit export, create regression fixture. | Runtime parity report, decision-stack packet, live-shadow refusal. |`/runtime-parity`, CLI red status, safety-review issue. |`engine/tests/test_runtime.py`, `engine/tests/test_live.py`. |
52
52
| FM-AUTO-009 | MCP client asks ZERO to place an order or mutate state. | MCP safety catalog has no risk-increasing tools; unknown methods are rejected. | None if server remains read-only. | Keep server read-only, revoke unsafe registry submission, patch transcript. | MCP transcript refusal and safety catalog resource. | MCP smoke failure, CI failure. |`engine/tests/test_mcp.py`, `scripts/mcp_transcript.py --check`. |
53
53
| FM-AUTO-010 | Public Network or Intelligence packet leaks private identifiers. | Privacy regression fixtures detect wallet-like, raw order ID, trace token, or private journal fields. | Public artifact exposure until publication is stopped. | Stop publishing, rotate unsafe packet, patch serializer, mark proof stale. | Public packet hash, privacy regression incident export. | P1 privacy regression alert, CI failure. |`engine/tests/test_proof_privacy.py`, `scripts/proof_privacy_regression.py`. |
0 commit comments