Ethics guardrails: rejection hard-constraint + unified/extended anti-exhaustion (audit P4)#5
Open
BeamusWayne wants to merge 1 commit into
Open
Ethics guardrails: rejection hard-constraint + unified/extended anti-exhaustion (audit P4)#5BeamusWayne wants to merge 1 commit into
BeamusWayne wants to merge 1 commit into
Conversation
Part A — clear rejection as a non-overridable hard constraint: - SKILL.md run-rule 4 upgraded: once the other person clearly rejects (不喜欢你 / 只是朋友 / 把你当朋友 / 喜欢别人), message/confess/crisis/ progress must NOT output any pursuit advice — overrides every signal, score, stage, and rejection subtype. - crisis_handler C-1: reclassified "我把你当朋友 / 我们只是朋友" as clear rejection (was "温和型 -> 可能有余地"); removed the "不能表现出来" (hide-your-intent) line; default post-rejection action is graduated disengagement. "继续追" downgraded from a default branch to a tightly gated exception requiring the OTHER person to re-initiate explicit mutual signals, with mandatory frequency/intensity decay; removed the "让时间冲淡被拒标签" framing. Part B — anti-exhaustion precheck unified and extended: - New prompts/burnout_precheck.md is the single source: unified triggers (scope = all /simp, OR logic), 劝歇 output, escalation policy, and a no-profile session-level fallback. - Extended the precheck to /simp message, /simp analyze, /simp crisis (the real anxiety-peak entry points), not just daily/progress. - daily_coach Step 0 and progress_tracker risk row now reference the single source instead of carrying divergent thresholds (was OR vs AND, different counting scopes). - Escalation strengthened: on a double-hit (high frequency AND anxiety words) the tool holds firm and refuses pursuit advice instead of yielding on the second ask.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Follow-up to #4 (P1–P3). This is the ethics/behavioral phase of the audit, kept in its own PR for focused review. Prompt/SKILL only — no Python.
Part A — clear rejection is a non-overridable hard constraint
SKILL.mdrun-rule 4: once the other person clearly rejects (不喜欢你 / 只是朋友 / 把你当朋友 / 喜欢别人),message/confess/crisis/progressmust NOT output any pursuit advice — overrides every signal, score, stage, and rejection subtype.crisis_handler.mdC-1:Part B — anti-exhaustion precheck unified and extended
prompts/burnout_precheck.md= single source of truth: unified triggers (scope = all/simp, OR logic), 劝歇 output, escalation policy, no-profile session-level fallback./simp message,/simp analyze,/simp crisis— the real anxiety-peak entry points — not just daily/progress.daily_coachStep 0 andprogress_trackerrisk row now reference the single source instead of carrying divergent thresholds (was OR vs AND, different counting scopes).Why separate from #4
#4 was mechanical/low-risk (test reliability, data integrity, naming consistency). This PR changes how the product behaves around rejection and user anxiety — it encodes ethical stances and benefits from focused review.
Test plan
python3 -m pytest -q→ 225 passed (prompt-only change, no regression)burnout_precheckwired into SKILL.md + daily/progress/message/analyze/crisisStill pending (not in this PR)