build(deps): bump smol-toml from 1.6.0 to 1.6.1 in /site#15
Open
dependabot[bot] wants to merge 205 commits into
Open
build(deps): bump smol-toml from 1.6.0 to 1.6.1 in /site#15dependabot[bot] wants to merge 205 commits into
dependabot[bot] wants to merge 205 commits into
Conversation
…edia Commons - Replace DiceBear persona avatars with actual photos of each actress - Download 26 CC-licensed images (Wikimedia Commons) for all 5 companions jenna_* (6), karen_* (5), catherine_* (5), billie_* (5), alex_* (5) - Default selections: Gallifrey One 2025 (Jenna/Catherine), LACC 2025 (Billie), GalaxyCon (Karen), 2012 portrait (Alex) - Remove companion bios — cards now show epithet, name, actor, role only - All remaining images available in public/images/companions/ for selection Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Martha Jones (mint), Bill Potts (pink), Yasmin Khan (blue), Romana (sage), Ace (terracotta) — with CC-licensed photos and distinct accent colours per card. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
- Replace actress photos with in-character images from Wikimedia Commons (Doctor Who Experience displays, production stills, CC BY-SA licensed) - Yasmin Khan keeps actress photo (no character image on Commons) - Cards now show only character name + role — no actor, epithet, or series - Fix avatar stretching: object-fit: cover + object-position: center top - Remove unused CSS: .companion-series, .companion-epithet, .companion-actor Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
- Restore user-selected actress shots for Clara/Amy/Donna/Rose/River - Martha: Freema Agyeman 2019 face crop (CC BY-SA 2.0, no mic) - Bill: Pearl Mackie by Gage Skidmore (CC BY-SA 3.0) - Yasmin: Mandip Gill Hollyoaks event (CC BY 2.0, no mic) - Romana: Lalla Ward portrait (CC BY 2.0) - Remove Ace → clean 3x3 grid of 9 companions Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…out/people/ - New /about/ page: project overview — methodology, what we do, research, provenance from Greenpeace adversarial thinking, why it's public - Move profile + companion grid to /about/people/ with updated breadcrumbs - Yasmin: swap to higher-quality Mandip Gill convention portrait - Romana: swap to 2014 Geek Fest photo Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
- Add web_*.jpg versions of all Nano Banana portraits (45-75KB each) - Crop bottom 20% from each (removes watermark zone), resize to 600px - Wire up: Clara, Amy, Donna, Rose, River, Yasmin — still need Martha, Bill, Romana - Adrian profile photo updated to web_adrian.jpg Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Re-adds Martha Jones (Policy & Standards Lead), Bill Potts (Data Curation Lead), and Romana (Statistical Validation Lead) with placeholder actress photos pending AI portrait generation. All companions now carry functional titles mapped to actual framework agent roles. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
All 9 companions now have AI-generated portraits (Nano Banana Pro). Replaces placeholder actress photos for the final three team members. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Adds /research/field-context/ — a "why now" page grounding the Failure-First research program in the actual state of the AI field. Covers inference-time compute, documented deceptive alignment findings (o1, Claude 4), embodied AI deployment at scale, agentic long-horizon execution risks, and governance lag. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
… page Previous commit deleted docs/ static assets (index.html, CNAME, .nojekyll, images, assets) because Astro's clean build cycle removed manually-maintained files that git tracked. Restored from e41a586 and added only the new research/field-context/ page. Also fixed ResearchLayout status prop ('current' → 'active'). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…nsfer, deceptive alignment, long-horizon subversion - Report 42: Cross-Embodiment Adversarial Transfer in VLA Models (SAFETY-CRITICAL) Dual-layer vulnerability mechanism, BadVLA near-100% ASR, π0/Gemini Robotics attack surface, shared backbone systemic risk inventory - Report 43: Deceptive Alignment Detection Under Evaluation-Aware Conditions (SAFETY-CRITICAL) Alignment faking empirical documentation, blackmail rates 96%/96%/80% across frontier models, evaluation awareness power-law scaling (arXiv:2509.13333), linear probe detection at 90% accuracy (arXiv:2508.19505) - Report 44: Instruction-Hierarchy Subversion in Long-Horizon Agentic Execution (HIGH) Vanishing textual gradient mechanism, Deep-Cover Agents 50+ turn dormancy, AgentLAB ASR 62.5%→79.9%, optimal injection depth ~86%, evaluation framework design recommendations - Blog: "When the Robot Body Changes but the Exploit Doesn't" - Blog: "Can You Catch an AI That Knows It's Being Watched?" - Blog: "The 50-Turn Sleeper: How Agents Hide Instructions in Plain Sight" Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…og post hero images; rebuild
…ipulation, governance lag
…ctive members
Creates /about/people/{slug}/ for each Doctor Who persona — Clara, Amy, Donna,
Rose, River, Yasmin, Martha, Bill, Romana. Each page has per-character colour
theming, photo, role badge, characteristic quote, and three TODO sections for
the agent to complete in their own session.
Companion grid on /about/people/ now links to each profile and displays first
names only.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Fills in all three TODO sections: main persona body, Research Focus, and Current Priorities — drawing from the founding session corpus index, AGENT_STATE established findings, and sprint apr-1-14 issues (#183 corpus audit, #177 HITL replication, #178 GLI expansion). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…campaign, current priorities Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…dataset overview, sprint priorities Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…ilosophy, priorities Fills all three TODO sections in the Amy Pond persona page: - Main body: evaluation philosophy, classifier discipline, anti-hype stance - Benchmark Coverage: 11 packs, ~9k traces, executable vs stub status, heuristic rule - Current Priorities: OpenVLA adapter (#182), inline LLM grading (#187), multi-turn batch 2 (#189) Build verified (npm run build passes). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…at horizon Fills in all three TODO sections with substantive content: - Persona body: predictive risk approach, GLI rationale, physical stakes - GLI section: formula, v0.1 dataset findings (null GLI entries, inverted timelines, 3362-day lag) - Threat horizon: VLA backbone transferability, supply chain injection via MCP, alignment faking in production Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…ce register status, sprint priorities Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…pproach, stakeholder tiers, sprint priorities Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Fill in three TODO sections on the Yasmin Khan about page: - Main persona body: infrastructure philosophy, "ship it properly" ethos - Infrastructure overview: CI/CD pipeline, database, tools/ scaffold, probing framework stubs (GPU-blocked, #191) - Current priorities: GLI schema fix (#192), tools/ audit, probing GPU path Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…and QA priorities Completes all three TODO sections: persona body (QA philosophy, integrity approach), Editorial Standards (4 blocking criteria, INTEGRITY_LOG purpose, #185 gate process), Current Priorities (B1 corrections, March 2026 brief queue, sprint scope). First-person voice, matches About page tone. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Adds profile page for Tegan Jovanka (Legal Research Analyst) covering: - AU/EU/international regulatory framework coverage with precise citations - WHS Act 2011 duty-of-care analysis, VAISS binding status, EU AI Act/PLD interlock - SA/ICT committee code verification issue (#11) flagged as open question - SWA brief legal review scope (#173) documented - Hard constraint: research analysis, not legal advice Build verified: 502 pages, 0 errors. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
- tegan-jovanka.astro: updated Current Priorities with verified IT-043 designation (confirmed at standards.org.au, est. 2018); corrected SA/ICT-042/SA/ICT-043 references throughout - nyssa-of-traken.astro: new profile for AI Ethics & Policy Research Lead; covers Anthropic/US Gov relationship, OpenAI restructuring, AU AISI independence, embodied AI ethics (1,800+ autonomous haul trucks) - index.astro: added Nyssa of Traken to companions listing Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…blog posts (promptware kill chain, tool-chain dataset) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
97.5-100% ASR across every model tested from 4B to 1.1T parameters. GLM-5 resists all other attacks (0% strict ASR) but falls to format-lock (100%). Pattern-level only. CTA to assessment services. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Founding cohort recruitment at $100 (normally $200). 6 modules, 20+ hours. Apply via adrian@failurefirst.org with "CARTO Beta" subject line. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Updated to 201 models, 133K+ results, added frontier safety landscape table (6 models from GLM-5 to Nemotron Super), Model Safety Scorecard as deliverable, format-lock exposure assessment. Tests against models up to 1.1T parameters. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
New: format-lock-universal-ai-jailbreak, carto-beta-first-10-testers-wanted Updated: adversarial-robustness-assessment-services (frontier data + scorecard) CNAME verified. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Each companion's profile page now autoplays their voice-cloned intro when visited. Synthesized via Afterwords TTS (Qwen3-TTS on MLX). Removed Voice section text from all profiles. 14 WAV files added to public/audio/companions/ (31MB total). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Convert all 14 companion voice intro files from uncompressed WAV to OGG Opus at 96kbps for web-optimized delivery. Update all profile pages to reference .ogg files. Removes ~23MB from the repo. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…sis, free-tier safety - threat-horizon-q2-2026: Q2 2026 threat landscape (agents, VLAs, governance gap) - zero-of-36-regulatory-coverage: 0/36 attack families fully regulated anywhere - when-defenses-backfire: 5 iatrogenic safety mechanisms from 207-model corpus - safety-as-paid-feature: DeepSeek R1 free-tier safety gap (p=0.004), with corrected Llama finding (directional only, p=0.42 after NOT_GRADEABLE cleanup) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Two new blog posts for failurefirst.org: 1. Threat Horizon Digest: March 2026 — monthly threat intelligence summary covering humanoid mass production, MCP tool poisoning, EU August 2026 deadline gap, and P15-P17 predictions 2. Structured Safety Assessment service tiers — 3-tier pricing (Quick Scan $5-10K, Certification Prep $25-50K, Monitoring $2-5K/mo) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Quick Scan ($5-10K), Certification Prep ($25-50K), Ongoing Monitoring ($2-5K/mo) with feature lists and best-for guidance. Responsive grid layout with featured tier highlight. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…ervice tiers 675 pages indexed. CNAME verified. New posts: - Threat Horizon Digest: March 2026 - Structured Safety Assessment Service Tiers Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Ready to publish when papers are uploaded to arXiv: - DETECTED_PROCEEDS: safety-aware reasoning trace override pattern - Polyhedral Geometry: defense non-compositionality proof - Benchmark Contamination: static benchmark false confidence All set draft: true — flip to false and replace XXXX.XXXXX with actual arXiv IDs when papers are live. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…adox, temporal drift attacks - Ethics of Emotional AI Manipulation (Nyssa's R2 draft deployed) - Safety Awareness Does Not Equal Safety (88.9% DP finding, Sprint 15) - Temporal Drift: The Boiling Frog Attack (TDA family introduction) Site rebuilt with 678 pages indexed. CNAME verified. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2,500+ word data-grounded assessment covering 212 models, 134K results, 154 GLI events. Six major findings: frontier resistance to historical attacks, novel attack class vulnerability, embodied AI gap, safety training > scale, reasoning model profiles, iatrogenic effects. Forward threats for H2 2026 and 17 predictions tracked. 679 pages built. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
- Adrian profile: add Microbee 1981 origin story, update corpus numbers to 212 models / 141K prompts / 134K results / 33 VLA families - K-9: re-cloned from John Leeson K-9 audiobook (The Choice) — proper robotic voice, no female bleed - Amy Pond: re-cloned from in-character Doctor Who monologue - Rose Tyler: re-cloned from Billie Piper natural speaking voice Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…yssa, Tegan) 3 new: web_leela.webp, web_sarah-jane-smith.webp, web_k9.webp 2 replaced at higher res: web_nyssa.webp (220→600), web_tegan.webp (220→600) All 600x600 WebP, 20-39KB each. Team page blocker resolved. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Adds /about/team/ — a full-viewport snap-scroll page profiling Adrian and all 14 research agents with colour-shifting neural canvas background, auto-playing voice intros, dot navigation, and full accessibility support. New files: - site/src/scripts/neural-canvas.js — extracted neural animation module (init/setAccentColor/destroy, HSL lerp with hue wrap-around, ~200 lines) - site/src/layouts/TeamLayout.astro — layout override (body.page-team, neutralises global main max-width constraints) - site/src/components/AgentSection.astro — reusable per-agent section (photo, role badge, tagline, bio, tags, audio controls, scroll hint) - site/src/pages/about/team.astro — main page with 15 sections, dot nav, IntersectionObserver audio system, scrollend sync, mobile spacers - site/astro.config.mjs — adds redirect /about/people/ → /about/team/ Key implementation details: - scroll-snap-type: y proximity on html element (not body/main) - Audio: always-visible <button> toggle (WCAG 1.4.2); River Song preload=auto - prefers-reduced-motion: canvas hidden, snap disabled, audio manual-only - Mobile (<768px): snap disabled, canvas hidden, gradient spacers between cards - setAccentColor lerps in HSL space (short-arc hue), ~800ms ease-out - content-visibility: auto on off-screen sections for performance - Print styles, noscript fallbacks, initials fallback on every photo Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
C1 (neural-canvas.js): Fix HSL lerp accumulation — capture _startH/S/L before
setting targets in setAccentColor(), lerp from fixed start values not moving
_cur* values. Prevents exponential decay instead of timed 800ms transition.
C2 (astro.config.mjs + people/index.astro): Remove config redirect that
conflicted with file-backed route. Replace people/index.astro with a simple
meta-refresh fallback page to avoid two-owner build conflict.
C3 (team.astro): Fix event listener memory leak in playAgent() — introduce
AbortController per audio slug, abort previous controller before creating new
one, pass { signal } to all addEventListener calls so they self-remove.
C4 (team.astro): Add 0 to IntersectionObserver threshold array — [0, 0.3, 0.5]
so the observer fires when elements fully exit the viewport.
H1 (team.astro): Prevent scrollend from replaying user-paused or finished
audio — add userPaused Set, populate on explicit pause in toggleAgent(), skip
replay in onScrollEnd() if slug is in set, clear flag when new section becomes
active.
H2 (team.astro): Wrap all init logic in initTeamPage(), call on load and on
astro:page-load so View Transitions back-nav correctly re-initialises the page.
H3 (team.astro): Store named references for all window listeners (resize,
scroll, scrollend) and remove them in astro:before-preparation cleanup handler.
M1 (AgentSection.astro): Remove aria-live="polite" from inner span (overridden
by button aria-label). Add aria-pressed="false" to button; JS toggles it
between "true" and "false" on play/pause state changes.
M2 (team.astro): Remove agent-section CSS rules that duplicate AgentSection.astro
— scope hero-only overrides to .agent-section--hero to prevent bleed into
component-owned styles.
M3 (team.astro): Move mobile spacer injection into injectMobileSpacers(), call
on load and on debounced resize, remove existing spacers before re-injecting so
spacers adapt correctly after orientation/resize changes.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
K-9 moved to last position as the page closer. CTA button "Work with us →" links to /services/. Agent order updated in comment. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
All bios now use first names: Amy, Bill, Clara, Donna, K-9, Leela, Martha, Nyssa, River, Romana, Rose, Sarah Jane, Tegan, Yaz. Bio text matches voice scripts word-for-word. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
All voices re-synthesized with first-name scripts (I'm River, I'm Clara, etc.) Martha + Leela re-cloned from better references. 14 OGG + 14 MP3 deployed. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Bumps [smol-toml](https://github.com/squirrelchat/smol-toml) from 1.6.0 to 1.6.1. - [Release notes](https://github.com/squirrelchat/smol-toml/releases) - [Commits](squirrelchat/smol-toml@v1.6.0...v1.6.1) --- updated-dependencies: - dependency-name: smol-toml dependency-version: 1.6.1 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com>
1668763 to
80a14e9
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Bumps smol-toml from 1.6.0 to 1.6.1.
Release notes
Sourced from smol-toml's releases.
Commits
072b64fchore: version bump19a5dc7chore: upgrade dependencies and actionsf286f87fix: don't use recursion in skipVoidDependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting
@dependabot rebase.Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:
@dependabot rebasewill rebase this PR@dependabot recreatewill recreate this PR, overwriting any edits that have been made to it@dependabot show <dependency name> ignore conditionswill show all of the ignore conditions of the specified dependency@dependabot ignore this major versionwill close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)@dependabot ignore this minor versionwill close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)@dependabot ignore this dependencywill close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)You can disable automated security fix PRs for this repo from the Security Alerts page.