Nicolas Cravino edited this page Apr 22, 2026 · 8 revisions

Change Log — sw30labs Wiki

All significant additions and updates to the knowledge base.

2026-04-15 (afternoon)

Taxonomy migration — SCHEMA.md yaml block + stub classification

Completed the structural migration flagged in the morning run. SCHEMA.md now carries a fenced ```yaml taxonomy block (5 categories, 10 stacks) parseable by scripts/taxonomy.py, and every stub declares its `category:` and `stacks:` in frontmatter.

SCHEMA.md:

  • Added canonical yaml block with categories: (ai-security, oscal-compliance, agentic-frameworks, local-inference-mlx, developer-tools) and stacks: (mlx, oscal, agentic, pentest, compliance, rag, mcp, langgraph, converter, cli).
  • Kept tag whitelist, voice, freshness threshold (90 days), and directory-structure prose below the yaml fence.
  • Added "Rules" and "How scripts use this file" sections per template.
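As a rough illustration of how a script might read the taxonomy out of SCHEMA.md, the sketch below pulls the first fenced yaml block from a markdown string and parses it. The function names and the simple `key:` plus `- item` layout are assumptions made for this example; the actual scripts/taxonomy.py and the exact shape of the block may differ.

```python
import re

# Built programmatically so the fence marker never appears literally in this example.
FENCE = "`" * 3


def extract_yaml_block(markdown: str) -> str:
    """Return the body of the first fenced yaml block (hypothetical helper)."""
    pattern = re.compile(
        rf"^{FENCE}yaml[ \t]*\n(.*?)^{FENCE}[ \t]*$",
        re.DOTALL | re.MULTILINE,
    )
    match = pattern.search(markdown)
    if match is None:
        raise ValueError("no fenced yaml block found")
    return match.group(1)


def parse_taxonomy(block: str) -> dict[str, list[str]]:
    """Parse 'key:' headers followed by '- item' lines (assumed layout only)."""
    taxonomy: dict[str, list[str]] = {}
    current = None
    for raw in block.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        if line.endswith(":") and not line.startswith("-"):
            current = line[:-1]
            taxonomy[current] = []
        elif line.startswith("- ") and current is not None:
            taxonomy[current].append(line[2:].strip())
    return taxonomy


# Shortened sample document; the real block lists 5 categories and 10 stacks.
SAMPLE = "\n".join([
    "# Schema",
    "",
    FENCE + "yaml",
    "categories:",
    "  - ai-security",
    "  - developer-tools",
    "stacks:",
    "  - mlx",
    "  - cli",
    FENCE,
    "",
    "Prose below the fence.",
])

if __name__ == "__main__":
    print(parse_taxonomy(extract_yaml_block(SAMPLE)))
```

Raising on a missing block, rather than returning an empty taxonomy, makes a prose-only schema fail loudly instead of silently reporting zero categories and stacks.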

Stub frontmatter (55 stubs):

  • All 33 repo stubs got category: and stacks: assigned.
  • All 22 article summary stubs got category: (single, by primary topic). Most article stubs get an empty stacks: []; 14 of them carry no stack because the article is opinion or analysis rather than tied to a specific tech stack. The validator treats missing stacks as informational, not an error.
  • Injection respects remote_url, repo_path, and all other prior fields.
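The validation rule described above (a bad or missing category is an error, a missing stack is only informational) can be sketched as follows. The category and stack names come from the SCHEMA.md entry; the `Report` class, the `validate_stub` function, and the `example-article` stub are hypothetical and stand in for whatever the real validator does.

```python
from dataclasses import dataclass, field

CATEGORIES = {
    "ai-security", "oscal-compliance", "agentic-frameworks",
    "local-inference-mlx", "developer-tools",
}
STACKS = {
    "mlx", "oscal", "agentic", "pentest", "compliance",
    "rag", "mcp", "langgraph", "converter", "cli",
}


@dataclass
class Report:
    """Hypothetical result container: errors fail the run, info does not."""
    errors: list[str] = field(default_factory=list)
    info: list[str] = field(default_factory=list)

    @property
    def ok(self) -> bool:
        return not self.errors


def validate_stub(name: str, frontmatter: dict, report: Report) -> None:
    """Check one stub's frontmatter against the taxonomy."""
    category = frontmatter.get("category")
    if category not in CATEGORIES:
        report.errors.append(f"{name}: bad or missing category {category!r}")
    stacks = frontmatter.get("stacks") or []
    if not stacks:
        # Informational only: opinion/analysis articles may have no stack.
        report.info.append(f"{name}: no stacks declared")
    for stack in stacks:
        if stack not in STACKS:
            report.errors.append(f"{name}: unknown stack {stack!r}")


if __name__ == "__main__":
    report = Report()
    validate_stub("wiki-vs-rag",
                  {"category": "agentic-frameworks", "stacks": ["agentic", "rag"]},
                  report)
    # Hypothetical article stub with an empty stacks list.
    validate_stub("example-article", {"category": "ai-security", "stacks": []}, report)
    print(report.ok, report.info)
```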

Stack adoption after migration: agentic 21, langgraph 17, converter 8, compliance 7, mlx 7, oscal 6, pentest 6, cli 3, rag 2, mcp 1. 14 stubs have a category but no stack (mostly article stubs without tech-stack ties).
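Adoption figures like these are a per-stack tally over every stub's stacks: list, where a stub with several stacks counts once for each. A minimal sketch, assuming the stubs are already loaded into a name-to-stacks mapping; `stack_adoption` and the two `example-*` stubs are hypothetical.

```python
from collections import Counter


def stack_adoption(stubs: dict[str, list[str]]) -> tuple[Counter, list[str]]:
    """Count stubs per stack and list the stubs that declare no stack."""
    counts: Counter = Counter()
    unstacked = []
    for name, stacks in stubs.items():
        if not stacks:
            unstacked.append(name)
        counts.update(stacks)  # a multi-stack stub counts once per stack
    return counts, sorted(unstacked)


if __name__ == "__main__":
    stubs = {
        "wiki-vs-rag": ["agentic", "rag"],
        "nemotron-parse-spark": ["converter", "cli"],
        "example-agent": ["agentic", "langgraph"],  # hypothetical stub
        "example-article": [],                      # hypothetical stub
    }
    counts, unstacked = stack_adoption(stubs)
    for stack, n in counts.most_common():
        print(f"{stack} {n}")
    print("no stack:", unstacked)
```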

Sitemaps regenerated:

  • Sitemap.md — category view, updated in place.
  • Sitemap-Stacks.md — new file, stacks view. Shows an adoption table plus a mermaid graph grouped by stack, with dashed edges for secondary memberships.
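One way a stacks view with dashed secondary edges could be emitted is sketched below. It assumes the first entry in a stub's stacks: list is its primary stack and any later entries are secondary; that ordering convention and the function names are assumptions for this example, not taken from generate_sitemap.py.

```python
import re


def node_id(name: str) -> str:
    """Make a Mermaid-safe node id by replacing non-alphanumerics."""
    return re.sub(r"[^0-9A-Za-z_]", "_", name)


def stacks_mermaid(stubs: dict[str, list[str]]) -> str:
    """Emit a Mermaid graph: solid edge for the first stack, dashed for the rest."""
    lines = ["graph LR"]
    for name, stacks in sorted(stubs.items()):
        for position, stack in enumerate(stacks):
            arrow = "-->" if position == 0 else "-.->"
            lines.append(
                f'    stack_{node_id(stack)}["{stack}"] {arrow} '
                f'stub_{node_id(name)}["{name}"]'
            )
    return "\n".join(lines)


if __name__ == "__main__":
    print(stacks_mermaid({
        "wiki-vs-rag": ["agentic", "rag"],
        "nemotron-parse-spark": ["converter", "cli"],
    }))
```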

Post-flight detect_changes.py reports taxonomy.ok: True, zero bad/missing categories, zero bad stacks.


2026-04-15

Incremental update — stub hygiene pass

Change detection found no new repos on GitHub, no new articles in articles-published, and no stale wiki pages. Cleanup focused on stub frontmatter drift accumulated from a previous session's paths and a batch of upstream repo renames.

Stub path normalization (23 stubs): All stubs pointing at /sessions/epic-magical-hawking/Documents/sw30labs/... migrated to the canonical ~/Documents/sw30labs/repos/<dir> form required by the skill's path policy. Session paths are ephemeral; the ~/Documents form survives across runs.
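The rewrite from session paths to the canonical form might look like the sketch below. It assumes the repo directory is the final component of the old path; the source elides what follows sw30labs/, so the example path used here is hypothetical, as is the `normalize_repo_path` name.

```python
import re

# Matches an ephemeral session path and captures its last path component.
SESSION_PATH = re.compile(
    r"^/sessions/[^/]+/Documents/sw30labs/(?:.*/)?([^/]+)/?$"
)


def normalize_repo_path(path: str) -> str:
    """Rewrite a session path to ~/Documents/sw30labs/repos/<dir>.

    Paths that do not look like session paths are returned unchanged,
    so running the function twice is harmless.
    """
    match = SESSION_PATH.match(path)
    if match is None:
        return path
    return f"~/Documents/sw30labs/repos/{match.group(1)}"


if __name__ == "__main__":
    # Hypothetical old path; only the /sessions/.../Documents/sw30labs/ prefix
    # is known from the log.
    old = "/sessions/epic-magical-hawking/Documents/sw30labs/repos/tslit"
    print(normalize_repo_path(old))
```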

Stub remote_url fixes (8 stubs): These stubs pointed at GitHub URLs for repos that have since been renamed. Updated to the current canonical names so remote_url-keyed matching in detect_changes.py stops flagging them as "new":

  • mlx-youtubescribe: MLX-YouTubeScribe.git → youtube-scribe-mlx.git
  • screenlens: screenlens.git → screen-lens.git
  • strixresearch: strixresearch.git → strix-research.git
  • n8n2langraph: N8n2langraph.git → n8n-to-langraph.git
  • projectpulse: projectpulse.git → project-pulse-sap.git
  • local-mlx-responsesAPI-server: → mlx-responses-api-server.git
  • qwen3-vl-python-ocr-script-mlx: → qwen-ocr-mlx.git
  • tslit: TSLIT.git → tslit.git (case change)
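To see why a stale remote_url makes an existing repo look new, consider matching by exact URL string. The sketch below assumes exact, case-sensitive comparison and https clone URLs; both are assumptions about detect_changes.py, and `find_new_repos` is a hypothetical name.

```python
from typing import Iterable


def find_new_repos(github_urls: Iterable[str], stub_urls: Iterable[str]) -> list[str]:
    """Return GitHub URLs that no stub's remote_url matches exactly."""
    known = set(stub_urls)
    return sorted(url for url in github_urls if url not in known)


if __name__ == "__main__":
    github = [
        "https://github.com/sw30labs/tslit.git",
        "https://github.com/sw30labs/screen-lens.git",
    ]
    # Before the fix: the tslit stub still carries the old capitalization.
    before = [
        "https://github.com/sw30labs/TSLIT.git",
        "https://github.com/sw30labs/screen-lens.git",
    ]
    print(find_new_repos(github, before))  # tslit is reported as new
    # After the fix: the stub URL matches, so nothing is reported.
    after = [url.replace("TSLIT", "tslit") for url in before]
    print(find_new_repos(github, after))
```

Under exact matching, even a case-only rename such as TSLIT.git to tslit.git breaks the key, which is consistent with that stub needing an update.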

last_commit_date refreshed (2 stubs):

  • mlx-youtubescribe: 2026-04-11 → 2026-04-12
  • qwen3-vl-python-ocr-script-mlx: 2026-04-11 → 2026-04-12

Articles: All 22 articles remain in 1:1 sync with articles-published (latest commit 2026-04-11). No timeline churn.

Open item (not applied): SCHEMA.md does not yet contain the fenced ```yaml taxonomy block that the newer detect_changes.py / `generate_sitemap.py` scripts expect (for `categories:` / `stacks:` validation). The current schema still documents the taxonomy in prose only. Post-flight detect_changes reports `category_count: 0` and `stack_count: 0` for that reason. Left untouched in this pass — migrating to the YAML schema is a structural change that should be done deliberately, probably alongside adding `category:` / `stacks:` to every stub's frontmatter.


2026-04-13

Incremental update — 9 new repos ingested

New stubs created (9):

  • autogen-doc-generator → Developer Tools (AutoGen-based code documentation generator)
  • autogen-mermaid-generator → Developer Tools (AutoGen GroupChat Mermaid diagram generator)
  • autogen-requirements-agent → Agentic Frameworks (experimental strict JSON inter-agent messaging)
  • nvd-extractor → AI Security (NVD API vulnerability extraction with network attack vector filtering)
  • ollama-pdf2markdown → Developer Tools (PDF→Markdown via Ollama multimodal, Mistral Small 3.1)
  • repobundle → Developer Tools (Git repo export/import as single text files, stdlib only)
  • word-to-markdown-converter → Developer Tools (.docx→Markdown converter)
  • animated-gif-creator → Developer Tools (image folder / MOV → animated GIF)
  • deepseekvl2-pdf-ocr → Local Inference & MLX (local PDF OCR via DeepSeek-VL2 on NVIDIA CUDA)

Path fixes:

  • Updated 7 stubs with stale session paths (n8n2langraph, local-mlx-responsesAPI-server, projectpulse, screenlens, strixresearch, mlx-youtubescribe, qwen3-vl-python-ocr-script-mlx) to point to current ~/Documents/sw30labs/repos/ locations.

Category pages updated:

  • Developer Tools: expanded from 1 repo to 7, rewrote summary
  • AI Security: added NVD-Extractor claim
  • Agentic Frameworks: added AutoRequirementsAgent claim, updated summary (5→6 repos)
  • Local Inference & MLX: added DeepSeek-VL2 OCR claim, updated summary (8→9 repos)

Articles: All 22 articles already synced — no new articles since last run.

Home.md: Updated repo count (23→32), added all new repo rows to category tables.


2026-04-11

Bootstrap: Created wiki schema and directory structure for sw30labs organization. Initialized with taxonomy covering AI security, OSCAL compliance, agentic frameworks, local inference (MLX), and developer tools. Established tag whitelist, freshness thresholds (90 days), and voice guidelines (concise, technical, present tense, practitioner-oriented).

  • Created: SCHEMA.md, Index.md, Log.md
  • Created: raw/, wiki/ai-security/, wiki/oscal-compliance/, wiki/agentic-frameworks/, wiki/local-inference-mlx/, wiki/developer-tools/

Bulk Ingest (23 repositories): Cloned and ingested all 23 public repos from github.com/sw30labs. Created 23 raw stubs and 5 category wiki pages. Updated Index.md and Home.md.

  • Stubs created (23): tslit, pentest-regulatory-intel, strixresearch, agentic-ai-pentesting, oscal-agent-guardrails, oscal-digital-twin-playground, oscal-zero-trust-lens, oscal-agent-lab, oscal-cac-playgd, agent-stack, deepagent-azure-cli, n8n2langraph, sst-autoresearch, projectpulse, tars-ai, screenlens, qwenbench-mlx, mlx-distillation-explained, local-mlx-responsesAPI-server, audiobook-generator, qwen3-vl-python-ocr-script-mlx, mlx-youtubescribe, gitnexus-fleet
  • Wiki pages created (5): ai-security-pentesting, oscal-compliance, agentic-frameworks, local-inference-mlx, developer-tools
  • Contradictions surfaced: 0
  • Index.md updated: yes
  • Home.md updated: yes

2026-04-11 — Incremental Update (wiki-curator first run)

Result: No changes required. All 23 repos pulled — all already up to date. Change detection confirms:

  • 23/23 GitHub repos cloned and stubbed
  • 5/5 category wiki pages present and indexed
  • 0 new repos, 0 updated repos, 0 stale pages, 0 orphan stubs, 0 unindexed pages

Observations for next run:

  • 2 repos inactive >90 days: TSLIT (last commit 2025-12-05), N8n2langraph (2025-11-29) — will trigger stale-page warnings after next freshness check
  • Most active repos: projectpulse (Apr 10), sst-autoresearch (Apr 9), screenlens (Apr 8)
  • Tag qwen appears in 4+ stubs but is not in SCHEMA whitelist — consider adding
  • MLX-YouTubeScribe uses both chromadb and langgraph but isn't cross-referenced with pentest-regulatory-intel (also uses both)
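The inactivity observation above follows from the 90-day freshness threshold. A minimal sketch of that check, using the commit dates quoted in the list and this run's date; the function name is hypothetical, and the year for projectpulse's "Apr 10" is assumed to be 2026.

```python
from datetime import date

FRESHNESS_DAYS = 90  # threshold from SCHEMA.md


def stale_repos(last_commits: dict[str, date], today: date,
                threshold: int = FRESHNESS_DAYS) -> dict[str, int]:
    """Map each repo older than the threshold to its age in days."""
    ages = {name: (today - committed).days for name, committed in last_commits.items()}
    return {name: age for name, age in ages.items() if age > threshold}


if __name__ == "__main__":
    last_commits = {
        "TSLIT": date(2025, 12, 5),
        "N8n2langraph": date(2025, 11, 29),
        "projectpulse": date(2026, 4, 10),  # year assumed
    }
    print(stale_repos(last_commits, today=date(2026, 4, 11)))
    # → {'TSLIT': 127, 'N8n2langraph': 133}
```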

2026-04-11 — Articles Timeline Integration

Articles Ingest (22 published articles): Cloned github.com/sw30labs/articles-published and created article stubs + timeline page. Articles span 2023-04-23 to 2026-03-08 across all 5 wiki categories.

  • Stubs created (22): revolutionizing-cyber-risk-advisory-ai, safeguarding-datasets-llm-era, transforming-crypto-mining-hardware, ai-generated-code-zero-trust, digital-twins-incident-response, asics-dethrone-gpus-llm, osi-model-natural-language-layer, ai-supply-chain-governance, asics-vs-gpus-grid, custom-silicon-gpt-story, rise-agentic-security, agentic-pentesting-layer7, 72-hour-civilization-agents, oscal-phantom-compliance, agentic-pipeline-code-optional, automated-ai-news-workflow, intelligence-event-horizon, understanding-agent-security-analogies, ai-agent-security-governance, one-person-conglomerate-reality, best-ai-unrestricted-access, zero-trust-ai-coding-boom
  • Wiki pages created (1): articles-timeline (wiki/articles/timeline.md)
  • Category pages updated (5): ai-security-pentesting (+11 articles), oscal-compliance (+2), agentic-frameworks (+4), local-inference-mlx (+4), developer-tools (+1)
  • Index.md updated: yes — added Published Articles section
  • Contradictions surfaced: 0

2026-04-11 — Wiki Curator Update (merged skill)

Operation: Full update using merged wiki-curator skill (articles integration built-in).

Repo sync: All 24 repos pulled (including articles-published). 0 new repos, 0 updated repos.

Articles sync: 22/22 articles already ingested. No new articles found.

New content:

  • Created raw/articles-published.stub.md (repo-level stub for the articles collection — was previously missing)
  • Updated Index.md to include the articles-published stub

Issues: None. Wiki fully current.

Observations:

  • 2 repos remain inactive >90 days: TSLIT (last commit 2025-12-05), N8n2langraph (2025-11-29)
  • Tag qwen still not in SCHEMA whitelist despite appearing in 4+ stubs
  • All article stubs present but no individual article detail pages exist yet (only timeline + stubs)

2026-04-11 — Article Sync + Sitemap Generation

Operator: wiki-curator (automated)

Changes

  • Articles synced: Created 22 article detail pages in wiki/articles/ from articles-published repo
    • AI Security: 16 articles
    • Local Inference & MLX: 4 articles
    • OSCAL & Compliance: 1 article
    • Agentic Frameworks: 1 article
  • Timeline regenerated: wiki/articles/timeline.md now lists all 22 articles chronologically
  • Index updated: Published Articles section now includes all article wiki pages (not just raw stubs)
  • Sitemap generated: Sitemap.md created with Mermaid flowchart showing full wiki structure

Repo status

  • 24 repos on GitHub, 24 cloned, all up to date (no new commits)
  • No new repos to ingest
  • No updated repos to refresh
  • No stale pages detected

Notes

  • articles-published stub flagged as orphan (expected — it's a meta-stub not cited by wiki pages)
  • Many slug-based article filenames lack embedded dates; used git commit dates as fallback
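The date fallback in the last note can be sketched as: try to read a date from the filename, and only ask git when none is found. The YYYY-MM-DD filename pattern, the `.md` extension, and the injected `commit_date_lookup` callable are assumptions for illustration; in practice the lookup could wrap a `git log` call on the article path.

```python
import re
from datetime import date
from typing import Callable

DATE_IN_NAME = re.compile(r"(\d{4})-(\d{2})-(\d{2})")


def article_date(filename: str,
                 commit_date_lookup: Callable[[str], date]) -> tuple[date, str]:
    """Return (date, source), where source is 'filename' or 'git'."""
    match = DATE_IN_NAME.search(filename)
    if match:
        try:
            year, month, day = map(int, match.groups())
            return date(year, month, day), "filename"
        except ValueError:
            pass  # digits shaped like a date but not a valid one
    return commit_date_lookup(filename), "git"


if __name__ == "__main__":
    # Stand-in for a real git lookup; returns a fixed date for the demo.
    def fake_lookup(path: str) -> date:
        return date(2026, 4, 11)

    print(article_date("2026-03-08-example-article.md", fake_lookup))
    print(article_date("oscal-phantom-compliance.md", fake_lookup))
```

Returning the source alongside the date keeps it visible which timeline entries rest on commit dates rather than on dates embedded in the filename.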

2026-04-22

Update run — two new repos ingested

Picked up two new public repos in the sw30labs org since the last run: wiki-vs-rag (agentic-frameworks) and nemotron-parse-spark (developer-tools). All 35 GitHub repos cloned; no updated_repos, no stale_pages, no orphan_stubs. Articles repo unchanged (still 22 articles, all already mapped to wiki pages).

New stubs:

  • raw/wiki-vs-rag.stub.md — Reproducible benchmark of single-shot RAG vs agentic-RAG vs wiki-navigation vs QMD over the sw30labs wiki corpus pinned at c130a3b. Headline result: agentic-RAG is the Pareto winner (4.77 quality vs 4.69 for wiki-nav at 45% of the cost; 11–2 vs 17 ties against single-shot RAG). QMD is cheapest ($0.005/query, no embedding API) but collapses on branch / cross-branch synthesis (2.83 / 3.37 vs 4.5+ for the others). Categorized agentic-frameworks, stacks: [agentic, rag].
  • raw/nemotron-parse-spark.stub.md — Test harness for NVIDIA Nemotron Parse v1.2 (<1B-param ViT-H + mBART document-understanding model) on DGX Spark (Grace Blackwell GB10, 128 GB unified memory). CLI + REPL + batch processor + benchmark; transformers and vLLM backends share a single postprocessing path. Notes the load-bearing 4-token v1.2 prompt and the 1024×1280 / 1664×2048 image-sizing contract. Categorized developer-tools, stacks: [converter, cli].

Pages updated:

  • wiki/agentic-frameworks/agentic-frameworks.md — bumped summary from 6 to 7 repos; added 3 key claims for wiki-vs-rag (the four-arm setup, the agentic-RAG Pareto result, the QMD branch/cross-branch collapse); added 2 open questions; tags now include rag, benchmark; updated to 2026-04-22.
  • wiki/developer-tools/developer-tools.md — bumped summary from 7 to 8 repos; added 3 key claims for nemotron-parse-spark (NVIDIA Nemotron Parse on DGX Spark, the 4-token v1.2 prompt contract, the image-sizing envelope); added 1 open question (Nemotron vs DeepSeek-VL2 vs Qwen3-VL); tags now include benchmark; updated to 2026-04-22.
  • Home.md — repo count 32 → 34, last-updated 2026-04-15 → 2026-04-22; added rows for wiki-vs-rag (Agentic Frameworks table) and nemotron-parse-spark (Developer Tools table).
  • Index.md — added wiki-vs-rag and nemotron-parse-spark stub rows to the relevant category tables.
  • Sitemap.md and Sitemap-Stacks.md — regenerated to pick up the new stubs (2 nodes added to agentic-frameworks and developer-tools diagrams; new entries in the agentic / rag / converter / cli stacks).

Repo status

  • 35 repos on GitHub, 35 cloned, all up to date.
  • 2 new repos ingested (wiki-vs-rag, nemotron-parse-spark).
  • 0 updated repos.
  • 0 stale pages.
  • 0 orphan stubs.
  • Taxonomy validation: all 57 stubs have valid category: and known stacks: entries. 14 article stubs still report stubs_missing_stacks — informational, expected for opinion/analysis pieces.

Notes

  • Articles repo unchanged since the previous run; no new article pages needed.
  • The wiki-vs-rag repo benchmarks this very wiki as its corpus (pinned at commit c130a3b) — the reflexive loop is a nice symptom that the wiki is now substrate for downstream agent work, not just documentation.
  • The nemotron-parse-spark stub deliberately calls out the prompt / image-size landmines from the model card so future readers don't silently degrade output by tweaking either.
