Log
All significant additions and updates to the knowledge base.
Completed the structural migration flagged in the morning run. SCHEMA.md
now carries a fenced ```yaml taxonomy block (5 categories, 10 stacks)
parseable by scripts/taxonomy.py, and every stub declares its
`category:` and `stacks:` in frontmatter.
SCHEMA.md:
- Added canonical yaml block with `categories:` (ai-security, oscal-compliance, agentic-frameworks, local-inference-mlx, developer-tools) and `stacks:` (mlx, oscal, agentic, pentest, compliance, rag, mcp, langgraph, converter, cli).
- Kept tag whitelist, voice, freshness threshold (90 days), and directory-structure prose below the yaml fence.
- Added "Rules" and "How scripts use this file" sections per template.
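A minimal sketch of how a script might pull the taxonomy out of the fence, assuming the block uses inline `key: [a, b]` lists. The real layout of the block and the internals of scripts/taxonomy.py are not shown in this log, so `extract_taxonomy` and the sample text are hypothetical.

```python
import re

FENCE = "`" * 3  # built at runtime so this example can sit inside a fenced block

# Hypothetical SCHEMA.md excerpt; only two categories and two stacks shown.
SCHEMA_TEXT = "\n".join([
    "# Schema",
    "",
    FENCE + "yaml",
    "categories: [ai-security, oscal-compliance]",
    "stacks: [mlx, oscal]",
    FENCE,
    "",
    "Tag whitelist and voice prose follow the fence.",
])


def extract_taxonomy(text):
    """Return the categories and stacks declared in the first yaml fence."""
    pattern = re.compile(
        re.escape(FENCE) + r"yaml\n(.*?)\n" + re.escape(FENCE), re.DOTALL
    )
    taxonomy = {"categories": [], "stacks": []}
    match = pattern.search(text)
    if match is None:
        return taxonomy  # prose-only schema: nothing to validate against
    for line in match.group(1).splitlines():
        key, sep, value = line.partition(":")
        key = key.strip()
        if sep and key in taxonomy:
            items = value.strip().strip("[]").split(",")
            taxonomy[key] = [item.strip() for item in items if item.strip()]
    return taxonomy
```

A schema without the fence yields two empty lists, which a caller could report as zero categories and zero stacks.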
Stub frontmatter (55 stubs):
- All 33 repo stubs got `category:` and `stacks:` assigned.
- All 22 article summary stubs got `category:` (single, by primary topic). Article stubs generally get an empty `stacks: []`; 14 of them carry no stack since the article is opinion/analysis rather than tied to a specific tech stack. The validator treats missing stacks as informational, not an error.
- Injection respects remote_url, repo_path, and all other prior fields.
Stack adoption after migration: agentic 21, langgraph 17, converter 8, compliance 7, mlx 7, oscal 6, pentest 6, cli 3, rag 2, mcp 1. 14 stubs are unclassified (mostly article stubs without tech-stack ties).
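The adoption figures can be tallied roughly as follows. This is a sketch with hypothetical stub records, not the actual sitemap code; a stub with several stacks counts once per stack, which is why the per-stack totals (78 memberships) exceed the 55 stubs.

```python
from collections import Counter

# Hypothetical stub records; names and stack lists are illustrative only.
stubs = [
    {"name": "repo-a", "stacks": ["agentic", "langgraph"]},
    {"name": "repo-b", "stacks": ["mlx"]},
    {"name": "article-c", "stacks": []},
]


def stack_adoption(stubs):
    """Count stubs per stack and the number of stubs with no stack at all."""
    counts = Counter()
    unclassified = 0
    for stub in stubs:
        stacks = stub.get("stacks") or []
        if not stacks:
            unclassified += 1
        counts.update(stacks)  # one increment per declared stack
    return counts, unclassified
```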
Sitemaps regenerated:
- `Sitemap.md`: category view, updated in place.
- `Sitemap-Stacks.md`: new file, stacks view. Shows adoption table and mermaid graph grouped by stack, with dashed edges for secondary memberships.
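One way the stacks graph could be emitted, sketched under the assumption that the first entry in `stacks:` is the primary stack and the rest are secondary. `stacks_mermaid` and `node_id` are hypothetical helpers; the real generate_sitemap.py may differ.

```python
import re


def node_id(name):
    # Restrict ids to letters, digits, and underscores to stay on the safe side.
    return re.sub(r"[^A-Za-z0-9_]", "_", name)


def stacks_mermaid(stubs):
    """Build a Mermaid flowchart: one subgraph per primary stack, dashed edges for the rest."""
    groups = {}
    dashed = []
    for name, stacks in stubs.items():
        if not stacks:
            continue  # unclassified stubs are left out of the graph
        primary, *secondary = stacks
        groups.setdefault(primary, []).append(name)
        dashed.extend((name, stack) for stack in secondary)
    lines = ["flowchart LR"]
    for stack, names in groups.items():
        lines.append(f'  subgraph stack_{node_id(stack)}["{stack}"]')
        lines.extend(f'    {node_id(n)}["{n}"]' for n in names)
        lines.append("  end")
    for name, stack in dashed:
        # "-.->" is Mermaid's dotted-arrow link
        lines.append(f"  {node_id(name)} -.-> stack_{node_id(stack)}")
    return "\n".join(lines)
```

For a stub declaring `stacks: [agentic, rag]`, the node lands in the agentic subgraph and gets one dashed edge toward rag.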
Post-flight detect_changes.py reports taxonomy.ok: True, zero
bad/missing categories, zero bad stacks.
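The validation rule described in this entry (bad or missing category is an error, unknown stack is an error, missing stacks is informational) can be sketched as below. `validate_stub` and its return shape are assumptions for illustration, not the actual detect_changes.py interface.

```python
def validate_stub(frontmatter, categories, stacks):
    """Return (errors, notes) for one stub's frontmatter dict."""
    errors, notes = [], []
    category = frontmatter.get("category")
    if not category:
        errors.append("missing category")
    elif category not in categories:
        errors.append(f"bad category: {category}")
    declared = frontmatter.get("stacks") or []
    if not declared:
        notes.append("missing stacks")  # informational only, never fails the run
    for stack in declared:
        if stack not in stacks:
            errors.append(f"bad stack: {stack}")
    return errors, notes
```

Under this sketch a run is "ok" when every stub returns an empty error list, regardless of how many notes it collects.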
Change detection found no new repos on GitHub, no new articles in
articles-published, and no stale wiki pages. Cleanup focused on stub
frontmatter drift accumulated from a previous session's paths and a
batch of upstream repo renames.
Stub path normalization (23 stubs):
All stubs pointing at /sessions/epic-magical-hawking/Documents/sw30labs/...
migrated to the canonical ~/Documents/sw30labs/repos/<dir> form
required by the skill's path policy. Session paths are ephemeral; the
~/Documents form survives across runs.
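A sketch of the rewrite, assuming the last path component is the repo directory. The session name in the usage example is made up, and `normalize_repo_path` is a hypothetical helper rather than the skill's actual code.

```python
from pathlib import PurePosixPath

CANONICAL_ROOT = "~/Documents/sw30labs/repos"


def normalize_repo_path(repo_path):
    """Rewrite an ephemeral /sessions/... path to the canonical ~/Documents form."""
    path = PurePosixPath(repo_path)
    if path.parts[:2] != ("/", "sessions"):
        return repo_path  # already canonical, or not a session path
    # Assumption: the final component names the repo directory.
    return f"{CANONICAL_ROOT}/{path.name}"
```

For example, `normalize_repo_path("/sessions/example-session/Documents/sw30labs/repos/tslit")` gives `~/Documents/sw30labs/repos/tslit`, and an already canonical path passes through unchanged.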
Stub remote_url fixes (8 stubs):
These stubs pointed at GitHub URLs for repos that have since been
renamed. Updated to the current canonical names so remote_url-keyed
matching in detect_changes.py stops flagging them as "new":
- mlx-youtubescribe: `MLX-YouTubeScribe.git` → `youtube-scribe-mlx.git`
- screenlens: `screenlens.git` → `screen-lens.git`
- strixresearch: `strixresearch.git` → `strix-research.git`
- n8n2langraph: `N8n2langraph.git` → `n8n-to-langraph.git`
- projectpulse: `projectpulse.git` → `project-pulse-sap.git`
- local-mlx-responsesAPI-server: → `mlx-responses-api-server.git`
- qwen3-vl-python-ocr-script-mlx: → `qwen-ocr-mlx.git`
- tslit: `TSLIT.git` → `tslit.git` (case change)
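Why a rename shows up as a "new" repo: if matching compares remote_url values as exact strings, any stub still holding the old URL leaves the current URL unclaimed. The sketch below assumes exact string comparison and an HTTPS URL form; both are guesses about detect_changes.py, and `find_new_repos` is a hypothetical name.

```python
def find_new_repos(github_urls, stub_urls):
    """Return GitHub remotes that no stub claims, compared as exact strings."""
    return sorted(set(github_urls) - set(stub_urls))


# Assumed URL form; only the repo names come from the list above.
github = ["https://github.com/sw30labs/tslit.git"]
before_fix = ["https://github.com/sw30labs/TSLIT.git"]
after_fix = ["https://github.com/sw30labs/tslit.git"]
```

With the stale stub URL, `find_new_repos(github, before_fix)` still reports the tslit remote; after the stub is updated, the result is empty.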
last_commit_date refreshed (2 stubs):
- mlx-youtubescribe: 2026-04-11 → 2026-04-12
- qwen3-vl-python-ocr-script-mlx: 2026-04-11 → 2026-04-12
Articles: All 22 articles remain in 1:1 sync with articles-published
(latest commit 2026-04-11). No timeline churn.
Open item (not applied): SCHEMA.md does not yet contain the
fenced ```yaml taxonomy block that the newer detect_changes.py /
`generate_sitemap.py` scripts expect (for `categories:` / `stacks:`
validation). The current schema still documents the taxonomy in prose
only. Post-flight detect_changes reports `category_count: 0` and
`stack_count: 0` for that reason. Left untouched in this pass — migrating
to the YAML schema is a structural change that should be done
deliberately, probably alongside adding `category:` / `stacks:` to every
stub's frontmatter.
New stubs created (9):
- autogen-doc-generator → Developer Tools (AutoGen-based code documentation generator)
- autogen-mermaid-generator → Developer Tools (AutoGen GroupChat Mermaid diagram generator)
- autogen-requirements-agent → Agentic Frameworks (experimental strict JSON inter-agent messaging)
- nvd-extractor → AI Security (NVD API vulnerability extraction with network attack vector filtering)
- ollama-pdf2markdown → Developer Tools (PDF→Markdown via Ollama multimodal, Mistral Small 3.1)
- repobundle → Developer Tools (Git repo export/import as single text files, stdlib only)
- word-to-markdown-converter → Developer Tools (.docx→Markdown converter)
- animated-gif-creator → Developer Tools (image folder / MOV → animated GIF)
- deepseekvl2-pdf-ocr → Local Inference & MLX (local PDF OCR via DeepSeek-VL2 on NVIDIA CUDA)
Path fixes:
- Updated 7 stubs with stale session paths (`n8n2langraph`, `local-mlx-responsesAPI-server`, `projectpulse`, `screenlens`, `strixresearch`, `mlx-youtubescribe`, `qwen3-vl-python-ocr-script-mlx`) to point to current `~/Documents/sw30labs/repos/` locations.
Category pages updated:
- Developer Tools: expanded from 1 repo to 7, rewrote summary
- AI Security: added NVD-Extractor claim
- Agentic Frameworks: added AutoRequirementsAgent claim, updated summary (5→6 repos)
- Local Inference & MLX: added DeepSeek-VL2 OCR claim, updated summary (8→9 repos)
Articles: All 22 articles already synced — no new articles since last run.
Home.md: Updated repo count (23→32), added all new repo rows to category tables.
Bootstrap: Created wiki schema and directory structure for sw30labs organization. Initialized with taxonomy covering AI security, OSCAL compliance, agentic frameworks, local inference (MLX), and developer tools. Established tag whitelist, freshness thresholds (90 days), and voice guidelines (concise, technical, present tense, practitioner-oriented).
- Created: SCHEMA.md, Index.md, Log.md
- Created: raw/, wiki/ai-security/, wiki/oscal-compliance/, wiki/agentic-frameworks/, wiki/local-inference-mlx/, wiki/developer-tools/
Bulk Ingest (23 repositories): Cloned and ingested all 23 public repos from github.com/sw30labs. Created 23 raw stubs and 5 category wiki pages. Updated Index.md and Home.md.
- Stubs created (23): tslit, pentest-regulatory-intel, strixresearch, agentic-ai-pentesting, oscal-agent-guardrails, oscal-digital-twin-playground, oscal-zero-trust-lens, oscal-agent-lab, oscal-cac-playgd, agent-stack, deepagent-azure-cli, n8n2langraph, sst-autoresearch, projectpulse, tars-ai, screenlens, qwenbench-mlx, mlx-distillation-explained, local-mlx-responsesAPI-server, audiobook-generator, qwen3-vl-python-ocr-script-mlx, mlx-youtubescribe, gitnexus-fleet
- Wiki pages created (5): ai-security-pentesting, oscal-compliance, agentic-frameworks, local-inference-mlx, developer-tools
- Contradictions surfaced: 0
- Index.md updated: yes
- Home.md updated: yes
Result: No changes required. All 23 repos pulled — all already up to date. Change detection confirms:
- 23/23 GitHub repos cloned and stubbed
- 5/5 category wiki pages present and indexed
- 0 new repos, 0 updated repos, 0 stale pages, 0 orphan stubs, 0 unindexed pages
Observations for next run:
- 2 repos inactive >90 days: TSLIT (last commit 2025-12-05), N8n2langraph (2025-11-29) — will trigger stale-page warnings after next freshness check
- Most active repos: projectpulse (Apr 10), sst-autoresearch (Apr 9), screenlens (Apr 8)
- Tag `qwen` appears in 4+ stubs but is not in SCHEMA whitelist; consider adding
- MLX-YouTubeScribe uses both `chromadb` and `langgraph` but isn't cross-referenced with pentest-regulatory-intel (also uses both)
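The 90-day freshness rule behind the inactivity observation can be sketched as a plain date difference. Whether the real check uses a strict "greater than" and which reference date it uses are assumptions here; 2026-04-11 is only an illustrative "today".

```python
from datetime import date

FRESHNESS_DAYS = 90  # threshold stated in the schema


def is_stale(last_commit, today, threshold_days=FRESHNESS_DAYS):
    """True when the last commit is more than threshold_days before today."""
    return (today - last_commit).days > threshold_days


# TSLIT's last commit date from the observation above, checked against an
# assumed reference date.
tslit_stale = is_stale(date(2025, 12, 5), date(2026, 4, 11))
```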
Articles Ingest (22 published articles): Cloned github.com/sw30labs/articles-published and created article stubs + timeline page. Articles span 2023-04-23 to 2026-03-08 across all 5 wiki categories.
- Stubs created (22): revolutionizing-cyber-risk-advisory-ai, safeguarding-datasets-llm-era, transforming-crypto-mining-hardware, ai-generated-code-zero-trust, digital-twins-incident-response, asics-dethrone-gpus-llm, osi-model-natural-language-layer, ai-supply-chain-governance, asics-vs-gpus-grid, custom-silicon-gpt-story, rise-agentic-security, agentic-pentesting-layer7, 72-hour-civilization-agents, oscal-phantom-compliance, agentic-pipeline-code-optional, automated-ai-news-workflow, intelligence-event-horizon, understanding-agent-security-analogies, ai-agent-security-governance, one-person-conglomerate-reality, best-ai-unrestricted-access, zero-trust-ai-coding-boom
- Wiki pages created (1): articles-timeline (wiki/articles/timeline.md)
- Category pages updated (5): ai-security-pentesting (+11 articles), oscal-compliance (+2), agentic-frameworks (+4), local-inference-mlx (+4), developer-tools (+1)
- Index.md updated: yes — added Published Articles section
- Contradictions surfaced: 0
Operation: Full update using merged wiki-curator skill (articles integration built-in).
Repo sync: All 24 repos pulled (including articles-published). 0 new repos, 0 updated repos.
Articles sync: 22/22 articles already ingested. No new articles found.
New content:
- Created `raw/articles-published.stub.md` (repo-level stub for the articles collection; was previously missing)
- Updated `Index.md` to include the articles-published stub
Issues: None. Wiki fully current.
Observations:
- 2 repos remain inactive >90 days: TSLIT (last commit 2025-12-05), N8n2langraph (2025-11-29)
- Tag `qwen` still not in SCHEMA whitelist despite appearing in 4+ stubs
- All article stubs present but no individual article detail pages exist yet (only timeline + stubs)
Operator: wiki-curator (automated)
- Articles synced: Created 22 article detail pages in `wiki/articles/` from `articles-published` repo
  - AI Security: 16 articles
  - Local Inference & MLX: 4 articles
  - OSCAL & Compliance: 1 article
  - Agentic Frameworks: 1 article
- Timeline regenerated: `wiki/articles/timeline.md` now lists all 22 articles chronologically
- Index updated: Published Articles section now includes all article wiki pages (not just raw stubs)
- Sitemap generated: `Sitemap.md` created with Mermaid flowchart showing full wiki structure
- 24 repos on GitHub, 24 cloned, all up to date (no new commits)
- No new repos to ingest
- No updated repos to refresh
- No stale pages detected
- `articles-published` stub flagged as orphan (expected: it's a meta-stub not cited by wiki pages)
- Many slug-based article filenames lack embedded dates; used git commit dates as fallback
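The date fallback could look roughly like this: prefer a `YYYY-MM-DD` date embedded in the filename, otherwise ask git. The log does not say which commit supplies the date, so using the latest commit touching the file is an assumption, and both function names are hypothetical.

```python
import re
import subprocess

DATE_IN_NAME = re.compile(r"(\d{4}-\d{2}-\d{2})")


def git_commit_date(path, repo_dir="."):
    """Date (YYYY-MM-DD) of the latest commit touching path, via git log."""
    out = subprocess.run(
        ["git", "log", "-1", "--format=%cI", "--", path],
        cwd=repo_dir, capture_output=True, text=True, check=True,
    ).stdout.strip()
    return out[:10]  # %cI is strict ISO 8601; the first 10 chars are the date


def article_date(filename, fallback=git_commit_date):
    """Use the date embedded in the filename, else fall back to git."""
    match = DATE_IN_NAME.search(filename)
    if match:
        return match.group(1)
    return fallback(filename)
```

Passing the fallback as a parameter keeps the filename parsing testable without a git checkout.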
Picked up two new public repos in the sw30labs org since the last run:
wiki-vs-rag (agentic-frameworks) and nemotron-parse-spark
(developer-tools). All 35 GitHub repos cloned; no updated_repos,
no stale_pages, no orphan_stubs. Articles repo unchanged
(still 22 articles, all already mapped to wiki pages).
New stubs:
- `raw/wiki-vs-rag.stub.md`: Reproducible benchmark of single-shot RAG vs agentic-RAG vs wiki-navigation vs QMD over the sw30labs wiki corpus pinned at `c130a3b`. Headline result: agentic-RAG is the Pareto winner (4.77 quality vs 4.69 for wiki-nav at 45% of the cost; 11–2 vs 17 ties against single-shot RAG). QMD is cheapest ($0.005/query, no embedding API) but collapses on branch / cross-branch synthesis (2.83 / 3.37 vs 4.5+ for the others). Categorized `agentic-frameworks`, `stacks: [agentic, rag]`.
- `raw/nemotron-parse-spark.stub.md`: Test harness for NVIDIA Nemotron Parse v1.2 (<1B-param ViT-H + mBART document-understanding model) on DGX Spark (Grace Blackwell GB10, 128 GB unified memory). CLI + REPL + batch processor + benchmark; transformers and vLLM backends share a single postprocessing path. Notes the load-bearing 4-token v1.2 prompt and the 1024×1280 / 1664×2048 image-sizing contract. Categorized `developer-tools`, `stacks: [converter, cli]`.
Pages updated:
- `wiki/agentic-frameworks/agentic-frameworks.md`: bumped summary from 6 to 7 repos; added 3 key claims for wiki-vs-rag (the four-arm setup, the agentic-RAG Pareto result, the QMD branch/cross-branch collapse); added 2 open questions; tags now include `rag, benchmark`; updated to 2026-04-22.
- `wiki/developer-tools/developer-tools.md`: bumped summary from 7 to 8 repos; added 3 key claims for nemotron-parse-spark (NVIDIA Nemotron Parse on DGX Spark, the 4-token v1.2 prompt contract, the image-sizing envelope); added 1 open question (Nemotron vs DeepSeek-VL2 vs Qwen3-VL); tags now include `benchmark`; updated to 2026-04-22.
- `Home.md`: repo count 32 → 34, last-updated 2026-04-15 → 2026-04-22; added rows for wiki-vs-rag (Agentic Frameworks table) and nemotron-parse-spark (Developer Tools table).
- `Index.md`: added `wiki-vs-rag` and `nemotron-parse-spark` stub rows to the relevant category tables.
- `Sitemap.md` and `Sitemap-Stacks.md`: regenerated to pick up the new stubs (2 nodes added to agentic-frameworks and developer-tools diagrams; new entries in the agentic / rag / converter / cli stacks).
- 35 repos on GitHub, 35 cloned, all up to date.
- 2 new repos ingested (wiki-vs-rag, nemotron-parse-spark).
- 0 updated repos.
- 0 stale pages.
- 0 orphan stubs.
- Taxonomy validation: all 57 stubs have valid `category:` and known `stacks:` entries. 14 article stubs still report `stubs_missing_stacks`, which is informational and expected for opinion/analysis pieces.
- Articles repo unchanged since the previous run; no new article pages needed.
- The `wiki-vs-rag` repo benchmarks this very wiki as its corpus (pinned at commit `c130a3b`). The reflexive loop is a nice symptom that the wiki is now substrate for downstream agent work, not just documentation.
- The `nemotron-parse-spark` stub deliberately calls out the prompt / image-size landmines from the model card so future readers don't silently degrade output by tweaking either.