Skip to content

Document worai structured-data inventory command#29

Open
ziodave wants to merge 13 commits intomainfrom
feat/worai-structured-data-inventory-docs
Open

Document worai structured-data inventory command#29
ziodave wants to merge 13 commits intomainfrom
feat/worai-structured-data-inventory-docs

Conversation

@ziodave
Copy link
Member

@ziodave ziodave commented Feb 16, 2026

Summary

  • add worai structured-data inventory to scan sitemap URLs and export JSON-LD inventory to CSV or Google Sheets
  • add shared inventory core (worai/core/structured_data_inventory.py) with dataset URI resolution via WordLift API client, FAQPage checks, type normalization, and output writers
  • extract reusable sitemap parsing into worai/core/sitemap.py and point worai/seocheck/sitemap.py to the shared module
  • add tests for inventory logic and CLI wiring
  • update README, internal command docs, and TODO

Behavior

  • input: sitemap URL/path
  • extraction: JSON-LD only
  • FAQ checks:
    • faq_markup=yes when FAQPage exists
    • faq_markup_from_graph=yes when FAQPage.@id starts with current account dataset URI
  • output columns:
    • url
    • faq_markup
    • faq_markup_from_graph
    • types (schema.org prefix stripped)
    • structured_data (combined JSON-LD @graph)

Validation

  • uv run pytest (46 passed)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant