
Smart Grid Transformer Integration & Automated Scenario Generation Pipeline#288

Draft
Rohith-Kanathur wants to merge 30 commits into IBM:main from Rohith-Kanathur:feat/scenario-generator

Conversation

@Rohith-Kanathur

Overview

This PR delivers two major contributions to AssetOpsBench:

  1. A new Smart Grid Transformer asset class with four FMSR diagnostic tools grounded in IEC standards
  2. A multi-phase automated scenario generation pipeline (ScenarioGeneratorAgent) that scales benchmark creation to new asset classes without manual authoring

1. Smart Grid Transformer Integration

CouchDB Data

  • Added Smart Grid Transformer asset documents to CouchDB, including dissolved gas analysis (DGA) readings, winding temperature profiles, and electrical load profiles as new data sources.

Four New FMSR Tools

| Tool | Standard | Description |
| --- | --- | --- |
| `predict_health_index` | — | Supervised regression model trained on the Mendeley Transformer Health Dataset; predicts a 0–100 health score across five condition bands (Very Good → Very Poor) |
| `interpret_dga` | IEC 60599 | Applies the Rogers Ratio method to classify transformer fault type from dissolved gas concentrations, with a confidence rating |
| `assess_winding_temperature` | IEC 60076-7 | Computes insulation ageing rate and thermal risk from winding/oil temperature inputs |
| `assess_load_profile` | IEC 60076-7 | Derives three-phase apparent load and classifies loading status against cyclic loading limits |

Tests

  • Added unit tests for all four FMSR tools covering valid inputs, edge cases, and schema compliance.

2. Scenario Generation Pipeline (ScenarioGeneratorAgent)

A three-phase automated pipeline that generates physically plausible, causally consistent, and tool-reachable benchmark scenarios for any onboarded asset class.

Phase 1: Asset Profiling

Discovers live asset instances, sensors, and failure mappings from CouchDB, retrieves and synthesizes domain literature from ArXiv or Semantic Scholar, and merges everything with MCP tool schemas into a single structured AssetProfile that grounds all downstream generation.
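A rough shape of the merged profile object, assuming field names inferred from the description above (the real code uses a Pydantic model; this dataclass stand-in and its fields are illustrative only):

```python
from dataclasses import dataclass, field

@dataclass
class AssetProfile:
    """Stand-in for the pipeline's Pydantic AssetProfile: one structured
    record merging CouchDB discovery, literature, and tool schemas."""
    asset_class: str                      # e.g. "Smart Grid Transformer"
    instances: list = field(default_factory=list)   # live asset IDs from CouchDB
    sensors: dict = field(default_factory=dict)     # sensor name -> unit/modality
    failure_modes: list = field(default_factory=list)
    literature: list = field(default_factory=list)  # synthesized ArXiv/Semantic Scholar notes
    tool_schemas: dict = field(default_factory=dict)  # MCP tool name -> JSON schema
```

Everything downstream (budget allocation, per-focus generation) conditions on a single instance of this profile.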

Phase 2: Budget Allocation

Distributes the total scenario count across focuses (iot, fmsr, tsfm, wo, vibration, multiagent) in proportion to the asset's available data modalities and tool coverage, with the multiagent focus capped at 75% of the total budget to preserve lane diversity.
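The arithmetic behind this step can be sketched as proportional division with a cap and largest-remainder rounding. Note that `allocate_budget` and the weight inputs are hypothetical names for illustration; the real allocator is LLM-driven rather than a pure formula.

```python
def allocate_budget(total, weights, cap_focus="multiagent", cap_frac=0.75):
    """Split `total` scenarios across focuses proportionally to `weights`,
    capping `cap_focus` at `cap_frac` of the total."""
    wsum = sum(weights.values())
    shares = {k: total * w / wsum for k, w in weights.items()}
    # Enforce the cap, pushing any excess onto the other focuses pro rata.
    cap = cap_frac * total
    if shares.get(cap_focus, 0) > cap:
        excess = shares[cap_focus] - cap
        shares[cap_focus] = cap
        others = {k: v for k, v in shares.items() if k != cap_focus}
        osum = sum(others.values())
        for k in others:
            shares[k] += excess * others[k] / osum
    # Largest-remainder rounding so the integer counts sum exactly to `total`;
    # the capped focus is topped up last so it stays at or below its cap.
    counts = {k: int(v) for k, v in shares.items()}
    remainder = total - sum(counts.values())
    order = sorted(shares, key=lambda k: (k == cap_focus, counts[k] - shares[k]))
    for k in order[:remainder]:
        counts[k] += 1
    return counts
```

With 50 scenarios and a weight profile dominated by multiagent, the cap claws scenarios back into the single-agent lanes, which is exactly the "lane diversity" goal stated above.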

Phase 3: Scenario Generation & Validation

Generates per-focus scenarios conditioned on the asset profile, then runs each candidate through an LLM-based validate-and-repair step that corrects schema violations and tool misalignment before the scenario is accepted.
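The generate-validate-repair loop might look like the following sketch, with `repair_fn` standing in for the LLM repair call. The `validate` function here only checks required fields, whereas the real step also repairs tool alignment and characteristic_form quality; all names are illustrative.

```python
REQUIRED_FIELDS = ("id", "type", "text", "category", "characteristic_form")

def validate(candidate):
    """Return the list of schema problems for one scenario candidate."""
    return [f"missing: {f}" for f in REQUIRED_FIELDS if f not in candidate]

def generate_and_validate(candidates, repair_fn, max_attempts=2):
    """Keep candidates that validate, giving repair_fn (the LLM repair
    step in the real pipeline) a bounded number of chances to fix each."""
    accepted = []
    for cand in candidates:
        for _ in range(max_attempts):
            problems = validate(cand)
            if not problems:
                accepted.append(cand)
                break
            cand = repair_fn(cand, problems)  # LLM rewrite in the real pipeline
    return accepted
```

Bounding the repair attempts keeps a stubbornly malformed candidate from being accepted or retried forever; it is simply dropped.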

Output

Each run produces a timestamped directory with scenarios.json. Each scenario object contains an id, type, text, category, and characteristic_form.
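An illustrative record with the five fields named above might look like the following; the id, text, and category values are made up for the sketch, not taken from the PR's output.

```python
import json

# Hypothetical scenarios.json entry: every value below is illustrative.
scenario = {
    "id": "transformer_fmsr_001",
    "type": "fmsr",
    "text": "Dissolved gas readings for transformer T-103 show rising "
            "acetylene. Diagnose the likely fault type.",
    "category": "diagnosis",
    "characteristic_form": "The fault type is <fault_type> with "
                           "confidence <confidence>.",
}
print(json.dumps(scenario, indent=2))
```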

CLI

# Closed-form generation
uv run python -m scenarios.generator "Transformer" --num-scenarios 50

# Grounded open-form with live CouchDB data
uv run python -m scenarios.generator "Transformer" --data-in-couchdb --num-scenarios 50

Rohith-Kanathur and others added 30 commits April 11, 2026 23:32
…ine for benchmark scenario creation

Introduces a fully automated 4-phase scenario generation pipeline driven by LiteLLM:

PHASE 1 — Asset Profile Construction
- LLM generates targeted ArXiv search queries from the asset's canonical academic name
- Fetches PDFs via ArXiv API (up to 2 per query, first 5 pages extracted via pypdf)
- Synthesises sensor mappings, failure modes, ISO standards, and relevant tool mappings
  into an AssetProfile (Pydantic model); fatal if unparseable

PHASE 2 — Scenario Budget Allocation
- LLM dynamically distributes the total scenario count across 5 subagent categories
  (iot, fmsr, tsfm, wo, multiagent) based on the AssetProfile
- Multiagent capped at 50% of total; fatal if allocation is unparseable

PHASE 3 — Individual Agent Generation & Validation (iot / fmsr / tsfm / wo)
- Fetches 2 typed few-shot examples from ibm-research/AssetOpsBench on HuggingFace
- SCENARIO_GENERATOR_PROMPT produces raw scenario dicts per subagent
- VALIDATE_REPAIR_PROMPT corrects schema, tool alignment, and characteristic_form quality
- Validation diffs (before/after) written to numbered log files when --log is active

PHASE 4 — Multi-Agent Combiner
- MULTIAGENT_COMBINER_PROMPT seeds from up to 10 single-agent scenarios to produce
  complex cross-subagent orchestration scenarios (e.g. IoT → FMSR → WO)

CLI (python -m scenarios.generator):
  --num-scenarios N     Total scenarios to generate (default: 50)
  --output PATH         Output JSON path (default: generated_scenarios.json)
  --model-id MODEL      LiteLLM model override
  --show-workflow       Granular phase-by-phase terminal output with diffs
  --log                 Dump all raw prompts + responses to logs/<asset>_<ts>/

Supporting additions:
- models.py: AssetProfile, ScenarioBudget, Scenario Pydantic models
- prompts.py: 6 prompt templates (PROFILE_BUILDER, SCENARIO_GENERATOR,
  VALIDATE_REPAIR, MULTIAGENT_COMBINER, RESEARCH_QUERY_GENERATOR, BUDGET_ALLOCATOR)
- utils.py: fetch_arxiv_studies() with multi-query dedup + PDF extraction;
  fetch_hf_fewshot() with type-filtered HuggingFace loading + mock fallback
- Log header includes ArXiv paper titles and PDF URLs for full traceability
- src/scenarios/README.md: full usage docs, pipeline breakdown, output schema,
  troubleshooting table, and log file structure reference
