Roe AI Python SDK

A Python SDK for the Roe AI API.

v1.0.802 — Version synchronization across the public SDKs: roe-ai (Python), roe-typescript, and roe-golang. The public SDK packages now share a single 1.0.x patch counter, driven by the SDK OpenAPI spec. Python friendly wrappers are generated from openapi/wrappers.yml; current generated facades include client.discovery and client.tables.

v1.0.0 — The SDK delegates to OpenAPI-generated types and transports (roe._generated); ergonomic wrappers on client.agents and client.policies remain. Noteworthy API and behavioral changes compared to earlier releases are listed in CHANGELOG.md.

Installation

uv add roe-ai

Quick Start

from roe import RoeClient

client = RoeClient(
    api_key="your-api-key",
    organization_id="your-org-uuid"
)

# Run an existing agent
job = client.agents.run(agent_id="agent-uuid", text="Analyze this text")
result = job.wait()

for output in result.outputs:
    print(f"{output.key}: {output.value}")

Or set environment variables:

export ROE_API_KEY="your-api-key"
export ROE_ORGANIZATION_ID="your-org-uuid"

Job Result Inspection

After waiting for a job, inspect its outcome using the JobStatus enum. The terminal status and error message are stuffed into the generated response's additional_properties and accessed via subscript:

from roe import JobStatus

result = job.wait()

# Check job outcome
if result["status"] in (JobStatus.SUCCESS, JobStatus.CACHED):
    for output in result.outputs:
        print(f"{output.key}: {output.value}")
elif result["status"] == JobStatus.CANCELLED:
    print("Job was cancelled")
elif result["status"] == JobStatus.FAILURE:
    print("Error:", result["error_message"])

# Available fields
result.outputs           # list[AgentDatum] (direct attribute on the generated model)
result["status"]         # JobStatus code (int) — set by Job.wait()
result["error_message"]  # Error string or None — set by Job.wait()

Errors

Non-2xx responses raise typed exceptions from roe.exceptions, all subclasses of RoeAPIException. Use them to handle expected failures without parsing error strings:

from roe.exceptions import (
    RoeAPIException,
    BadRequestError,            # 400 — validation / bad input
    AuthenticationError,        # 401 — missing or invalid API key
    InsufficientCreditsError,   # 402 — plan limit / billing
    ForbiddenError,             # 403 — feature or resource forbidden
    NotFoundError,              # 404 — resource not found
    ServerError,                # 5xx — server-side
)

try:
    client.agents.retrieve("00000000-0000-0000-0000-000000000000")
except NotFoundError as exc:
    print(exc.status_code, exc.message)

Every RoeAPIException also carries exc.headers (lowercase-keyed dict of the upstream response headers). Use it to read Retry-After on 429s or X-Request-Id for support tickets, without falling back to the raw httpx layer. Retry-After is preserved exactly as sent, so it may be numeric seconds or an HTTP-date:

except RoeAPIException as exc:
    if exc.status_code == 429 and exc.headers:
        retry_after = exc.headers.get("retry-after")

job.wait() does not raise on agent-side failures — instead the returned result carries result["status"] == JobStatus.FAILURE and result["error_message"]. Transport / HTTP errors hit the typed hierarchy above.

Raw API Access

When the ergonomic wrappers don't expose an endpoint you need, the generated client is available as client.raw and the operation modules live under roe._generated.api.<tag>.<operation_id>. Submodule names follow the upstream OpenAPI tags + operationIds and may shift across releases, so the portable form uses client.raw.get_httpx_client() to send a request through the same auth-configured httpx.Client:

from roe import RoeClient

client = RoeClient(api_key="your-api-key", organization_id="your-org-uuid")
response = client.raw.get_httpx_client().get("/v1/users/current_user/")
print(response.status_code)

For typed request/response models, call the generated operation module directly — see roe/_generated/api/ for the current surface.

Generated Friendly APIs

This block is synced from roe-main/roe-sdk/sdk_contract.yml during SDK fan-out.

engines = client.discovery.list_agent_engine_types()
models = client.discovery.list_supported_models(capability="text")

upload = client.tables.upload(
    table_name="customers",
    file="customers.csv",
    with_headers=True,
)

Agent Examples

Multimodal Extraction

Extract structured data from text and images:

agent = client.agents.create(
    name="Listing Analyzer",
    engine_class_id="MultimodalExtractionEngine",
    input_definitions=[
        {"key": "text", "data_type": "text/plain", "description": "Item description"},
    ],
    engine_config={
        "model": "gpt-4.1-2025-04-14",
        "text": "${text}",
        "instruction": "Analyze this product listing. Is it counterfeit?",
        "output_schema": {
            "type": "object",
            "properties": {
                "is_counterfeit": {"type": "boolean", "description": "Whether likely counterfeit"},
                "confidence": {"type": "number", "description": "Confidence score 0-1"},
                "reasoning": {"type": "string", "description": "Explanation"},
            }
        }
    }
)

job = client.agents.run(
    agent_id=str(agent.id),
    text="Authentic Louis Vuitton bag, brand new, $50"
)
result = job.wait()

Document Insights

Extract structured information from PDFs:

agent = client.agents.create(
    name="Resume Parser",
    engine_class_id="PDFExtractionEngine",
    input_definitions=[
        {"key": "pdf_files", "data_type": "application/pdf", "description": "Resume PDF"},
    ],
    engine_config={
        "model": "gpt-4.1-2025-04-14",
        "pdf_files": "${pdf_files}",
        "instructions": "Extract candidate information from this resume.",
        "output_schema": {
            "type": "object",
            "properties": {
                "name": {"type": "string"},
                "email": {"type": "string"},
                "skills": {"type": "array", "items": {"type": "string"}},
            }
        }
    }
)

job = client.agents.run(agent_id=str(agent.id), pdf_files="resume.pdf")
result = job.wait()

Web Insights

Extract data from websites with automatic screenshot/HTML/markdown capture:

agent = client.agents.create(
    name="Company Analyzer",
    engine_class_id="URLWebsiteExtractionEngine",
    input_definitions=[
        {"key": "url", "data_type": "text/plain", "description": "Website URL"},
    ],
    engine_config={
        "url": "${url}",
        "model": "gpt-4.1-2025-04-14",
        "instruction": "Extract company information from this website.",
        "vision_mode": False,
        "crawl_config": {
            "save_html": True,
            "save_markdown": True,
            "save_screenshot": True,
        },
        "output_schema": {
            "type": "object",
            "properties": {
                "company_name": {"type": "string"},
                "description": {"type": "string"},
                "products": {"type": "array", "items": {"type": "string"}},
            }
        }
    }
)

# Run the agent
job = client.agents.run(agent_id=str(agent.id), url="https://www.roe-ai.com/")
result = job.wait()

for output in result.outputs:
    print(f"{output.key}: {output.value}")

Interactive Web

Navigate websites and perform actions:

agent = client.agents.create(
    name="Meeting Booker",
    engine_class_id="InteractiveWebExtractionEngine",
    input_definitions=[
        {"key": "url", "data_type": "text/plain", "description": "Website URL"},
        {"key": "action", "data_type": "text/plain", "description": "Action to perform"},
    ],
    engine_config={
        "url": "${url}",
        "action": "${action}",
        "output_schema": {
            "type": "object",
            "properties": {
                "calendar_link": {"type": "string", "description": "Booking link found"},
                "steps_taken": {"type": "array", "items": {"type": "string"}},
            }
        }
    }
)

job = client.agents.run(
    agent_id=str(agent.id),
    url="https://www.roe-ai.com/",
    action="Find the founder's calendar link to book a meeting"
)
result = job.wait()

Rori Agents (Agentic Workflows)

Rori agents are autonomous investigation agents that follow policies (SOPs), use tools, and produce structured verdicts. Unlike extraction engines which transform data, Rori agents reason over evidence, apply policy rules, and return dispositions. All Rori agents are policy-aware — you define the rules, they run the investigation.

Policies

Policies define the rules, instructions, and disposition classifications that Rori agents follow. Creating a policy atomically creates the policy and its first version in one call:

policy = client.policies.create(
    name="AML Investigation Policy",
    content={
        "guidelines": {
            "categories": [
                {
                    "title": "Structuring",
                    "rules": [
                        {
                            "title": "Cash structuring below reporting thresholds",
                            "description": "Multiple deposits just under $10,000 within short timeframes",
                            "flag": "RED_FLAG",
                        }
                    ],
                },
                {
                    "title": "Layering",
                    "rules": [
                        {
                            "title": "Rapid movement between accounts",
                            "description": "Funds transferred through multiple accounts to obscure origin",
                            "flag": "RED_FLAG",
                            "sub_rules": [
                                {"title": "Cross-border wire transfers with no business purpose"},
                                {"title": "Shell company intermediaries"},
                            ],
                        }
                    ],
                },
            ]
        },
        "instructions": "Investigate the alert against each category. Use available data sources to gather evidence.",
        "dispositions": {
            "classifications": [
                {"name": "Suspicious", "description": "Activity warrants SAR filing"},
                {"name": "Not Suspicious", "description": "Activity has legitimate explanation"},
                {"name": "Needs Escalation", "description": "Requires senior analyst review"},
            ]
        },
        "summary_template": {
            "template": "Investigation of {{subject}} found {{verdict}} based on {{findings_count}} findings."
        },
    },
)

Iterate on policies by creating new versions:

# Create a new version (automatically becomes the current version)
new_version = client.policies.versions.create(
    policy_id=str(policy.id),
    content={...},  # Updated policy content
    version_name="v2 - added layering rules",
)

# List all versions
versions = client.policies.versions.list(policy_id=str(policy.id))

# Retrieve a specific version
version = client.policies.versions.retrieve(str(policy.id), str(new_version.id))

# Update policy metadata
client.policies.update(str(policy.id), name="Updated Policy Name")

# List all policies
policies = client.policies.list()

# Delete a policy
client.policies.delete(str(policy.id))

Policy Content Reference

Field	Type	Description
`guidelines`	object	Categories → Rules → Sub-rules hierarchy
`guidelines.categories[].title`	string	Category name
`guidelines.categories[].rules[].title`	string	Rule name
`guidelines.categories[].rules[].description`	string	Rule details
`guidelines.categories[].rules[].flag`	string	`"RED_FLAG"` or `"GREEN_FLAG"`
`guidelines.categories[].rules[].sub_rules[].title`	string	Sub-rule name
`instructions`	string	Free-text investigation instructions
`dispositions.classifications[].name`	string	Outcome label (e.g., "Suspicious")
`dispositions.classifications[].description`	string	When to apply this outcome
`summary_template.template`	string	Handlebars template for report generation
`optional.sar_narrative_template.template`	string	SAR narrative template (AML-specific)

Product Compliance

Analyze product listings against your compliance policy:

agent = client.agents.create(
    name="Product Compliance",
    engine_class_id="ProductPolicyEngine",
    input_definitions=[
        {"key": "product_listings", "data_type": "text/plain", "description": "Product listing to analyze"},
    ],
    engine_config={
        "policy_version_id": str(policy.current_version_id),
        "product_listings": "${product_listings}",
    },
)

result = client.agents.run_sync(
    agent_id=str(agent.id),
    product_listings="Nike Air Max 90, brand new, $45 — ships from Shenzhen",
)

AML Investigation

Investigate anti-money laundering alerts:

agent = client.agents.create(
    name="AML Investigation",
    engine_class_id="AMLInvestigationEngine",
    input_definitions=[
        {"key": "alert_data", "data_type": "text/plain", "description": "Alert data and context"},
    ],
    engine_config={
        "policy_version_id": str(policy.current_version_id),
        "alert_data": "${alert_data}",
    },
)

job = client.agents.run(
    agent_id=str(agent.id),
    alert_data="Customer John Doe, 5 cash deposits of $9,500 in 3 days",
)
result = job.wait()

Fraud Investigation

Investigate fraud alerts and suspicious activity:

agent = client.agents.create(
    name="Fraud Investigation",
    engine_class_id="FraudInvestigationEngine",
    input_definitions=[
        {"key": "alert_data", "data_type": "text/plain", "description": "Alert data and context"},
    ],
    engine_config={
        "policy_version_id": str(policy.current_version_id),
        "alert_data": "${alert_data}",
    },
)

job = client.agents.run(
    agent_id=str(agent.id),
    alert_data="Chargeback spike: 47 disputes in 24h from merchant ACME-1234",
)
result = job.wait()

Merchant Risk

Analyze merchant risk profiles:

agent = client.agents.create(
    name="Merchant Risk Analysis",
    engine_class_id="MerchantRiskEngine",
    input_definitions=[
        {"key": "merchant_context", "data_type": "text/plain", "description": "Merchant name and context"},
    ],
    engine_config={
        "policy_version_id": str(policy.current_version_id),
        "merchant_context": "${merchant_context}",
    },
)

job = client.agents.run(
    agent_id=str(agent.id),
    merchant_context="ACME Corp - Online electronics retailer, MCC 5732",
)
result = job.wait()

Agent Configuration Options

All Rori agents accept these options in engine_config:

Option	Type	Default	Description
`policy_version_id`	string	—	Policy version UUID (required)
`context_sources`	list	`[]`	External data sources (SQL connections, APIs)
`enable_planning`	bool	`true`	Enable autonomous tool-use planning
`enable_memory`	bool	`false`	Retain context across runs for the same entity
`reasoning_effort`	string	`"medium"`	`"low"`, `"medium"`, or `"high"`

Example with advanced configuration:

agent = client.agents.create(
    name="AML Investigation (Advanced)",
    engine_class_id="AMLInvestigationEngine",
    input_definitions=[
        {"key": "alert_data", "data_type": "text/plain", "description": "Alert data and context"},
    ],
    engine_config={
        "policy_version_id": str(policy.current_version_id),
        "alert_data": "${alert_data}",
        "reasoning_effort": "high",
        "context_sources": [
            {"type": "sql", "name": "Transactions DB", "connection_id": "conn-uuid"},
        ],
    },
)

Retry Behavior

Transient failures are retried with exponential backoff capped at about 10 seconds per attempt. By default there are up to 3 retries (configurable via max_retries or ROE_MAX_RETRIES):

client = RoeClient(
    api_key="your-api-key",
    organization_id="your-org-uuid",
    max_retries=5,  # default: 3
)

Retried: HTTP statuses 408, 429, and any 5xx, plus transport errors (for example disconnects and timeouts). JSON POST bodies may be replayed; multipart agent-run calls (run, run_sync, …) opt out via x-roe-skip-retry so they are not automatically retried at the transport layer.

Not retried immediately: Typical client/auth responses (401, 403, 404, validation 422, …) — surfaced as typed exceptions matching the SDK’s usual error mapping.

Batch Processing

When batch operations exceed 1,000 items, the SDK automatically chunks requests. A configurable delay (default: 10 seconds) is applied between chunks to avoid overwhelming the API. This applies to:

client.agents.run_many() — job submissions
client.agents.jobs.retrieve_status_many() — batch status checks
client.agents.jobs.retrieve_result_many() — batch result retrieval

Single-chunk batches (≤1,000 items) are unaffected.

You can configure the delay via the batch_chunk_delay parameter or the ROE_BATCH_CHUNK_DELAY environment variable:

client = RoeClient(
    api_key="your-api-key",
    organization_id="your-org-uuid",
    batch_chunk_delay=2.0,  # default: 10.0
)

Running Agents

# Async (recommended)
job = client.agents.run(agent_id="uuid", text="input")
result = job.wait()

# Sync
outputs = client.agents.run_sync(agent_id="uuid", text="input")

# With files (auto-uploaded)
job = client.agents.run(agent_id="uuid", document="file.pdf")

# Batch processing
batch = client.agents.run_many(
    agent_id="uuid",
    batch_inputs=[{"text": "input1"}, {"text": "input2"}]
)
results = batch.wait()

Metadata

You can attach arbitrary metadata to any job when running an agent. Metadata is a dictionary of key-value pairs that gets stored with the job, useful for tracking, filtering, or correlating jobs with your own internal records.

# Attach metadata to an async job
job = client.agents.run(
    agent_id="agent-uuid",
    metadata={"customer_id": "cust-123", "request_source": "api"},
    url="https://example.com",
)
result = job.wait()

# Attach metadata to a sync job
outputs = client.agents.run_sync(
    agent_id="agent-uuid",
    metadata={"batch": "2026-02-12", "priority": "high"},
    url="https://example.com",
)

# Attach metadata to a batch of jobs (applied to all jobs in the batch)
batch = client.agents.run_many(
    agent_id="agent-uuid",
    batch_inputs=[{"url": "https://a.com"}, {"url": "https://b.com"}],
    metadata={"campaign": "weekly-scan"},
)
results = batch.wait()

# Attach metadata when running a specific version
job = client.agents.run_version(
    agent_id="agent-uuid",
    version_id="version-uuid",
    metadata={"experiment": "v2-prompt"},
    url="https://example.com",
)

# Also works directly on agent and version models
agent = client.agents.retrieve("agent-uuid")
job = agent.run(metadata={"source": "sdk"}, url="https://example.com")

Agent Management

# List / Retrieve
agents = client.agents.list()
agent = client.agents.retrieve("uuid")

# Update / Delete
client.agents.update("uuid", name="New Name")
client.agents.delete("uuid")

# Duplicate
#
# Note: client.agents.duplicate(...) returns an AgentVersion (the new agent's
# first version), not a BaseAgent. The new agent's ID is reachable on the
# returned object as `.base_agent.id`:
duplicated_version = client.agents.duplicate("uuid")
new_agent_id = duplicated_version.base_agent.id

Version Management

# List and retrieve versions
versions = client.agents.versions.list("agent-uuid")
current = client.agents.versions.retrieve_current("agent-uuid")
version = client.agents.versions.retrieve("agent-uuid", "version-uuid")

# Create, update, delete versions
version = client.agents.versions.create(
    agent_id="agent-uuid",
    version_name="v2",
    input_definitions=[...],
    engine_config={...}
)

client.agents.versions.update("agent-uuid", "version-uuid", version_name="v2-updated")
client.agents.versions.delete("agent-uuid", "version-uuid")

# Run specific versions
job = client.agents.run_version("agent-uuid", "version-uuid", text="input")
result = job.wait()

Job Management

# Retrieve job status and results
status = client.agents.jobs.retrieve_status(job_id)
result = client.agents.jobs.retrieve_result(job_id)

# Batch operations
statuses = client.agents.jobs.retrieve_status_many([job_id1, job_id2])
results = client.agents.jobs.retrieve_result_many([job_id1, job_id2])

# Download references from jobs (screenshots, HTML, markdown)
content = client.agents.jobs.download_reference(job_id, resource_id)

# Delete job data
client.agents.jobs.delete_data(job_id)

Supported Models

Model	Value
GPT-5.4 Pro	`gpt-5.4-pro-2026-03-05`
GPT-5.4	`gpt-5.4-2026-03-05`
GPT-5.4 Mini	`gpt-5.4-mini-2026-03-17`
GPT-5.4 Nano	`gpt-5.4-nano-2026-03-17`
GPT-5.2	`gpt-5.2-2025-12-11`
GPT-5	`gpt-5-2025-08-07`
GPT-4.1	`gpt-4.1-2025-04-14`
Claude Opus 4.7	`claude-opus-4-7`
Claude Opus 4.6	`claude-opus-4-6`
Claude Sonnet 4.6	`claude-sonnet-4-6`
Claude Haiku 4.5	`claude-haiku-4-5-20251001`
Gemini 3.1 Pro	`gemini-3.1-pro-preview`
Gemini 3 Flash	`gemini-3-flash-preview`
Grok 4.20 Reasoning	`grok-4.20-0309-reasoning`

Engine Classes

Engine	ID
Multimodal Extraction	`MultimodalExtractionEngine`
Document Insights	`PDFExtractionEngine`
Document Segmentation	`PDFPageSelectionEngine`
Web Insights	`URLWebsiteExtractionEngine`
Interactive Web	`InteractiveWebExtractionEngine`
Web Search	`URLFinderEngine`
Perplexity Search	`PerplexitySearchEngine`
Maps Search	`GoogleMapsEntityExtractionEngine`
LinkedIn Crawler	`LinkedInScraperEngine`
Social Media	`SocialScraperEngine`
Product Compliance	`ProductPolicyEngine`
Merchant Risk	`MerchantRiskEngine`
AML Investigation	`AMLInvestigationEngine`
Fraud Investigation	`FraudInvestigationEngine`

Name		Name	Last commit message	Last commit date
Latest commit History 127 Commits
.github/workflows		.github/workflows
examples		examples
openapi		openapi
scripts		scripts
src/roe		src/roe
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

Roe AI Python SDK

Installation

Quick Start

Job Result Inspection

Errors

Raw API Access

Generated Friendly APIs

Agent Examples

Multimodal Extraction

Document Insights

Web Insights

Interactive Web

Rori Agents (Agentic Workflows)

Policies

Policy Content Reference

Product Compliance

AML Investigation

Fraud Investigation

Merchant Risk

Agent Configuration Options

Retry Behavior

Batch Processing

Running Agents

Metadata

Agent Management

Version Management

Job Management

Supported Models

Engine Classes

Links

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages