Visualization and Analysis

CUCo provides an interactive web UI and programmatic plotting utilities for exploring evolution results.

Web UI

Launching

# Open a specific database
cuco_visualize --db workloads/ds_v3_moe/results_ds_v3_moe/evolution_db.sqlite --open

# Browse all databases under a directory
cuco_visualize /path/to/results --open

# Custom port
cuco_visualize --db results/evolution_db.sqlite -p 9000

Flag	Default	Description
`root_directory`	`.`	Directory to scan for `.sqlite` databases
`--db`	`None`	Path to a specific database file
`-p`, `--port`	`8000`	HTTP server port
`--open`	`off`	Open browser automatically

Features

The web UI is a single-page application served at http://localhost:8000/. It provides several views:

Tree View

Displays the evolution lineage as an interactive tree:

Each node is a candidate program
Edges connect parents to children
Node color reflects fitness score (viridis colormap)
Gold highlight marks the best program
Red marks incorrect (failed verification) candidates
Click any node to view its code, metrics, and diff from parent

Programs Table

A sortable table of all candidates:

Column	Description
Rank	Fitness rank (1 = best)
Generation	When it was created
Score	`combined_score`
Patch Type	`diff`, `full`, or `cross`
Island	Island index
Model	LLM model used
API Cost	LLM query cost
Correct	Pass/fail

Click any row to see the full program, its mutation diff, and evaluation feedback.

Metrics View

Plots of evolution progress:

Best score over generations
Individual candidate scores (scatter)
Cumulative API cost

Embeddings View

Code similarity heatmap:

Cosine similarity between all candidate embeddings
Sortable by chronological order, cluster, or performance
Helps identify population diversity and convergence

Clusters View

PCA-based visualization of candidate embeddings in 2D/3D space, colored by fitness or island.

Islands View

Per-island population breakdown and migration history.

Meta Scratchpad

Browse the meta-summarizer's output:

Generation slider to view summaries over time
Optimization recommendations
Strategy patterns (what worked, what failed)
Download as PDF

API Endpoints

The web UI exposes JSON APIs for programmatic access:

Endpoint	Method	Description
`/list_databases`	`GET`	List all `.sqlite` databases found under the root directory
`/get_programs?db_path=...`	`GET`	Return all programs from a database as JSON
`/get_meta_files?db_path=...`	`GET`	List `meta_N.txt` files
`/get_meta_content?db_path=...&filename=...`	`GET`	Return content of a meta file

Plotting Utilities

CUCo includes matplotlib-based plotting functions for publication-quality figures.

Lineage Tree

from cuco.plots import plot_lineage_tree
from cuco.utils.load_df import load_evolution_df

df = load_evolution_df("results_ds_v3_moe/evolution_db.sqlite")
fig = plot_lineage_tree(df)
fig.savefig("lineage.pdf")

Produces a NetworkX graph with:

Node shapes: circle (diff), square (full), triangle (init), plus (cross), star (best), X (incorrect)
Viridis color scale by fitness
Path to best node emphasized
Colorbar for combined score

Requires GraphViz for optimal layout (dot engine). Falls back to manual hierarchical layout.

Improvement Plot

from cuco.plots import plot_improvement

fig = plot_improvement(df)
fig.savefig("improvement.pdf")

Shows:

Cumulative best score over generations (line)
Individual candidate scores (scatter)
Path to best node (dashed line)
Optional second y-axis for cumulative API cost

Pareto Front

from cuco.plots import plot_pareto

fig = plot_pareto(df, x_metric="time_ms", y_metric="complexity")
fig.savefig("pareto.pdf")

2D Pareto front visualization:

Pareto-optimal candidates highlighted
Dominated candidates dimmed
Frontier line connecting Pareto points

Embedding Similarity

from cuco.plots import plot_embed_similarity

fig = plot_embed_similarity(embeddings, performances)
fig.savefig("similarity.pdf")

Two-panel figure:

Cosine similarity heatmap (seaborn)
Performance heatmap
Optional hierarchical clustering for ordering

Code Evolution Animation

from cuco.plots.code_path_anim import create_evolution_video

create_evolution_video(
    results_dir="results_ds_v3_moe",
    output_path="evolution.mp4",
    resolution=(3840, 2160),
    fps=25,
)

Generates an MP4 video showing the code evolution along the path to the best candidate:

Syntax highlighting (Pygments)
Diff highlighting for changes
Scrolling for long files
History panes showing previous states

Requires MoviePy and PIL.

Reading the Database Directly

The SQLite database can be queried directly for custom analysis:

import sqlite3
import json

conn = sqlite3.connect("results_ds_v3_moe/evolution_db.sqlite")
cursor = conn.cursor()

# Get the best program
cursor.execute("""
    SELECT id, generation, combined_score, code
    FROM programs
    WHERE correct = 1
    ORDER BY combined_score DESC
    LIMIT 1
""")
best = cursor.fetchone()
print(f"Best: gen {best[1]}, score {best[2]:.2f}")

# Score distribution per generation
cursor.execute("""
    SELECT generation, AVG(combined_score), MAX(combined_score), COUNT(*)
    FROM programs
    WHERE correct = 1
    GROUP BY generation
    ORDER BY generation
""")
for gen, avg, mx, count in cursor.fetchall():
    print(f"Gen {gen}: avg={avg:.2f}, max={mx:.2f}, n={count}")

# Programs by island
cursor.execute("""
    SELECT island_idx, COUNT(*), AVG(combined_score)
    FROM programs
    WHERE correct = 1
    GROUP BY island_idx
""")
for island, count, avg in cursor.fetchall():
    print(f"Island {island}: {count} programs, avg score {avg:.2f}")

conn.close()

Schema Reference

The programs table columns:

Column	Type	Description
`id`	`TEXT`	Unique program ID
`code`	`TEXT`	Full source code
`language`	`TEXT`	Source language
`parent_id`	`TEXT`	Parent program ID (NULL for seed)
`archive_inspiration_ids`	`TEXT`	JSON list of archive inspiration IDs
`top_k_inspiration_ids`	`TEXT`	JSON list of top-k inspiration IDs
`island_idx`	`INTEGER`	Island assignment
`generation`	`INTEGER`	Generation number
`timestamp`	`TEXT`	Creation timestamp
`code_diff`	`TEXT`	Mutation diff from parent
`combined_score`	`REAL`	Fitness score
`public_metrics`	`TEXT`	JSON timing/metrics
`private_metrics`	`TEXT`	JSON internal metrics
`text_feedback`	`TEXT`	LLM feedback string
`correct`	`INTEGER`	0 or 1
`complexity`	`TEXT`	JSON code complexity metrics
`embedding`	`BLOB`	Code embedding vector
`in_archive`	`INTEGER`	Whether in MAP-Elites archive
`metadata`	`TEXT`	JSON metadata

Additional Tables

metadata_store — key-value store for evolution state (e.g., last_iteration)
archive — MAP-Elites archive tracking

Results Directory Structure

results_my_workload/
├── evolution_db.sqlite          # SQLite database (all candidates)
├── experiment_config.yaml       # EvolutionConfig snapshot
├── meta_memory.json             # Meta-learning state
│
├── best/                        # Symlink to best generation
│   ├── my_kernel.cu             # Best evolved kernel
│   └── results/
│       ├── metrics.json
│       └── correct.json
│
├── gen_0/                       # Generation 0 (seed or first mutation)
│   ├── my_kernel.cu             # Evolved program (after patching)
│   ├── original.cu              # Parent code (before patching)
│   ├── main.cu                  # Copy used for evaluation
│   ├── edit.diff                # Diff: original → evolved
│   ├── rewrite.txt              # LLM raw output (for full rewrites)
│   └── results/
│       ├── metrics.json         # Score, timing, feedback
│       ├── correct.json         # Correctness verdict
│       ├── build.log            # nvcc compiler output
│       ├── run.log              # mpirun stdout/stderr
│       ├── job_log.out          # Job scheduler stdout
│       └── job_log.err          # Job scheduler stderr
│
├── gen_1/
│   └── ...
│
├── meta_8.txt                   # Meta-summary at generation 8
├── meta_12.txt                  # Meta-summary at generation 12
└── meta_16.txt                  # Meta-summary at generation 16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Visualization and Analysis

Web UI

Launching

Features

Tree View

Programs Table

Metrics View

Embeddings View

Clusters View

Islands View

Meta Scratchpad

API Endpoints

Plotting Utilities

Lineage Tree

Improvement Plot

Pareto Front

Embedding Similarity

Code Evolution Animation

Reading the Database Directly

Schema Reference

Additional Tables

Results Directory Structure

FilesExpand file tree

visualization.md

Latest commit

History

visualization.md

File metadata and controls

Visualization and Analysis

Web UI

Launching

Features

Tree View

Programs Table

Metrics View

Embeddings View

Clusters View

Islands View

Meta Scratchpad

API Endpoints

Plotting Utilities

Lineage Tree

Improvement Plot

Pareto Front

Embedding Similarity

Code Evolution Animation

Reading the Database Directly

Schema Reference

Additional Tables

Results Directory Structure