Added Image and video gen interactive tasks by ParamThakkar123 · Pull Request #31 · transformerlab/transformerlab-examples

ParamThakkar123 · 2026-04-01T14:03:11Z

Changes

Added image-gen-interactive/ directory with interactive image and video generation
Created main.py with Gradio interface supporting multiple diffusion models (SDXL, SD, Flux, etc.)
Added support for text-to-image and text-to-video generation
Implemented model caching and GPU optimization
Added task.yaml with resource requirements and setup dependencies

Features

Interactive generation of images and videos using state-of-the-art diffusion models
Support for multiple model architectures (Stable Diffusion XL, Flux, ModelScope, etc.)
Configurable parameters: model selection, prompts, dimensions, inference steps, guidance scale
Video generation with frame control and automatic MP4 export
Optimized for GPU acceleration with torch.float16

Parameters

HF_TOKEN: HuggingFace token (required, set as secret)
Model selection from predefined list
Prompt and negative prompt inputs
Width/height, steps, guidance scale controls

How to Test

In TransformerLab, select the 'image-gen-interactive' task
Ensure HF_TOKEN secret is set in app settings
Configure generation parameters
Run the task and access the Gradio interface
Test image generation with different models and prompts
Try video generation (requires compatible models like ModelScope)

greninja · 2026-04-17T20:01:41Z

some feedback till now:

ideally on changing the model from the default (SDXL 1.0 (1024x1024)) to say SD 1.5 (512x512) it should automatically update the width and height to 512 respectively -- but currently doenst happen
maybe we can have a short description or explainer explaining what "Inference steps" and "Guidance scale" is?
Not sure if this is anything to do with the code in this PR but I get this error:

/home/shadab/projects/transformerlab/transformerlab-examples/.venv/lib/python3.11/site-packages/diffusers/image_processor.py:142: RuntimeWarning: invalid value encountered in cast
  images = (images * 255).round().astype("uint8")

maybe some pixel values in the generated image are NaN or inf, so when it tries to cast them to uint8 (0-255), the result is undefined. I am guessing it won't crash the app just that the output image may have corrupted pixels, so ok to ignore for testing purposes and focus on other fixes.

the task.yaml file structure maybe slightly off. Importing it throws this error:

  title: Extra inputs are not permitted; cpus: Extra inputs are not permitted; memory: Extra inputs are not permitted; accelerators: Extra inputs are not permitted; env_vars: Extra inputs are not permitted; description: Extra
  inputs are not permitted; interactive: Extra inputs are not permitted

maybe can refer this: task-submission. Specifically, based on the template in TaskYamlSpec, these fields are not allowed at the top level:


  - title
  - command (should be run)
  - cpus (should be nested under resources)
  - memory (should be nested under resources)
  - accelerators (should be nested under resources)
  - env_vars (should be envs)
  - description
  - interactive

Also, no matter my prompt ("testing", "draw a cat playing soccer" or even "a sky full of butterflies" or something) it throws this:
"Potential NSFW content was detected in one or more images. A black image will be returned instead. Try again with a different prompt and/or seed."
and then just spits s black image.

greninja · 2026-04-17T20:57:43Z

the task.yaml is missing github_repo_url and github_repo_dir

greninja

refer feedback comments above

Added Image and video gen interactive tasks

8072288

greninja self-assigned this Apr 17, 2026

greninja requested changes Apr 17, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added Image and video gen interactive tasks#31

Added Image and video gen interactive tasks#31
ParamThakkar123 wants to merge 1 commit into
mainfrom
add/image-gen-interact

ParamThakkar123 commented Apr 1, 2026 •

edited

Loading

Uh oh!

greninja commented Apr 17, 2026 •

edited

Loading

Uh oh!

greninja commented Apr 17, 2026

Uh oh!

greninja left a comment •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ParamThakkar123 commented Apr 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Features

Parameters

How to Test

Uh oh!

greninja commented Apr 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

greninja commented Apr 17, 2026

Uh oh!

greninja left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ParamThakkar123 commented Apr 1, 2026 •

edited

Loading

greninja commented Apr 17, 2026 •

edited

Loading

greninja left a comment •

edited

Loading