Image Workflow

A simple black-and-white web application that generates images through the fal.ai workflow API, with AI-powered object detection.

Features

Automatic Object Detection: Uses Google's Gemini 2.5 Flash to automatically identify the most prominent object in your uploaded image
AI-Powered Segmentation: Automatically segments and styles the detected object using LoRA
Real-time Progress: Shows step-by-step progress from image analysis to final generation

Setup

Install dependencies:

npm install

Create a .env file in the project root with your API keys:

FAL_KEY=your_fal_api_key_here
REPLICATE_API_TOKEN=your_replicate_api_token_here

Get your API keys from:

FAL API Key: https://fal.ai/dashboard/keys
Replicate API Token: https://replicate.com/account/api-tokens
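
As a minimal sketch, a server could check at startup that both keys are present. The helper name missingKeys is hypothetical and is not taken from server.js. The sketch also assumes the variables have already been loaded into process.env, for example by a package such as dotenv.

```javascript
// Hypothetical startup check; not part of the project's server.js.
const REQUIRED_KEYS = ["FAL_KEY", "REPLICATE_API_TOKEN"];

// Returns the names of required variables that are unset or blank.
function missingKeys(env) {
  return REQUIRED_KEYS.filter((name) => !env[name] || env[name].trim() === "");
}

const missing = missingKeys(process.env);
if (missing.length > 0) {
  console.warn(`Missing environment variables: ${missing.join(", ")}`);
}
```

Warning instead of exiting is a design choice for this sketch; a real server may prefer to stop immediately when a key is absent.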

Start the server:

npm start

Open your browser to http://localhost:3000

Usage

Upload an image using the file input on the left side
Click the "Generate" button
Gemini AI will automatically detect the most prominent object in your image
Wait for the image to be generated (a progress bar appears on the right side)
Generated images are saved in the gen-images/ folder
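
The last step can be sketched as follows. This is one possible way to write a result into gen-images/, not the code from server.js. The function names, the filename scheme (object name plus timestamp), and the .png extension are all assumptions made for illustration.

```javascript
const fs = require("node:fs");
const path = require("node:path");

// Hypothetical naming scheme: <object>-<timestamp>.png inside gen-images/.
function outputPath(baseDir, objectName, now = new Date()) {
  // ISO timestamps contain ":" and ".", which are awkward in filenames.
  const stamp = now.toISOString().replace(/[:.]/g, "-");
  const safe =
    objectName
      .toLowerCase()
      .replace(/[^a-z0-9]+/g, "-")
      .replace(/^-+|-+$/g, "") || "image";
  return path.join(baseDir, "gen-images", `${safe}-${stamp}.png`);
}

// Writes the image bytes, creating gen-images/ if it does not exist yet.
function saveImage(baseDir, objectName, bytes) {
  const target = outputPath(baseDir, objectName);
  fs.mkdirSync(path.dirname(target), { recursive: true });
  fs.writeFileSync(target, bytes);
  return target;
}
```

A call such as saveImage(__dirname, "dog", buffer) would return the path of the file it wrote.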

Project Structure

public/ - Frontend files (HTML, CSS, JavaScript)
server.js - Express backend server with fal.ai and Replicate/Gemini integration
config.js - Configuration for workflow parameters
lora/ - LoRA model files for styling
gen-images/ - Generated images are saved here
uploads/ - Temporary storage for uploaded images

How It Works

Upload: You upload an image through the web interface
Analysis: The image is sent to Google's Gemini 2.5 Flash API, which analyzes it and returns a one-word description of the most prominent object
Segmentation: The detected object name is used as the text prompt for the segmentation workflow
Styling: The segmented object is rendered in the SK3TCHING style (red marker on white background)
Result: The final stylized image is displayed and saved locally
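
The steps above can be sketched as a small orchestration function. This is an illustration under stated assumptions, not the project's implementation. The step functions (detectObject, segment, style) are injected stand-ins, not real Gemini or fal.ai calls, and the names toPrompt and runPipeline are hypothetical. Segmentation and styling are shown as separate calls for clarity, although the actual workflow may run them in a single request.

```javascript
// Sketch only: no real API calls are made here.

// Reduce a model reply such as " Dog.\n" to a single lowercase word.
function toPrompt(reply) {
  const words = reply.trim().toLowerCase().match(/[\p{L}\p{N}]+/gu);
  if (!words) throw new Error("No object name in model reply");
  return words[0];
}

// Runs the stages in order and reports each stage name through onProgress.
async function runPipeline(imagePath, steps, onProgress = () => {}) {
  onProgress("analysis");
  const prompt = toPrompt(await steps.detectObject(imagePath));

  onProgress("segmentation");
  const segmented = await steps.segment(imagePath, prompt);

  onProgress("styling");
  const styled = await steps.style(segmented);

  onProgress("result");
  return { prompt, image: styled };
}
```

Passing the steps in as arguments keeps the sketch testable without network access; a progress callback like onProgress is one way the step-by-step progress display could be driven.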
