add shape matching task #6

Open
jyizheng wants to merge 2 commits into VBVR-DataFactory:main from jyizheng:yizheng/task4
@jyizheng jyizheng commented Jan 8, 2026

This pull request introduces a new "Shape Matching" visual reasoning task to the project. The main additions are a new task generator and its configuration, a command-line interface for dataset generation, and updates to the documentation and module exports. The task challenges models to move colored shapes into their matching outlines, and it supports both image and video outputs.

Shape Matching Task Implementation:

  • Added a new ShapeMatchingTask class in src/shape_matching_task.py that generates geometric shape matching tasks, including logic for rendering shapes, outlines, and generating animation frames for videos.
  • Introduced ShapeMatchingConfig for task configuration and ShapeMatchingGenerator for dataset generation, supporting adjustable shape count, size, and optional video output.
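
To illustrate how a configuration object and video frame generation might fit together, here is a minimal sketch. It is not the code from this PR: `SketchConfig`, its field names, and `interpolate_positions` are hypothetical, and plain linear interpolation is assumed as one simple way to animate a shape moving toward its outline.

```python
from dataclasses import dataclass
from typing import List, Tuple

Point = Tuple[float, float]


@dataclass
class SketchConfig:
    """Hypothetical stand-in for ShapeMatchingConfig; all field names are assumed."""
    num_shapes: int = 3           # how many shapes appear in one task
    image_size: int = 256         # canvas width and height in pixels
    generate_video: bool = False  # whether to also emit animation frames
    num_frames: int = 10          # frames per animation when video is enabled


def interpolate_positions(start: Point, target: Point, num_frames: int) -> List[Point]:
    """Return per-frame positions moving linearly from start to target."""
    if num_frames < 2:
        # A single frame can only show the final, matched position.
        return [target]
    frames = []
    for i in range(num_frames):
        t = i / (num_frames - 1)  # 0.0 on the first frame, 1.0 on the last
        x = start[0] + t * (target[0] - start[0])
        y = start[1] + t * (target[1] - start[1])
        frames.append((x, y))
    return frames


config = SketchConfig(generate_video=True, num_frames=3)
path = interpolate_positions((0.0, 0.0), (10.0, 20.0), config.num_frames)
```

With three frames, the shape starts at its origin, passes the midpoint, and ends exactly on the outline position. An easing function could replace the linear `t` if smoother motion is wanted.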

Command-line Interface:

  • Added a new script examples/generate_shape_matching.py to generate shape matching datasets with configurable parameters (number of samples, shapes, size, output directory, etc.) via command-line arguments.
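
A command-line interface of this kind could be sketched with `argparse` as follows. The flag names and defaults below are assumptions for illustration only; the actual options of `examples/generate_shape_matching.py` are not shown in this description and may differ.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with hypothetical flags for dataset generation."""
    parser = argparse.ArgumentParser(
        description="Generate a shape matching dataset (illustrative sketch)."
    )
    parser.add_argument("--num-samples", type=int, default=10,
                        help="number of task samples to generate")
    parser.add_argument("--num-shapes", type=int, default=3,
                        help="number of shapes per task")
    parser.add_argument("--size", type=int, default=256,
                        help="image width and height in pixels")
    parser.add_argument("--output-dir", default="data/shape_matching",
                        help="directory where samples are written")
    parser.add_argument("--video", action="store_true",
                        help="also generate a video for each sample")
    return parser


# Parse an explicit argument list so the sketch runs without a real command line.
args = build_parser().parse_args(["--num-samples", "5", "--video"])
print(args.num_samples, args.num_shapes, args.video)
```

In a real script, `parse_args()` would be called without arguments so that it reads from `sys.argv`.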

Documentation and Usage:

  • Updated README.md with a detailed section describing the Shape Matching Task, including task description, example usage, configuration options, and output structure.

Module Exports:

  • Updated src/__init__.py to export the new shape matching classes and prompt function, making them available for import elsewhere in the project.

- Reorganized shape matching task following G-1 template structure
- Domain: shape_matching, task_id format: shape_matching_XXXX
- Implements 4 shape types: circle, square, triangle, star
- Clean structure: core/ and src/ separation
- Includes video generation with smooth animation
- Follows all vm-dataset coding standards from rules.txt
- 9 top-level entries as required
- Ready for production use
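
The `shape_matching_XXXX` task ID format could be produced as in the sketch below. It assumes that `XXXX` stands for a four-digit, zero-padded sample index, which the commit message does not state explicitly. `make_task_id` is a hypothetical helper, not a function from this PR.

```python
DOMAIN = "shape_matching"
SHAPE_TYPES = ("circle", "square", "triangle", "star")  # the four types listed above


def make_task_id(index: int, width: int = 4) -> str:
    """Format a task ID as '<domain>_<zero-padded index>' (assumed convention)."""
    if index < 0:
        raise ValueError("index must be non-negative")
    return f"{DOMAIN}_{index:0{width}d}"


task_ids = [make_task_id(i) for i in range(3)]
# task_ids == ["shape_matching_0000", "shape_matching_0001", "shape_matching_0002"]
```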