Vision-Language-Action sandbox that connects a PyBullet Franka Panda scene, YOLOv8 perception, a mock LLM parser, and a planner-driven controller via ROS 2.
- Simulation:
vla_simulationspins up PyBullet, camera publishing, and joint control endpoints. - Perception:
vla_perceptionruns YOLOv8 plus color fallback to publish detections. - Brain:
vla_brainprovides a mock LLM task parser and a planner that turns tasks into joint commands. - Launch:
ros2 launch vla_simulation vla_system.launch.py use_gui:=Truestarts the whole stack.
python3 -m venv venv && source venv/bin/activatepip install -r requirements.txt(ROS 2 providesrclpy)cd vla_ws && colcon build && source install/setup.bashros2 launch vla_simulation vla_system.launch.py use_gui:=True
