Learning-Based Robot Motion Planning

This repository contains an implementation of a learning-based motion planning and control system for a ground robot with independently steered wheels. The project was developed as a technical assignment and focuses on correct modeling, stable learning, and clear visualization of results.

The solution demonstrates:

  • A kinematic robot model with curvature–velocity control
  • Training of a policy from scratch using reinforcement learning
  • Visualization of robot behavior using MCAP
  • Quantitative evaluation of navigation performance

Robot Model

The robot model comprises:

  • A rectangular rigid body
  • Four independently steered and driven wheels
  • Bicycle-style kinematic abstraction
  • Instantaneous Center of Rotation (ICR) constrained to the robot’s y-axis
  • Explicit limits on steering rate and wheel angular acceleration

The kinematic model is implemented in model.py.
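
For reference, the sketch below shows how a curvature–velocity update with the ICR constrained to the robot's y-axis can be written. The function names, the simple Euler integration, and the omission of steering-rate and wheel-acceleration limits are simplifying assumptions; the authoritative implementation is the one in model.py.

import numpy as np

# Minimal sketch of curvature-velocity kinematics (illustrative names; the
# actual model, including actuator limits, lives in model.py).
def step_pose(x, y, theta, kappa, v, dt):
    """Integrate the body pose one step; the heading rate is omega = v * kappa."""
    x += v * np.cos(theta) * dt
    y += v * np.sin(theta) * dt
    theta += v * kappa * dt
    return x, y, theta

def wheel_steer_angle(wx, wy, kappa):
    """Steering angle of a wheel located at (wx, wy) in the body frame.

    With the ICR at (0, 1/kappa) on the robot's y-axis, each wheel is steered
    tangent to its own arc around the ICR; kappa == 0 keeps the wheels straight.
    """
    return np.arctan(wx * kappa / (1.0 - wy * kappa))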


Environments

Single-Robot Environment

  • File: env_single.py
  • One robot navigating to a target from random initial poses
  • Continuous state and continuous action space
  • Used to validate dynamics and learning stability

Multi-Robot Environment

  • File: env_multi.py
  • Three robots, each with its own target
  • Shared policy controlling all robots
  • Observation includes relative target information and inter-robot geometry (a minimal observation sketch follows this list)
  • Designed to test coordination and scalable control
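
As a rough illustration of the observation layout described above, the sketch below builds a per-robot feature vector from the target and the other robots' positions, expressed in that robot's body frame. The function name and exact layout are assumptions; the real observation is defined in env_multi.py.

import numpy as np

# Illustrative per-robot observation: relative target plus inter-robot geometry,
# both rotated into the robot's body frame (layout assumed, see env_multi.py).
def robot_observation(pose, target, other_positions):
    x, y, theta = pose
    c, s = np.cos(theta), np.sin(theta)

    def to_body(px, py):
        dx, dy = px - x, py - y
        # world -> body frame: rotation by -theta
        return np.array([c * dx + s * dy, -s * dx + c * dy])

    features = [to_body(*target)]
    for ox, oy in other_positions:
        features.append(to_body(ox, oy))
    return np.concatenate(features)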

Learning Setup

Action Space

Each robot is controlled by:

  • curvature κ
  • forward velocity v

The action space is continuous. Reverse motion is disabled to simplify learning and improve stability.
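
The sketch below shows one common way to map a raw policy output in [-1, 1]² to a (κ, v) command with reverse disabled. The bounds are placeholder values, not the limits used in this repository.

import numpy as np

KAPPA_MAX = 1.0   # maximum curvature, 1/m (placeholder value)
V_MAX = 1.5       # maximum forward speed, m/s (placeholder value)

def to_control(action):
    # Curvature spans the full symmetric range.
    kappa = float(np.clip(action[0], -1.0, 1.0)) * KAPPA_MAX
    # Reverse motion is disabled: velocity is mapped to [0, V_MAX].
    v = (float(np.clip(action[1], -1.0, 1.0)) + 1.0) * 0.5 * V_MAX
    return kappa, v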

Reward Structure

The reward function encourages:

  • Continuous progress toward the target
  • Smooth steering behavior
  • Successful goal reaching

Each robot receives an individual reward upon reaching its target, and a terminal bonus is given when all robots reach their goals.
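
A reward of this shape can be sketched as a dense progress term, a penalty on steering changes, and a bonus on reaching the goal. The weights and the bonus value below are illustrative, not the values used in this repository.

def step_reward(prev_dist, dist, kappa, prev_kappa, reached):
    progress = prev_dist - dist                    # reward progress toward the target
    smoothness = -0.05 * abs(kappa - prev_kappa)   # discourage abrupt steering changes
    bonus = 10.0 if reached else 0.0               # individual goal-reaching bonus
    return progress + smoothness + bonus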


Training

Single-robot training:

python train_single.py

Multi-robot training:

python train_multi.py
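
For orientation, a training entry point could look like the sketch below, assuming a Stable-Baselines3 PPO setup and an environment class named SingleRobotEnv (both assumptions); the algorithm, library, and hyperparameters actually used are defined in train_single.py, train_multi.py, and requirements.txt.

from stable_baselines3 import PPO        # assumed library, see requirements.txt

from env_single import SingleRobotEnv    # assumed class name

env = SingleRobotEnv()
model = PPO("MlpPolicy", env, verbose=1)
model.learn(total_timesteps=1_000_000)
model.save("ppo_single_robot")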

Evaluation

Evaluation runs multiple episodes and reports the following metrics (a minimal aggregation sketch follows the list):

  • Success rate
  • Average number of steps per episode
  • Final distance to targets
  • Diagnostic spacing metrics in the multi-robot case
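
A minimal aggregation of these metrics over evaluation episodes might look like the sketch below; the per-episode tuples are assumed to be collected by the evaluation scripts.

import numpy as np

# Illustrative aggregation: each episode yields (reached, steps, final_distance).
def summarize(episodes):
    reached, steps, final_dists = zip(*episodes)
    return {
        "success_rate": float(np.mean(reached)),
        "avg_steps": float(np.mean(steps)),
        "avg_final_distance": float(np.mean(final_dists)),
    }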

Single-robot evaluation:

python eval_single.py

Multi-robot evaluation:

python eval_multi.py

Evaluation scripts can optionally generate performance plots.

Visualization with MCAP

Robot behavior is logged to MCAP files, which can be viewed in tools such as Foxglove Studio.

The visualizations show:

  • Robot body and wheel geometry
  • Steering configuration
  • Target locations
  • Executed trajectories

Generate Rollouts

Single-robot rollout:

python rollout_single_to_mcap.py

Multiple single-robot episodes:

python rollout_single_10eps_to_mcap.py

Multi-robot rollout:

python rollout_multi_to_mcap.py

The generated .mcap files can be opened directly in Foxglove Studio.
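
For context, writing such a file with the Python mcap package follows the pattern sketched below; the topic name and JSON schema are illustrative, and the project's actual logging lives in log_mcap.py.

import json
import time

from mcap.writer import Writer

# Minimal sketch: one JSON-encoded pose message written to an MCAP file
# (topic and schema are placeholders, not those used by log_mcap.py).
with open("rollout.mcap", "wb") as stream:
    writer = Writer(stream)
    writer.start()
    schema_id = writer.register_schema(
        name="RobotPose",
        encoding="jsonschema",
        data=json.dumps({"type": "object"}).encode(),
    )
    channel_id = writer.register_channel(
        topic="/robot/pose",
        message_encoding="json",
        schema_id=schema_id,
    )
    now = time.time_ns()
    writer.add_message(
        channel_id=channel_id,
        log_time=now,
        publish_time=now,
        data=json.dumps({"x": 0.0, "y": 0.0, "theta": 0.0}).encode(),
    )
    writer.finish()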

Reproducibility

Install dependencies:

pip install -r requirements.txt

Train a policy:

python train_single.py
# or
python train_multi.py

Evaluate the trained policy:

python eval_single.py
# or
python eval_multi.py

Generate an MCAP visualization:

python rollout_multi_to_mcap.py

All experiments start from random initialization and do not rely on pre-trained models.

Repository Structure

.
├── callbacks.py
├── env_single.py
├── env_multi.py
├── eval_single.py
├── eval_multi.py
├── log_mcap.py
├── model.py
├── rollout_single_to_mcap.py
├── rollout_single_10eps_to_mcap.py
├── rollout_multi_to_mcap.py
├── train_single.py
├── train_multi.py
├── requirements.txt
└── README.md

Notes

  • MCAP visualizations demonstrate learned behaviors.
  • Hyperparameters and reward shaping were chosen to prioritize stability and interpretability.
