DriveFusion

drivefusion Public

DriveFusion is an open-source multimodal Vision–Language–Action model for autonomous driving that fuses visual perception, language understanding, and driving context (GPS and speed) to describe dr…

Python 1

data-preprocessing Public

Data preprocessing pipeline for the DriveFusionQA. It converts multiple autonomous-driving QA datasets into unified LLaMA and LLaVA-style instruction formats, with modular dataset preprocessors, JS…

Python

Evaluate-Models Public

Evaluation framework for the DriveFusion DriveFusionQA vision-language model, benchmarking Q&A performance on driving datasets using metrics like Lingo-Judge, BLEU, and BERTScore.

Python

drivefusion-train Public

Training framework for the DriveFusion project that fine-tunes and train LLMs and multimodal vision-language models for driving tasks. Built on LLaMAFactory, it adds dataset processing, distributed…

Python

carla-data-collection Public

Autonomous-driving data pipeline for DriveFusion project built on the CARLA Simulator, generating cleaned multi-modal sensor data and VQA annotations for training vision-language action models.

Python

car-deployment Public

Autonomous vehicle deployment system for DriveFusion combining ROS-based control with AI vision powered by the Qwen 2.5 vision-language model, enabling real-time driving, single-shot decisions, and…

Python

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DriveFusion

Popular repositories Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!