Internship Research: Generalized VLA and Failure Detection

This repository contains the full lifecycle of an internship research project focused on Generalizing Vision-Language-Action (VLA) models and implementing Temporal-Difference Quality Calibration (TDQC) for proactive failure detection in robotic manipulation.

🚀 Project Overview

The project is divided into two major tracks:

Literature & Structured Research: Mapping the state-of-the-art in unseen object exploration, world models, and VLA uncertainty.
Implementation (Phase 1 & 2): Fine-tuning the SimVLA model on LIBERO datasets and developing a standalone LSTM-based failure calibrator.

📂 Repository Structure

🔬 Research Track

00_subjects/: Official internship briefs and requirements.
02_search_strategy/ & 03_search_runs/: Comprehensive literature search history and paper shortlists.
04_structured_research/: Deep-dive analysis and field schemas for "Unseen Object Exploration" and "World Models."
06_papers/: Local repository of key PDF papers and reading manifests.

💻 Implementation Track (`intern_ship_ws/`)

SimVLA/: The base VLA model repository (SmolVLM backbone).
envs/simvla/: Dedicated Conda environment (Python 3.10, PyTorch, CUDA 12.4).
phase2_tdqc_standalone/: [ACTIVE] Consolidated failure detection project.
config/ & data/: Simulation settings and LIBERO datasets.

📍 Current Project State (April 2026)

We have successfully completed the training of the Phase 2 TDQC LSTM Calibrator.

Model: 1-layer, 128-unit LSTM.
Status: Finalized (Stage 5 Polish).
Checkpoint: intern_ship_ws/phase2_tdqc_standalone/results/checkpoints/lstm_td0_final_polish_v2/best.pt
Primary Metric: Global Brier Score of 0.0823.

🛠 Quick Start for Gemini

If you are resuming this project in a new session:

Check intern_ship_ws/phase2_tdqc_standalone/README.md for the failure detection metrics.
Activate the environment: source intern_ship_ws/activate_simvla.sh.
Ensure PYTHONPATH includes intern_ship_ws/phase2_tdqc_standalone/code.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.claude		.claude
00_subjects/official_briefs		00_subjects/official_briefs
01_context		01_context
02_search_strategy		02_search_strategy
03_search_runs		03_search_runs
04_structured_research		04_structured_research
05_external_assets		05_external_assets
06_papers		06_papers
docs		docs
intern_ship_ws		intern_ship_ws
plans/state		plans/state
.codex		.codex
.gitignore		.gitignore
=1.2.0		=1.2.0
GEMINI.md		GEMINI.md
MIGRATION_NOTES.md		MIGRATION_NOTES.md
README.md		README.md
REMOTE_EXPERIMENT_GUIDE.md		REMOTE_EXPERIMENT_GUIDE.md
WORKSPACE_GUIDE.md		WORKSPACE_GUIDE.md
check_misaligned.py		check_misaligned.py
check_new_dataset.py		check_new_dataset.py
check_ood_tasks.py		check_ood_tasks.py
check_target_ood.py		check_target_ood.py
check_train_tasks.py		check_train_tasks.py
check_v7_train.py		check_v7_train.py
inspect_dataset.py		inspect_dataset.py
inspect_v2.py		inspect_v2.py
inspect_v2_deep.py		inspect_v2_deep.py
launch_training.sh		launch_training.sh
launch_v8_training.sh		launch_v8_training.sh
project_context_dump.txt		project_context_dump.txt
prompt.txt		prompt.txt
run_misaligned_training.sh		run_misaligned_training.sh
training_log.txt		training_log.txt
ultimate_project_context.txt		ultimate_project_context.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Internship Research: Generalized VLA and Failure Detection

🚀 Project Overview

📂 Repository Structure

🔬 Research Track

💻 Implementation Track (`intern_ship_ws/`)

📍 Current Project State (April 2026)

🛠 Quick Start for Gemini

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Internship Research: Generalized VLA and Failure Detection

🚀 Project Overview

📂 Repository Structure

🔬 Research Track

💻 Implementation Track (intern_ship_ws/)

📍 Current Project State (April 2026)

🛠 Quick Start for Gemini

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

💻 Implementation Track (`intern_ship_ws/`)

Packages