Reconstructing Hand-Object Interactions with 3D Gaussian Splatting

$\downarrow$

(See videos here: Hand-Object Interaction, Human-Object Interaction)

Overview

This project essentially evolved out of our initial goal, which was to outperform HOLD in an identical experiment setting with a more versatile/efficient pipeline. To this end, we replace the neural rendering approach from implicit Signed-Distance-Fields with explicit 3D 2D-Gaussian Splatting.

We achieve equal or better results than the baseline with a more simple and efficient approach (i.e. w/o modeling the background or jointly optimizing the H/O poses) & reduce training & rendering times from 12h -> 10min & 2 min/frame -> 60FPS respectively.

With respect to the hand/hand- object related metrics we see great improvements. In terms of object-related metrics we are slightly sub-par. That's because, unlike HOLD, our method can't fully reconstruct the object in regions that are always occluded.

Instead of focusing on in-hand/monocular occluded object reconstruction, we plan to extend the current pipeline to the two-hand case by replacing MANO with SMPL-X and evaluate its performance quantitatively on ARCTIC and qualitatively on in-the-wild videos (i.e. YouTube unboxing videos or similar).

	CD (↓)	CDₕ (↓)	F₁₀ (↑)	MPJPE (↓)
Ø ours, latest	0.64	4.06	92.78	25.7
HOLD	0.4	11.3	96.5	24.2

Conclusion

We successfully migrated both the data & appearance pipelines to the human-object interaction scenario. To the best of our knowledge this is the first such work using SMPLX to reconstruct intricate hand-object interactions. (Bi-Manual HOI either uses two dis-joint MANO layers or simply SMPL.)

Local Installation

⚠️ ******************************************************************** ⚠️

Clone this repo with the --recursive flag. This also clones the data pipeline its submodules.

*****************************************************************************************
*******************************

Setup the handobjectreconstruction environment.
```
./scripts/setup.sh
```
If this fails, please install the dependencies manually. This environment is based on the conda environment from the camera_motion submodule & includes the dependencies for the data pipeline.
Setup the data pipeline
```
cd datapipeline/submodules/camera_motion
```
Continue here: datapipeline/README.md.

After processing a demo sequence, you should have the following folder structure:
```
→ tree datapipeline/data
data
└── demo
    ├── data.pt
    ├── masks.mp4
    ├── out.mp4
    └── video.mp4
```
Let's link the data folder to the root directory.
```
ln -s datapipeline/data data
```
Alternatively, unzip ./assets/demo.zip to ./data/demo to skip the preprocessing pipeline.
```
unzip assets/demo.zip -d data/
```
Add third-party files

MANO_SMPLX_vertex_ids.pkl & SMPL-X__FLAME_vertex_ids.npy are missing missing in the model_registry. Download these files from the SMPL-X Website & add them to ./assets/.

[Optional] Create a .env file with your COMET_API_KEY=****** to monitor experiments online.
Run the demo.

NOTE: On the first run gsplat is compiled, which might take a few minutes.
```
python train.py
```
COMMENT: This command loads ./configs/debug_fast.yaml, which can be used to verify the appearance-training procedure.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
assets		assets
configs		configs
datapipeline		datapipeline
scripts		scripts
src		src
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reconstructing Hand-Object Interactions with 3D Gaussian Splatting

Overview

Conclusion

Local Installation

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Reconstructing Hand-Object Interactions with 3D Gaussian Splatting

Overview

Conclusion

Local Installation

About

Resources

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages