BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects

BundleSDF for Push Anything

Step 1: take a video of the object. You will be prompted to draw one mask of the object. Please put the object on the table at the end, in its upright position, and the object axis should be aligned with the world coordinate axis. Please specify the name for the object here.

python live_demo.py --name A_shape

Step 2: run BundleSDF. This step also reorients the reconstructed mesh and fill up any holes. object_name should be the same as "name" in step 1.

python run_custom.py --use_segmenter 1 --use_gui 1 --debug_level 2 --object_name A_shape

python bundlesdf/run_nerf.py --ref_view_dir /home/yufeiyang/Documents/BundleSDF/arm_data/ --name A_shape

Step 3: run FoundationPose. The object state will be published to the "OBJECT_{object_name}_STATE_SIMULATION" channel in lcm. If you are working with LCM, you can do single/multi object tracking by running:

python auto_tracking.py

If you are NOT working with LCM and want to do single object tracking, run:

python fpTracking.py --object_name A_shape

If you are NNOT working with LCM and want to do multi-objects tracking, run:

python camera_memory.py

and then for each object you wish to track, do:

python fpTracking_share3.py --object_name A_shape

Docker setup

BundleSDF setup: Do the Docker/Environment setup below (Optiional) These are required for my 4049 GPU. Do this if you have error message like `GLIBCXX_3.4.29' not found

apt-get update
apt-get install -y software-properties-common
add-apt-repository ppa:ubuntu-toolchain-r/test
apt-get update
apt-get install -y libstdc++6

Install XMem:

git clone https://github.com/yufeiyg/XMem.git
cd XMem (where your XMem is cloned)
pip install requirements.txt
pip install pyrealsense2
pip install yacs

Install FoundationPose. They provided the requirement.txt, but installing everything in the requirement.txt messed up the BundleSDF packages

git clone https://github.com/yufeiyg/FoundationPose.git
pip install fvcore==0.1.5.post20221221
python -m pip install --quiet --no-cache-dir git+https://github.com/NVlabs/nvdiffrast.git
python -m pip install --quiet --no-index --no-cache-dir pytorch3d -f https://dl.fbaipublicfiles.com/pytorch3d/packaging/wheels/py39_cu118_pyt200/download.html

BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects

This is an implementation of our paper published in CVPR 2023

[Arxiv] [Project page] [Supplemental video]

Abstract

We present a near real-time method for 6-DoF tracking of an unknown object from a monocular RGBD video sequence, while simultaneously performing neural 3D reconstruction of the object. Our method works for arbitrary rigid objects, even when visual texture is largely absent. The object is assumed to be segmented in the first frame only. No additional information is required, and no assumption is made about the interaction agent. Key to our method is a Neural Object Field that is learned concurrently with a pose graph optimization process in order to robustly accumulate information into a consistent 3D representation capturing both geometry and appearance. A dynamic pool of posed memory frames is automatically maintained to facilitate communication between these threads. Our approach handles challenging sequences with large pose changes, partial and full occlusion, untextured surfaces, and specular highlights. We show results on HO3D, YCBInEOAT, and BEHAVE datasets, demonstrating that our method significantly outperforms existing approaches.

Bibtex

@InProceedings{bundlesdfwen2023,
author        = {Bowen Wen and Jonathan Tremblay and Valts Blukis and Stephen Tyree and Thomas M\"{u}ller and Alex Evans and Dieter Fox and Jan Kautz and Stan Birchfield},
title         = {{BundleSDF}: {N}eural 6-{DoF} Tracking and {3D} Reconstruction of Unknown Objects},
booktitle     = {CVPR},
year          = {2023},
}

Data download

Download pretrained weights of segmentation network, and put it under ./BundleTrack/XMem/saves/XMem-s012.pth
Download pretrained weights of LoFTR outdoor_ds.ckpt, and put it under ./BundleTrack/LoFTR/weights/outdoor_ds.ckpt
Download HO3D data. We provide the augmented data that you can download here. Then download YCB-Video object models from here. Finally, make sure the structure is like below, and update your root path of HO3D_ROOT at the top of BundleTrack/scripts/data_reader.py
```
HO3D_v3
  ├── evaluation
  ├── models
  └── masks_XMem
```

Docker/Environment setup

Build the docker image (this only needs to do once and can take some time).

cd docker
docker build --network host -t nvcr.io/nvidian/bundlesdf .

Start a docker container the first time

cd docker && bash run_container.sh

# Inside docker container, compile the packages which are machine dependent
bash build.sh

Run on your custom data

Prepare your RGBD video folder as below (also refer to the example milk data). You can find an example milk data here for testing.

root
  ├──rgb/    (PNG files)
  ├──depth/  (PNG files, stored in mm, uint16 format. Filename same as rgb)
  ├──masks/       (PNG files. Filename same as rgb. 0 is background. Else is foreground)
  └──cam_K.txt   (3x3 intrinsic matrix, use space and enter to delimit)

Due to license issues, we are not able to include XMem in this codebase for running segmentation online. If you are interested in doing so, please download the code separately and add a wrapper in segmentation_utils.py.

Run your RGBD video (specify the video_dir and your desired output path). There are 3 steps. Note we assume the max relevant depth in the demo data <1. If this is not the case for you, change it here

# 1) Run joint tracking and reconstruction. 
python run_custom.py --mode run_video --video_dir /home/bowen/debug/2022-11-18-15-10-24_milk --out_folder /home/bowen/debug/bundlesdf_2022-11-18-15-10-24_milk --use_segmenter 1 --use_gui 1 --debug_level 2

# 2) Run global refinement post-processing to refine the mesh
python run_custom.py --mode global_refine --video_dir /home/bowen/debug/2022-11-18-15-10-24_milk --out_folder /home/bowen/debug/bundlesdf_2022-11-18-15-10-24_milk   # Change the path to your video_directory

# 3) (Optional) If you want to draw the oriented bounding box to visualize the pose, similar to our demo
python run_custom.py --mode draw_pose --out_folder /home/bowen/debug/bundlesdf_2022-11-18-15-10-24_milk

Finally the results will be dumped in the out_folder, including the tracked poses stored in ob_in_cam/ and reconstructed mesh with texture textured_mesh.obj.

Run on HO3D dataset

# Run BundleSDF to get the pose and reconstruction results
python run_ho3d.py --video_dirs /mnt/9a72c439-d0a7-45e8-8d20-d7a235d02763/DATASET/HO3D_v3/evaluation/SM1 --out_dir /home/bowen/debug/ho3d_ours

# Benchmark the output results
python benchmark_ho3d.py --video_dirs /mnt/9a72c439-d0a7-45e8-8d20-d7a235d02763/DATASET/HO3D_v3/evaluation/SM1 --out_dir /home/bowen/debug/ho3d_ours

Acknowledgement

We would like to thank Jeff Smith for helping with the code release. Marco Foco and his team for providing the test data on the static scene.

Contact

For questions, please contact Bowen Wen (bowenw@nvidia.com)

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
.vscode		.vscode
BundleTrack		BundleTrack
assets		assets
docker		docker
lcm_sys		lcm_sys
media		media
mycuda		mycuda
.gitattributes		.gitattributes
.gitignore		.gitignore
3_shape_video.obj		3_shape_video.obj
A_shape_new.obj		A_shape_new.obj
A_shape_video.obj		A_shape_video.obj
B_shape_video.obj		B_shape_video.obj
C_shape_new_video.obj		C_shape_new_video.obj
C_shape_texture.obj		C_shape_texture.obj
D_shape_texture.obj		D_shape_texture.obj
D_shape_video.obj		D_shape_video.obj
E_shape_video.obj		E_shape_video.obj
F_shape_texture.obj		F_shape_texture.obj
G_shape_video.obj		G_shape_video.obj
H_shape_texture.obj		H_shape_texture.obj
H_shape_video.obj		H_shape_video.obj
I_shape_texture.obj		I_shape_texture.obj
I_shape_video.obj		I_shape_video.obj
LICENSE.txt		LICENSE.txt
N_shape_video.obj		N_shape_video.obj
O_shape_texture.obj		O_shape_texture.obj
P_shape_texture.obj		P_shape_texture.obj
P_shape_video.obj		P_shape_video.obj
R_shape_texture.obj		R_shape_texture.obj
R_shape_video.obj		R_shape_video.obj
S_shape_texture.obj		S_shape_texture.obj
S_shape_video.obj		S_shape_video.obj
T_small_video.obj		T_small_video.obj
U_shape_texture.obj		U_shape_texture.obj
U_shape_video.obj		U_shape_video.obj
Utils.py		Utils.py
XMem_config.yaml		XMem_config.yaml
Y_shape_video.obj		Y_shape_video.obj
arm_cam.py		arm_cam.py
auto_rotate.py		auto_rotate.py
auto_rotate_mesh.obj		auto_rotate_mesh.obj
auto_tracking.py		auto_tracking.py
baby_toy.obj		baby_toy.obj
benchmark_ho3d.py		benchmark_ho3d.py
blue_bin.obj		blue_bin.obj
book.obj		book.obj
build.sh		build.sh
bundlesdf.py		bundlesdf.py
calibration.py		calibration.py
camera.obj		camera.obj
camera_memory.py		camera_memory.py
chicken_broth.obj		chicken_broth.obj
clamp.obj		clamp.obj
coffee_mate.obj		coffee_mate.obj
complete_mesh copy.py		complete_mesh copy.py
complete_mesh.py		complete_mesh.py
config.yml		config.yml
cuda-keyring_1.1-1_all.deb		cuda-keyring_1.1-1_all.deb
cuda-keyring_1.1-1_all.deb.1		cuda-keyring_1.1-1_all.deb.1
cus.py		cus.py
dunkin_box.obj		dunkin_box.obj
egg_carton.obj		egg_carton.obj
eraser.obj		eraser.obj
expo_box.obj		expo_box.obj
file_utils.py		file_utils.py
fpTracking.py		fpTracking.py
fpTracking_multi.py		fpTracking_multi.py
fpTracking_share.py		fpTracking_share.py
fpTracking_share2.py		fpTracking_share2.py
fpTracking_share3.py		fpTracking_share3.py
fpTracking_share4.py		fpTracking_share4.py
fpTracking_try.py		fpTracking_try.py
franka_gripper.obj		franka_gripper.obj
gallon_milk.obj		gallon_milk.obj
glasses_box.obj		glasses_box.obj
gui.py		gui.py
live_demo.py		live_demo.py
loftr_wrapper.py		loftr_wrapper.py
lotion.obj		lotion.obj
material.mtl		material.mtl
material_0.png		material_0.png
mesh_flat.mtl		mesh_flat.mtl
mesh_flat.obj		mesh_flat.obj
mesh_post_processing.py		mesh_post_processing.py
mesh_sharpened.obj		mesh_sharpened.obj
mesh_with_bottom.obj		mesh_with_bottom.obj
milk.obj		milk.obj
milk_new.obj		milk_new.obj
mug_new.obj		mug_new.obj
mug_video.obj		mug_video.obj
nerf_helpers.py		nerf_helpers.py
nerf_runner.py		nerf_runner.py
offscreen_renderer.py		offscreen_renderer.py
pen.obj		pen.obj
plus_video.obj		plus_video.obj
process_depth.py		process_depth.py
readme.md		readme.md
reconstruction.sh		reconstruction.sh
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BundleSDF for Push Anything

Docker setup

BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects

Abstract

Bibtex

Data download

Docker/Environment setup

Run on your custom data

Run on HO3D dataset

Acknowledgement

Contact

About

Uh oh!

Releases

Packages

Contributors 5

Uh oh!

Languages

License

yufeiyg/PushTracking

Folders and files

Latest commit

History

Repository files navigation

BundleSDF for Push Anything

Docker setup

BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects

Abstract

Bibtex

Data download

Docker/Environment setup

Run on your custom data

Run on HO3D dataset

Acknowledgement

Contact

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Uh oh!

Languages

Packages