Our repository is organized into multiple directories for different aspects of robotic surgery research:
Our scene reconstruction and pose analytics pipeline follows these key stages:
- **YOLO-pose Finetuning**: Initial model training on the SurgPose dataset to establish foundational pose-recognition capabilities, followed by refinement of the pretrained model on SurgVU pose annotations (Northwell Physicians Group, Encord; contact liam.mchugh@columbia.edu) for domain-specific adaptation.
- **Monocular Depth Finetuning**: Using calibrated stereo-vision inference as annotations, Metric3D can be finetuned for laparoscopic surgery, improving the performance of depth-integrated kinematics reconstruction on monocular video datasets. Code for finetuning and depth inference (monocular via Metric3D, stereo via NVLabs FoundationStereo) can be found in the `depth_recon` subdirectory.
- **Kinematic Inference**:
  - Core pose detection: extraction of key instrument positions and orientations
  - Optional enhancements:
    - Stereo/monocular depth inference for enhanced spatial awareness
    - SAM instrument masking to constrain x/y and especially depth projections
- **Kinematic Clustering**: Analysis of movement patterns to identify surgical gestures and techniques
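The stereo annotations used in the depth-finetuning stage come from the standard pinhole relation Z = f·B/d. A minimal sketch of that conversion, with illustrative (not calibrated) focal length and baseline values:

```python
import numpy as np

# Illustrative calibration constants -- real values come from stereo calibration.
FOCAL_PX = 1000.0   # focal length in pixels (assumed)
BASELINE_M = 0.005  # stereo baseline in meters (assumed)

def disparity_to_depth(disparity: np.ndarray) -> np.ndarray:
    """Convert a disparity map (pixels) to metric depth via Z = f * B / d.

    Zero or negative disparities are marked as infinitely far / invalid.
    """
    depth = np.full_like(disparity, np.inf, dtype=np.float64)
    valid = disparity > 0
    depth[valid] = FOCAL_PX * BASELINE_M / disparity[valid]
    return depth

disp = np.array([[50.0, 100.0],
                 [0.0, 25.0]])
print(disparity_to_depth(disp))  # 0.1 m, 0.05 m; inf for the invalid pixel; 0.2 m
```

Dense depth maps produced this way from a stereo pipeline can then serve as pseudo-ground-truth for finetuning a monocular model.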
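For the clustering stage, one simple approach is to summarize tool-tip trajectories with velocity statistics and group them. A toy sketch on synthetic data (both the features and the tiny k-means are illustrative, not the repository's actual method):

```python
import numpy as np

def extract_features(traj):
    """Summarize a 2D tool-tip trajectory by simple per-frame speed statistics."""
    vel = np.diff(traj, axis=0)          # frame-to-frame displacement
    speed = np.linalg.norm(vel, axis=1)  # per-frame speed
    return np.array([speed.mean(), speed.std(), speed.max()])

def two_means(X, iters=50):
    """Tiny k-means (k=2) with a deterministic init at the feature-norm extremes."""
    norms = np.linalg.norm(X, axis=1)
    centers = X[[np.argmin(norms), np.argmax(norms)]].copy()
    for _ in range(iters):
        labels = np.argmin(((X[:, None, :] - centers) ** 2).sum(-1), axis=1)
        for j in range(2):
            if np.any(labels == j):
                centers[j] = X[labels == j].mean(axis=0)
    return labels

# Two synthetic "gestures": slow smooth motion vs. fast jittery motion.
rng = np.random.default_rng(1)
slow = [np.cumsum(rng.normal(0, 0.01, (30, 2)), axis=0) for _ in range(5)]
fast = [np.cumsum(rng.normal(0, 0.5, (30, 2)), axis=0) for _ in range(5)]
X = np.vstack([extract_features(t) for t in slow + fast])
print(two_means(X))  # slow clips share one label, fast clips the other
```

Real gesture discovery would use richer features (3D positions from the depth stage, orientations, temporal windows), but the pipeline shape is the same: per-clip descriptors in, cluster labels out.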
This guide will help you set up the environment and run kinematic inference for this project.
```bash
# Initialize submodules
git submodule update --init --recursive

# Create environment from the provided YAML file
# Local machines (flexible torch/CUDA):
conda env create -f kinematics/kinematics_env_flexmachine.yml
# Cloud environments:
conda env create -f kinematics/kinematics_env.yml

# Activate the environment
conda activate kinematics
```

Pose Models (see XX for complete pose analytics report):
- surgvu finetunes: download `surgvu_finetune.zip`
- northwell finetune (yolo11m):

Extract and place the model files in `kinematics/models/`.
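Downstream of detection, per-frame keypoints can be reduced to instrument positions and orientations. A minimal sketch assuming a two-keypoint base/tip schema (hypothetical — the real keypoint layout depends on the finetuned model):

```python
import numpy as np

def instrument_pose(keypoints):
    """Reduce detected keypoints to a 2D tip position and shaft angle.

    Assumes keypoints[0] is the shaft base and keypoints[1] the tool tip;
    the actual keypoint schema depends on the finetuned pose model.
    """
    base = np.asarray(keypoints[0], dtype=float)
    tip = np.asarray(keypoints[1], dtype=float)
    angle_deg = np.degrees(np.arctan2(tip[1] - base[1], tip[0] - base[0]))
    return tip, angle_deg

tip, angle = instrument_pose([[100, 200], [150, 250]])
print(tip, angle)  # tip at (150, 250), shaft at 45 degrees in image coordinates
```

With depth available, the same reduction extends to 3D by back-projecting each keypoint through the camera intrinsics.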
```bash
python kinematics/kinematic_inference.py --input <input video> --save-video
```

For further questions, please refer to the project documentation or contact the maintainer.
