UR Robotic Arm with Robotiq 2-Finger Gripper for ROS 2

Related Blog Post: For behind-the-scenes details and the full development journey, check out the companion Medium article: How I'm Building an Autonomous Pick-and-Place System with ROS 2 Jazzy and Gazebo Harmonic

The blog dives into simulation setup, robotic control, MoveIt Task Constructor, and lessons learned — perfect if you're curious about the engineering side or want to replicate the project from scratch.

This project integrates the Robotiq 2-Finger Gripper with a Universal Robots UR3 arm using ROS 2 Humble and Gazebo Harmonic. It includes URDF models, ROS 2 control configuration, simulation launch files, MoveIt Task Constructor pick-and-place, vision-based object detection, LLM-driven task planning (Ollama), and demonstration recording for behavior cloning.

Demo

Installation

Make sure you have ROS 2 Humble and Gazebo Harmonic (gz-sim 8.x) installed. Ignition Fortress (ign gazebo / gz-sim 6) will not work — the world file and bridge packages are Harmonic-specific.

1. Clone the Repository

git clone https://github.com/darshmenon/UR3_ROS2_PICK_AND_PLACE.git
cd UR3_ROS2_PICK_AND_PLACE

2. Install ROS Dependencies

# Set to humble or jazzy
export ROS_DISTRO=humble

sudo apt install ros-$ROS_DISTRO-rviz2 \
                 ros-$ROS_DISTRO-joint-state-publisher \
                 ros-$ROS_DISTRO-robot-state-publisher \
                 ros-$ROS_DISTRO-ros2-control \
                 ros-$ROS_DISTRO-ros2-controllers \
                 ros-$ROS_DISTRO-controller-manager \
                 ros-$ROS_DISTRO-joint-trajectory-controller \
                 ros-$ROS_DISTRO-position-controllers \
                 ros-$ROS_DISTRO-gz-ros2-control \
                 ros-$ROS_DISTRO-ros2controlcli \
                 ros-$ROS_DISTRO-moveit \
                 ros-$ROS_DISTRO-moveit-ros-perception \
                 ros-$ROS_DISTRO-simple-grasping \
                 ros-$ROS_DISTRO-cv-bridge \
                 ros-$ROS_DISTRO-tf2-ros \
                 ros-$ROS_DISTRO-tf2-geometry-msgs \
                 ros-$ROS_DISTRO-pcl-ros

Jazzy only — add these two extra packages:
sudo apt install ros-jazzy-ros-gz-sim ros-jazzy-ros-gz-bridge \
                 ros-jazzy-moveit-planners-stomp
STOMP is not packaged for Humble so leave it out there — the planner init fails silently and is harmless.

3. Install Python Dependencies

pip3 install -r requirements.txt
# Ollama is required for the LLM planner:
# Install from https://ollama.com
# Then pull your preferred model:
ollama pull llama2:latest

4. Build the Workspace

colcon build --symlink-install
source install/setup.bash

MoveIt Task Constructor Setup

This project supports MoveIt Task Constructor (MTC) for advanced pick-and-place planning.

This repo already includes a patched MTC source in src/moveit_task_constructor/ that works for both ROS 2 Humble and Jazzy — no extra cloning needed. Just build normally:

colcon build --symlink-install

MongoDB (required for warehouse_ros_mongo)

MTC uses warehouse_ros_mongo to persist planning scenes and trajectories. MongoDB must be installed and running before launching the demo:

curl -fsSL https://www.mongodb.org/static/pgp/server-7.0.asc | \
  sudo gpg -o /usr/share/keyrings/mongodb-server-7.0.gpg --dearmor

echo "deb [ arch=amd64,arm64 signed-by=/usr/share/keyrings/mongodb-server-7.0.gpg ] https://repo.mongodb.org/apt/ubuntu jammy/mongodb-org/7.0 multiverse" | \
  sudo tee /etc/apt/sources.list.d/mongodb-org-7.0.list

sudo apt-get update && sudo apt-get install -y mongodb-org
sudo systemctl start mongod && sudo systemctl enable mongod

Verify it is running: mongosh should connect to mongodb://127.0.0.1:27017.

For Humble/Jazzy API differences and troubleshooting, see ur_mtc_pick_place_demo/README.md.

Launch Instructions

Full MTC Pick-and-Place Demo

bash ur_mtc_pick_place_demo/scripts/robot.sh

Launches Gazebo + MoveIt + planning scene server + MTC demo in sequence.

Launch Full Simulation in Gazebo

# Default — Robotiq 2F-85
ros2 launch ur_gazebo ur.gazebo.launch.py

# Robotiq 2F-140
ros2 launch ur_gazebo ur.gazebo.launch.py gripper:=robotiq_2f_140

# OnRobot RG2
ros2 launch ur_gazebo ur.gazebo.launch.py gripper:=onrobot_rg2

# OnRobot RG6
ros2 launch ur_gazebo ur.gazebo.launch.py gripper:=onrobot_rg6

Supported Grippers

Gripper	Arg	Actuated joint	Mimic joints
Robotiq 2F-85	`robotiq_2f_85`	`finger_joint`	5
Robotiq 2F-140	`robotiq_2f_140`	`finger_joint`	5
OnRobot RG2	`onrobot_rg2`	`gripper_joint`	5
OnRobot RG6	`onrobot_rg6`	`gripper_joint`	5

All four grippers use position_controllers/GripperActionController for the single commanded joint. Mimic joints are state-only — Gazebo Harmonic enforces the <mimic> constraints at the physics level.

Verify Controllers After Launch

Controllers take ~40 s to spawn. Run this to confirm all three are active:

ros2 control list_controllers

Expected output (same for all grippers):

arm_controller[joint_trajectory_controller/JointTrajectoryController] active
gripper_controller[position_controllers/GripperActionController] active
joint_state_broadcaster[joint_state_broadcaster/JointStateBroadcaster] active

Command the Gripper from CLI

Robotiq (2F-85 / 2F-140) — finger_joint range 0.0 (open) → 0.8 (closed):

ros2 action send_goal /gripper_controller/gripper_cmd \
  control_msgs/action/GripperCommand \
  "{command: {position: 0.5, max_effort: 50.0}}"

OnRobot (RG2 / RG6) — gripper_joint range 0.0 (open) → 1.3 (closed):

ros2 action send_goal /gripper_controller/gripper_cmd \
  control_msgs/action/GripperCommand \
  "{command: {position: 0.65, max_effort: 50.0}}"

Launch Point Cloud Viewer (Gazebo + RViz)

bash ur_mtc_pick_place_demo/scripts/pointcloud.sh

Launch RViz Visualization (UR3 + Gripper)

ros2 launch ur_description view_ur.launch.py ur_type:=ur3

Launch Gripper Visualization Alone

ros2 launch robotiq_2finger_grippers robotiq_2f_85_gripper_visualization/launch/test_2f_85_model.launch.py

Move the Arm from CLI

ros2 action send_goal /arm_controller/follow_joint_trajectory control_msgs/action/FollowJointTrajectory \
'{
  "trajectory": {
    "joint_names": [
      "shoulder_pan_joint",
      "shoulder_lift_joint",
      "elbow_joint",
      "wrist_1_joint",
      "wrist_2_joint",
      "wrist_3_joint"
    ],
    "points": [
      {
        "positions": [0.0, -1.57, 1.57, 0.0, 1.57, 0.0],
        "time_from_start": { "sec": 2, "nanosec": 0 }
      }
    ]
  }
}'

Run Arm-Gripper Automation Script

python3 ~/UR3_ROS2_PICK_AND_PLACE/ur_system_tests/scripts/arm_gripper_loop_controller.py

Full Autonomous Pipeline

full_demo.launch.py brings up the entire stack — Gazebo, MoveIt, perception, grasp detection, and a selectable autonomous brain — in a single command.

source install/setup.bash

# LLM planner (Ollama, send commands via /llm_planner/command):
ros2 launch ur_gazebo full_demo.launch.py brain:=llm

# Trained SAC policy (auto-reads object position from perception):
ros2 launch ur_gazebo full_demo.launch.py brain:=rl \
  model_path:=ur_rl_training/models/checkpoints/<run>/best_model.zip

# OpenVLA end-to-end vision-language-action:
ros2 launch ur_gazebo full_demo.launch.py brain:=openvla \
  task:="pick the red block and place it in the bin"

# Perception + grasp only (no autonomous control):
ros2 launch ur_gazebo full_demo.launch.py brain:=none

Startup sequence: Gazebo + MoveIt → perception (60 s) → grasp (62 s) → brain (65 s).

Pipeline Data Flow

Camera/Depth  →  ur_perception  →  /detected_objects  →  LLM planner
                                                       →  RL policy (auto object tracking)
PointCloud2   →  ur_grasp       →  /ur_grasp/grasp_pose → RL policy (overrides perception)
Camera        →  OpenVLA        →  /arm_controller/joint_trajectory

Grasp Detection (ur_grasp)

Estimates grasp poses from the Intel D435 point cloud. Two backends:

Backend	Method	Dependency
simple_grasping (primary)	PCL RANSAC → `moveit_msgs/Grasp[]`	`ros-$ROS_DISTRO-simple-grasping`
numpy centroid (fallback)	Colour HSV filter + centroid + height	built-in

ros2 launch ur_grasp grasp_detection.launch.py colour:=red
python3 testing/test_grasp.py --colour red --execute

Standalone Robot Control GUI

python3 ur_system_tests/scripts/gui.py

UR3 Reinforcement Learning (SAC)

Trains a Soft Actor-Critic (SAC) policy in MuJoCo and deploys it to Gazebo. The policy learns to reach, grasp, lift, and place a cube using the UR3 + Robotiq 2F-85.

Features:

VecNormalize observation and reward normalisation for stable training
4-phase curriculum: reach → grasp → lift → place; auto-advances to full task once eval reward ≥ 400
Phase-distribution and success-rate metrics logged to TensorBoard every eval interval
Domain randomisation: object mass, friction, size (±20%), observation noise, joint jitter

Train:

cd ur_rl_training
python3 scripts/train.py --timesteps 3000000
# Resume from checkpoint (loads vecnormalize.pkl automatically):
python3 scripts/train.py --resume models/checkpoints/<run>/best_model

Best model and normalisation stats saved to ur_rl_training/models/checkpoints/<run>/.

View policy in Gazebo:

# Terminal 1 — Gazebo + MoveIt:
source install/setup.bash
ros2 launch ur_gazebo ur.gazebo.launch.py world_file:=rl_policy_demo.world

# Terminal 2 — RL policy node:
source install/setup.bash
ros2 launch ur_rl_training rl_policy.launch.py \
  model_path:=ur_rl_training/models/checkpoints/<run>/best_model.zip

Optional launch parameters:

Parameter	Default	Description
`action_scale`	`0.1`	Joint delta per step (increase for faster motion, e.g. `0.4`)
`step_dt`	`0.01`	Trajectory point duration in seconds
`control_rate_hz`	`100.0`	Policy inference rate
`object_x/y/z`	`0.35/0.0/0.045`	Object position fallback (auto-overridden by `/detected_objects` or `/ur_grasp/grasp_pose` when running)
`drop_x/y/z`	`0.35/0.20/0.02`	Drop zone position
`phase`	`1.0`	Curriculum phase (0=reach, 1=grasp, 2=lift, 3=place)

Headless evaluation:

python3 ur_rl_training/scripts/eval_headless.py \
  --model ur_rl_training/models/checkpoints/<run>/best_model.zip \
  --episodes 20

ACT — Action Chunking Transformers (`ur_act`)

ur_act trains an ACT policy on demonstrations recorded by ur_data_collector. Instead of predicting one action at a time (like the BC policy), ACT predicts a chunk of future actions per step and blends overlapping predictions with temporal ensemble — giving smoother, more temporally consistent motion.

Architecture:

ResNet18 visual backbone → spatial image tokens
CVAE encoder (training only) → style latent z
Transformer decoder (image tokens + joint token + z) → action chunk of length k
At inference: z = 0, temporal ensemble blends overlapping chunks

Train:

# Record demonstrations first with ur_data_collector, then:
python3 ur_act/scripts/train_act.py \
  --data_dir ~/ur3_demos \
  --output_dir ~/act_policy \
  --chunk_size 10 \
  --epochs 100

Arg	Default	Description
`chunk_size`	`10`	Actions predicted per step (2 s at 5 Hz)
`kl_weight`	`10.0`	CVAE KL term weight
`d_model`	`256`	Transformer hidden dim
`freeze_backbone`	off	Freeze ResNet18 during training

Full Gazebo workflow — collect → train → deploy:

# Step 1: Launch Gazebo + MoveIt + data collector
ros2 launch ur_gazebo ur.gazebo.launch.py
ros2 launch ur_data_collector data_collector.launch.py

# Step 2: Record demos using the MTC pick-place script (repeat N times)
ros2 service call /data_collector/start_recording std_srvs/srv/Trigger {}
bash ur_mtc_pick_place_demo/scripts/robot.sh          # runs one pick-place
ros2 service call /data_collector/stop_recording  std_srvs/srv/Trigger {}

# Step 3: Train
python3 ur_act/scripts/train_act.py \
  --data_dir ~/ur3_demos \
  --output_dir ~/act_policy \
  --chunk_size 10 --epochs 100

# Step 4: Deploy via full_demo (ACT as the brain)
ros2 launch ur_gazebo full_demo.launch.py \
  brain:=act \
  act_model_path:=~/act_policy/best_act_policy.pt

Contributing

Pull requests and issues are welcome, especially around simulation stability, transfer learning, and perception-to-action integration.

Future Scope

Improve MuJoCo-to-Gazebo transfer so learned grasping policies behave more consistently on the UR3 with the Robotiq gripper.
Fine-tune OpenVLA on collected UR3 demonstrations for better sim-to-real performance.
Add multi-object handling so the RL policy and LLM planner can sequence picks across several targets.
Real robot deployment — swap Gazebo hardware interface for the live UR3 driver and test trained policies on hardware.

Work in Progress

The following features are actively being developed and are not yet fully integrated.

Grasp Detection (`ur_grasp`)

Point-cloud grasp estimation for tabletop objects from the Intel D435 depth stream.

Verified in this workspace:

package imports successfully after source install/setup.bash
installed executable: ros2 run ur_grasp grasp_node

Launch:

source install/setup.bash
ros2 run ur_grasp grasp_node

# Or with optional args (colour filter and backend):
ros2 launch ur_grasp grasp_detection.launch.py colour:=red backend:=auto

Trigger one detection:

ros2 service call /ur_grasp/detect std_srvs/srv/Trigger {}

Healthy signs:

advertises /ur_grasp/detect
subscribes to /camera_head/depth/color/points
publishes /ur_grasp/grasp_pose
publishes /ur_grasp/grasp_marker for RViz
falls back to the built-in numpy centroid detector if simple_grasping is not installed
warns and returns no grasp if a point cloud has not arrived yet

Vision-Based Perception (`ur_perception`)

Color-based object detection with optional YOLO and PCL cluster extraction from the Intel D435 camera.

Launch:

source install/setup.bash
ros2 launch ur_perception perception.launch.py

Watch detections:

ros2 topic echo /detected_objects

Run the node directly:

source install/setup.bash
ros2 run ur_perception object_detector_node.py

Verified in this workspace:

package imports successfully after source install/setup.bash
installed executable: ros2 run ur_perception object_detector_node.py

Healthy signs:

publishes detected objects on /detected_objects
publishes annotated images on /detection_image
publishes collision objects on /planning_scene
waits for /camera_head/color/image_raw, /camera_head/depth/image_rect_raw, and /camera_head/camera_info
warns and keeps color detection enabled if use_yolo:=true is set but ultralytics is missing

LLM Task Planner (`ur_llm_planner`)

Natural-language task planning backed by a local Ollama model and connected to perception plus the MoveIt/gripper execution path.

Verified in this workspace:

package imports successfully after source install/setup.bash
installed executable: ros2 run ur_llm_planner llm_planner_node.py
command topic exists in code at /llm_planner/command
planner converts text into a JSON task list and passes it to MotionExecutor

Launch:

source install/setup.bash
ros2 run ur_llm_planner llm_planner_node.py

Or use the launch file:

source install/setup.bash
ros2 launch ur_llm_planner llm_planner.launch.py

Send a text instruction:

ros2 topic pub --once /llm_planner/command std_msgs/msg/String \
  "{data: 'pick up the red object and place it to the left of the robot'}"

Healthy signs:

subscribes to /detected_objects
listens on /llm_planner/command
asks Ollama for a JSON task plan
executes actions like move_to_named_pose, pick, place, open_gripper, and close_gripper
retries up to 2 times on execution failure, sending failure context back to the LLM for a simpler re-plan
warns and returns an empty task list if Ollama is not available at http://localhost:11434
may plan successfully but fail execution if MoveIt or gripper action servers are unavailable

Ollama setup:

ollama serve
ollama pull llama3.2:3b
ros2 launch ur_llm_planner llm_planner.launch.py ollama_model:=llama3.2:3b

Name		Name	Last commit message	Last commit date
Latest commit History 247 Commits
assets		assets
docs		docs
moveit_config		moveit_config
mujoco_ur_rl_ros2		mujoco_ur_rl_ros2
onrobot_description		onrobot_description
robotiq_2f_85_gripper_visualization		robotiq_2f_85_gripper_visualization
robotiq_description		robotiq_description
src		src
testing		testing
ur_act		ur_act
ur_data_collector		ur_data_collector
ur_description		ur_description
ur_gazebo		ur_gazebo
ur_grasp		ur_grasp
ur_interfaces		ur_interfaces
ur_llm_planner		ur_llm_planner
ur_moveit_demos		ur_moveit_demos
ur_mtc_demos		ur_mtc_demos
ur_mtc_pick_place_demo		ur_mtc_pick_place_demo
ur_perception		ur_perception
ur_rl_training		ur_rl_training
ur_smolvla		ur_smolvla
ur_sorting_demo		ur_sorting_demo
ur_system_tests		ur_system_tests
ur_visual_servo		ur_visual_servo
ur_voice_cmd		ur_voice_cmd
ur_web_dashboard		ur_web_dashboard
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
launch_headless.sh		launch_headless.sh
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

UR Robotic Arm with Robotiq 2-Finger Gripper for ROS 2

Demo

Installation

1. Clone the Repository

2. Install ROS Dependencies

3. Install Python Dependencies

4. Build the Workspace

MoveIt Task Constructor Setup

MongoDB (required for warehouse_ros_mongo)

Launch Instructions

Full MTC Pick-and-Place Demo

Launch Full Simulation in Gazebo

Supported Grippers

Verify Controllers After Launch

Command the Gripper from CLI

Launch Point Cloud Viewer (Gazebo + RViz)

Launch RViz Visualization (UR3 + Gripper)

Launch Gripper Visualization Alone

Move the Arm from CLI

Run Arm-Gripper Automation Script

Full Autonomous Pipeline

Pipeline Data Flow

Grasp Detection (ur_grasp)

Standalone Robot Control GUI

UR3 Reinforcement Learning (SAC)

ACT — Action Chunking Transformers (ur_act)

Contributing

Future Scope

Work in Progress

Grasp Detection (ur_grasp)

Vision-Based Perception (ur_perception)

LLM Task Planner (ur_llm_planner)

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

ACT — Action Chunking Transformers (`ur_act`)

Grasp Detection (`ur_grasp`)

Vision-Based Perception (`ur_perception`)

LLM Task Planner (`ur_llm_planner`)

Packages