Skip to content

omics-ai/ctrlact

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Embodied Agent Interface Challenge @ NeurIPS 2025

This repo contains the CtrlAct solution for the Embodied Agent Interface Challenge at NeurIPS 2025.

Challenge Overview

The competition includes four main tasks:

  1. Goal Interpretation: Understanding objectives and grounding them in environmental states.
  2. Subgoal Decomposition: Breaking complex goals into actionable steps.
  3. Action Sequencing: Planning coherent action sequences.
  4. Transition Modeling: Predicting environment state changes caused by actions.

CtrlAct Solution

The goal of CtrlAct is to evaluate the performance of open-source models on the Embodied Agent Interface benchmark and analyze the performance gap between these models and top-ranked systems.

The following open-source models were evaluated:

vLLM Setup

conda create -n ctrlact python=3.12
conda activate ctrlact
pip install vllm==0.11.0 --extra-index-url https://download.pytorch.org/whl/cu128
pip install transformers==4.57.1
pip install flashinfer-python==0.4.1
pip install scikit-learn matplotlib pandas

Harware

  • 4 NVIDIA H100 GPUs
  • 8 NVIDIA L40S GPUs

SFT Using Tinker

We used Tinker for supervised fine-tuning (SFT) experiments as part of our evaluation pipeline.

We thank the Tinker team for providing free credits that supported our large-scale model experiments.


Changelog

  • 2025-12-07: Technical report released.
  • 2026-04-30: GitHub repository updated.
  • 2026-05-03: vLLM inference code released.

About

CtrlAct: Grounding Embodied LLM Agents to Bridge the Gap between Instructions and Actions. NeurIPS 2025 Competition for Embodied Agent

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Contributors

Languages