Hi ROLL team,
Since support for Qwen3.5 has been added, could you please provide an official example for the Sokoban agentic pipeline (similar to the existing qwen2.5-vl-3B-agentic one)?
Specifically, a reference script like run_agentic_pipeline_sokoban.sh optimized for Qwen3.5 would be extremely helpful for the community to:
- Verify the integration of the new model in RL environments.
- Have a baseline configuration for training/evaluating Qwen3.5 on agentic tasks.
It would be great if this could be included in the examples/ directory.
Thanks!