Hi, thank you for releasing the Tabletop-Sim codebases.
I am trying to reproduce the Tabletop-Sim evaluation protocol, and I noticed a potential issue with the Lift Box task. When I replay/evaluate the provided dataset demonstrations for lift_box using the current environment success criterion, the success rate appears to be 0%, even though the box is visibly lifted in the rollout videos.
The same replay/evaluation check works as expected for the other Tabletop tasks, so this seems specific to Lift Box. I was wondering whether this behavior is expected, or whether there may be a mismatch between the provided Lift Box demonstrations and the current success condition/evaluation code.
In particular, it seems possible that the box position is correct, but the orientation-related success condition is not satisfied. Could you please confirm the intended success criterion for Lift Box, and whether the released dataset demonstrations should achieve non-zero success when replayed in the current environment?
Thank you very much for your help.
Hi, thank you for releasing the Tabletop-Sim codebases.
I am trying to reproduce the Tabletop-Sim evaluation protocol, and I noticed a potential issue with the Lift Box task. When I replay/evaluate the provided dataset demonstrations for
lift_boxusing the current environment success criterion, the success rate appears to be 0%, even though the box is visibly lifted in the rollout videos.The same replay/evaluation check works as expected for the other Tabletop tasks, so this seems specific to Lift Box. I was wondering whether this behavior is expected, or whether there may be a mismatch between the provided Lift Box demonstrations and the current success condition/evaluation code.
In particular, it seems possible that the box position is correct, but the orientation-related success condition is not satisfied. Could you please confirm the intended success criterion for Lift Box, and whether the released dataset demonstrations should achieve non-zero success when replayed in the current environment?
Thank you very much for your help.