Skip to content

Eval issue on lift_box #7

@cosdeneb

Description

@cosdeneb

Hi, thank you for releasing the Tabletop-Sim codebases.

I am trying to reproduce the Tabletop-Sim evaluation protocol, and I noticed a potential issue with the Lift Box task. When I replay/evaluate the provided dataset demonstrations for lift_box using the current environment success criterion, the success rate appears to be 0%, even though the box is visibly lifted in the rollout videos.

The same replay/evaluation check works as expected for the other Tabletop tasks, so this seems specific to Lift Box. I was wondering whether this behavior is expected, or whether there may be a mismatch between the provided Lift Box demonstrations and the current success condition/evaluation code.

In particular, it seems possible that the box position is correct, but the orientation-related success condition is not satisfied. Could you please confirm the intended success criterion for Lift Box, and whether the released dataset demonstrations should achieve non-zero success when replayed in the current environment?

Thank you very much for your help.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions