Dear:
Thanks for your excellent work firstly. But I found an issue when I compared V3D with the SV3D. Both of the two papers removed the motion_bucket_id as well as the fps_id, which are irrelevant. In the SV3D_u configuration, it did so with the adm_in_channels = 256.

But when referred to the V3D_512 configuration, the adm_in_channels is set 768 and I found the fps_id as well as the motion_bucket_id is set to 1 and 300 in the inference script.
So I wonder why the model is not consistent with the paper. If I do not remove the motion_bucket_id as well as the fps_id, how should I set them during training?
If convenient, could you help me solve this issue?
Dear:
Thanks for your excellent work firstly. But I found an issue when I compared V3D with the SV3D. Both of the two papers removed the motion_bucket_id as well as the fps_id, which are irrelevant. In the SV3D_u configuration, it did so with the adm_in_channels = 256.
But when referred to the V3D_512 configuration, the adm_in_channels is set 768 and I found the fps_id as well as the motion_bucket_id is set to 1 and 300 in the inference script.
So I wonder why the model is not consistent with the paper. If I do not remove the motion_bucket_id as well as the fps_id, how should I set them during training?
If convenient, could you help me solve this issue?