Thanks for your exciting work! I have two questions about the temporal feature aggregation:
- Does the AFFM use deformable attention, or only use original attention?
- Does the AFFM shared parameters? Suppose I have temporal BEV features at {T, T-1, T-2}, and I need 4 AFFM to finish the temporal aggregation as introduced in section3.2. Does these 4 AFFM shared parameters? Or I need to initiate 4 attention modules with different parameters?
Looking forward to your reply.
Thanks for your exciting work! I have two questions about the temporal feature aggregation:
Looking forward to your reply.