Hi Author,
We do not quite understand about the definition of the two learnable parameters [{\theta _1}] and [{\theta _2}] in Figure 4 in the following way:
|
cls_2_other = abs_pos_bias[:, 0, 0] |
|
other_2_cls = abs_pos_bias[:, 1, 1] |
We would appreciate it if you could explain it to us.
Hi Author,
We do not quite understand about the definition of the two learnable parameters [{\theta _1}] and [{\theta _2}] in Figure 4 in the following way:
TUPE/fairseq/modules/transformer_sentence_encoder.py
Line 234 in 4c64ff7
TUPE/fairseq/modules/transformer_sentence_encoder.py
Line 236 in 4c64ff7
We would appreciate it if you could explain it to us.