Skip to content

Custom Attention Mask #249

@PaulCouairon

Description

@PaulCouairon

Hi,

I am trying to use a custom attention mask (for example masking out scores below a certain threshold). The flex-attention backend seems to be suited for this but I am not fully sure to understand how to pass a score_mod function as an argument.

In addition, is #231 compatible with flex-attention backend?

Thank you in advance!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions