Skip to content

ranked reward alernatives? #24

@pstjohn

Description

@pstjohn

Might be worth exploring our own versions of ranked rewards. I don't expect that paper is a canonical standard at this point, so there might be further optimizations we could look into.

Do loss rewards need to be -1? Might be easier to make a loss 0 and win 1.

A couple ideas:

  • pareto front optimization
    Here, we'd just check to see if a molecule is pareto-optimal, i.e., whether its dominated by any other recent solution currently in the game buffer.
    select count(*) > 0 from game_table where buried_vol > %s and max_spin < %s and atom_type = %s

  • percentile scoring
    Similar to ranked rewards but perhaps less sparse, just return 0->1 rewards depending on the percentile score of the game w.r.t a game buffer

  • anything else from the literature?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions