Reward function on both aesthetic and prompt alignment

Hi, 

Wonderful project, congratulations!

Have you tried using a single reward function that covers both objectives (aesthetics and prompt alignment)? It feels like the aesthetic reward does make the generations look better, but it also oversimplifies them (a single subject posed in the center with a blur effect around it).

Also, do you have any insights on how to use a new reward function? Should it be normalized to a certain min-max range? How long does it take to train each model? Does it change between the different reward functions?
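To make the question concrete, here is a minimal sketch of the kind of combined reward I have in mind. Everything in it is an assumption on my side and not this project's API: the function names `zscore` and `combined_reward`, the `weight` parameter, and the choice of per-batch standardization (instead of min-max scaling) are purely illustrative, and the scores in the example are made-up numbers.

```python
from statistics import mean, pstdev
from typing import List, Sequence


def zscore(values: Sequence[float], eps: float = 1e-8) -> List[float]:
    """Standardize a batch of rewards to zero mean and (roughly) unit variance."""
    mu = mean(values)
    sigma = pstdev(values)  # population standard deviation of the batch
    return [(v - mu) / (sigma + eps) for v in values]


def combined_reward(
    aesthetic: Sequence[float],
    alignment: Sequence[float],
    weight: float = 0.5,
) -> List[float]:
    """Weighted sum of two standardized reward signals (hypothetical helper).

    `weight` is the share given to the aesthetic score; the remainder goes
    to the prompt-alignment score. Each input holds one score per image.
    """
    if len(aesthetic) != len(alignment):
        raise ValueError("both reward lists must have one score per image")
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must be in [0, 1]")
    a = zscore(aesthetic)
    p = zscore(alignment)
    return [weight * x + (1.0 - weight) * y for x, y in zip(a, p)]


if __name__ == "__main__":
    # Made-up scores on deliberately different scales.
    aesthetic_scores = [5.1, 6.3, 4.8, 7.0]
    alignment_scores = [0.21, 0.35, 0.30, 0.18]
    print(combined_reward(aesthetic_scores, alignment_scores, weight=0.5))
```

Standardizing each signal before mixing is only one option; the point is that two scores on different scales need to be brought to a comparable range, otherwise the larger one dominates the sum.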

Thanks :) 

Reward function on both aesthetic and prompt alignment #12