Skip to content
This repository was archived by the owner on Aug 6, 2025. It is now read-only.
This repository was archived by the owner on Aug 6, 2025. It is now read-only.

Reproducing Issue on DiT-S/2 #122

@xiwen1

Description

@xiwen1

Hi, thanks for your great work!

I tried to reproduce the performance of DiT-S/2 on 256×256 resolution using this repository. My training settings followed both the configuration provided in the README and the default arguments in the training script, i.e.:
• Global batch size = 256
• Learning rate = 1e-4

However, after 400k steps, the FID of the model was still far from the reported 68.4 in the paper. Even after 4000k steps, I was only able to achieve an FID of 79.59 on the test set.

For FID evaluation, I followed the paper’s settings:
• cfg=1.0
• vae=mse

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions