Training instructions #2

Hi Eric,

Great work! It is very impressive that DiffSpeaker achieves lower LVE and FDD while also running inference faster. It's also surprising to me that a VAE isn't needed to construct the latents for this diffusion model beforehand.

Will there be instructions for training DiffSpeaker on datasets other than VOCASET and BIWI? I'm trying to train it on a dataset I collected that is similar to VOCASET. Could you provide instructions, or some general guidance to point me in the right direction?

Thanks a lot in advance!
Leo
