Hello, I have a question regarding refinement. Refinement is carried out on the VQGAN after training with BAIRD dataset, correct? The idea is that a VQGAN is trained on BAIRD images, then the Refinement model will refine the model to reconstruct frames in a temporally consistent way. Then, once the previous two steps are done, the CFM regressor can be trained to generate sequences. Is that correct? Thanks.
Hello, I have a question regarding refinement. Refinement is carried out on the VQGAN after training with BAIRD dataset, correct? The idea is that a VQGAN is trained on BAIRD images, then the Refinement model will refine the model to reconstruct frames in a temporally consistent way. Then, once the previous two steps are done, the CFM regressor can be trained to generate sequences. Is that correct? Thanks.