Hi! I run the resize.sh to prepare the dataset, but I find that the prepared dataset does not follow the normal definition of resolution.
For example, in "text_data/infograph_256", many of the pictures are way bigger than 256x256, and the texts are hard to recognize, there is no way for VAEs to reconstruct such data and get scores.
Can you provide the resized subset (e.g. on huggingface)? I wonder if I miss some steps.
Hi! I run the resize.sh to prepare the dataset, but I find that the prepared dataset does not follow the normal definition of resolution.
For example, in "text_data/infograph_256", many of the pictures are way bigger than 256x256, and the texts are hard to recognize, there is no way for VAEs to reconstruct such data and get scores.
Can you provide the resized subset (e.g. on huggingface)? I wonder if I miss some steps.