Skip to content

[FR] Run save_images in parallel to speed-up preprocessing #3

@vfdev-5

Description

@vfdev-5

@samedii thanks for making this template example running with python 3.6 ! That's great as I can run it now inside Google Colab.

I'd like to ask another FR to improve UX with the template code, namely to run image preprocessing part in multi-processing way to speed up the data preparation:
https://github.com/Aiwizo/template-nvae/blob/d8a7470dcb1dec59dad6ca5299e25e82ace2f9ac/prepare.py#L17-L19

For example, naively it can be done using joblib which could create a pool of workers to process the dataset by chunks.
What do you think ?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions