Skip to content

Data preprocessing issues #2

@tjayada

Description

@tjayada

I'm trying to reproduce the results and ran into some trouble for the data preprocessing steps.

The first issue is the missing ego4d_gaze_untracked.csv file, which luckily can easily be found your older project in the GLC data directory, but should probably be added into this repository as well.

The second issue concerns the two missing audio files.
The two files you made available for download at https://drive.google.com/drive/folders/1iZuuRiflog9AazCtLXa9PbIYg-S3vENs are the original 1 hour audios and thus the code in preprocess.py in lines 259 & 260 with

command = f'cp {os.path.join(os.path.dirname(data_path), "missing_audio/*")} {save_path}' subprocess.call(command)

are simply copying the complete 1-hour audios into the directory with the already clipped audios.
Is there a script that you have used to clip the missing audios first as well, which didn't get committed or did you maybe accidentally uploaded the complete audios instead of the clipped versions?
Of course its also possible that I made an error and in that case please let me know as well.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions