Hi, thanks for your contribution to audio source localization.
Now, I am trying to train your model FANC with VGGSound.
I have the VGGSound dataset but I do not know how to prepare the dataset like your 'prepare_flickr_soundnet.py'
It would be very appreciated if you share the preparation codes for VGGSound.
Also, are there any experimental results about utilizing Resnet50 or a bigger model, not Resnet18?
Thank you.
Hi, thanks for your contribution to audio source localization.
Now, I am trying to train your model FANC with VGGSound.
I have the VGGSound dataset but I do not know how to prepare the dataset like your 'prepare_flickr_soundnet.py'
It would be very appreciated if you share the preparation codes for VGGSound.
Also, are there any experimental results about utilizing Resnet50 or a bigger model, not Resnet18?
Thank you.