Hi, I'd like to do the process on raw videos but found that the length of raw videos and labels (downloaded from the official Breakfast dataset) are different from the extracted I3D features and the corresponding labels.
Did you do any preprocessing on the raw videos before extracting features?
Hi, I'd like to do the process on raw videos but found that the length of raw videos and labels (downloaded from the official Breakfast dataset) are different from the extracted I3D features and the corresponding labels.
Did you do any preprocessing on the raw videos before extracting features?