After training on around 1,000,000 videos, if the outcome is not as expected, try to use GPT-3.5 instead of BERT
After training on around 1,000,000 videos, if the outcome is not as expected, try to use GPT-3.5 instead of BERT