Comprehensive project submitted for the completeion of B. Tech. degree.
Tasks performed:
- Performed augmentation on raw audio data to produce more samples
- Employed a pipeline to extract spectrograms from raw audio and then extract MFCCs
- Developed a model based on CNN and LSTM to predict the emotions