In order to obtain better results, some experimentation with the model setup should be done.
Try out the following:
- Increase the batch size (suggested: 8192); see that the batch size does not grow too large with respect to the dataset
- Check out the plateau learning rate ReduceLROnPlateau which reduces the learning rate when a selected metric stops improving
- Try adam optimizer instead of Adagrad.
In order to obtain better results, some experimentation with the model setup should be done.
Try out the following: