fix: use default whisper temperature values for temperature 0.0 like openai api#615
fix: use default whisper temperature values for temperature 0.0 like openai api#615dotmobo wants to merge 1 commit into
Conversation
ea0d2d1 to
14bd565
Compare
|
+1 for this fix. We're running into the exact same issue with a German CT2 model ( Symptoms: A 1:21 minute audio recording is only transcribed up to ~30 seconds. The remaining segments are silently dropped. Debug logs show: Because Workaround: We're currently using a This is a significant issue for non-English models where confidence scores are naturally lower. Would love to see this merged! |
- aarch64-darwin works - debloat docs... - merge dependencies that are not important in one file - make it multilingual, faster_whisper handles it - default to Python 3.12, but 3.13 and 3.14 also work - Includes PR speaches-ai#609, speaches-ai#610, and speaches-ai#615
- aarch64-darwin works - debloat docs... - merge dependencies that are not important in one file - make it multilingual, faster_whisper handles it - default to Python 3.12, but 3.13 and 3.14 also work - Includes PR speaches-ai#609, speaches-ai#610, and speaches-ai#615
- aarch64-darwin works - debloat docs... - merge dependencies that are not important in one file - make it multilingual, faster_whisper handles it - default to Python 3.12, but 3.13 and 3.14 also work - Includes PR speaches-ai#609, speaches-ai#610, and speaches-ai#615
- aarch64-darwin works - debloat docs... - merge dependencies that are not important in one file - make it multilingual, faster_whisper handles it - default to Python 3.12, but 3.13 and 3.14 also work - Includes PR speaches-ai#609, speaches-ai#610, and speaches-ai#615
- aarch64-darwin works - debloat docs... - merge dependencies that are not important in one file - make it multilingual, faster_whisper handles it - default to Python 3.12, but 3.13 and 3.14 also work - Includes PR speaches-ai#609, speaches-ai#610, and speaches-ai#615
|
Thanks for submitting this pr |
Hi,
It's an alternative version of #553 made for the latest v0.9.0 RC.
So, like the OpenAI API, the default temperature remains 0.0. But OpenAI made some temperature changes behind this, cf: https://developers.openai.com/api/reference/resources/audio/subresources/transcriptions/methods/create
Then we can see that the default Whisper temperature values are a list of temperatures, cf: https://whisper-api.com/docs/transcription-options/#sampling-temperature
So, I think that OpenAI transforms the temperature 0.0 to [0.0, 0.2, 0.4, 0.6, 0.8, 1.0] to activate that "automatically increase the temperature until certain thresholds are hit" feature.
Here we are doing the same. So it will fix the bug in #553.
Regards,
Morgan