Proposal: eloquence HQ

I'm not asking you to implement this yourself; it's something I've been working on with the help of AI (Gemini and ChatGPT). Before creating a pull request, I'd like to hear whether you think it's worth integrating. It's an upsampling algorithm with a smoothing filter that makes Eloquence sound clearer without the harsh artefacts of simple upsampling. Instead of writing it in pure Python, I wrote the algorithm in C as an external library (DLL) and call it from the Python side with ctypes for best performance. The result is a voice with clearer "s" sounds that is still smooth. I used 44 kHz instead of 22 kHz because it makes filtering easier. The input is still the 11 kHz signal; it's my external upsampling that does the trick.
And don't worry: although practically everything was coded using AI, I have verified and tested it myself, and it works exactly as it should. There are no glitches.
I think the best approach is to introduce the sample rate combo box that the IBMTTS driver already has and, in place of the 22 kHz option, offer my smooth 44 kHz option. If we do it that way, it would be great if you could port the sample rate setting over first; once that's done, I can send you my upsampling code. What do you think?
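
For readers who want a rough picture of the general technique described above, here is a minimal pure-Python sketch of upsampling by zero-stuffing followed by a windowed-sinc low-pass filter. It is illustration only and not the actual C library: the function names `design_lowpass` and `upsample`, the factor of 4 (11 kHz to 44 kHz), the 63 taps, the Hann window, and the cutoff are all assumptions chosen to keep the example short.

```python
import math


def design_lowpass(num_taps, cutoff):
    """Windowed-sinc low-pass FIR filter (hypothetical helper).

    cutoff is a fraction of the OUTPUT sample rate, between 0 and 0.5.
    num_taps is assumed to be odd so the filter has a centre tap.
    """
    mid = (num_taps - 1) / 2
    taps = []
    for n in range(num_taps):
        x = n - mid
        # Ideal low-pass impulse response (a sinc), with the x == 0 limit.
        if x == 0:
            h = 2 * cutoff
        else:
            h = math.sin(2 * math.pi * cutoff * x) / (math.pi * x)
        # Hann window to tame the ripple of the truncated sinc.
        w = 0.5 - 0.5 * math.cos(2 * math.pi * n / (num_taps - 1))
        taps.append(h * w)
    total = sum(taps)
    return [t / total for t in taps]  # normalise to unity gain at DC


def upsample(samples, factor=4, num_taps=63, cutoff=None):
    """Upsample by an integer factor (assumed parameters, see lead-in)."""
    if cutoff is None:
        # Input Nyquist frequency expressed in output-rate units.
        cutoff = 0.5 / factor
    taps = design_lowpass(num_taps, cutoff)

    # Zero-stuffing: place each input sample, then factor - 1 zeros.
    # Scaling by factor compensates for the energy lost to the zeros.
    stuffed = [0.0] * (len(samples) * factor)
    for i, s in enumerate(samples):
        stuffed[i * factor] = s * factor

    # Direct-form FIR convolution removes the spectral images that
    # zero-stuffing creates above the original Nyquist frequency.
    out = []
    for n in range(len(stuffed)):
        acc = 0.0
        for k, t in enumerate(taps):
            j = n - k
            if 0 <= j < len(stuffed):
                acc += t * stuffed[j]
        out.append(acc)
    return out


if __name__ == "__main__":
    result = upsample([1.0] * 50)
    print(len(result))  # four output samples per input sample
```

A nested loop like this is very slow in pure Python for live speech, which is consistent with doing the filtering in a C library and calling it through ctypes.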

Proposal: eloquence HQ #88

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Proposal: eloquence HQ #88

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions