Skip to content

Proposal: eloquence HQ #88

@hozosch

Description

@hozosch

It's not something I'm asking you to implement yourself, but rather something I've been working on with the help of AI (gemini and ChatGPT). I'd like to hear your opinion first on whether you think it's worth integrating or not, before creating a pull request. It's an upsampling algorithm with a smooth filter which produces a clearer sounding eloquence, but without the harsh artefacts of simple upsampling. Instead of writing it in pure Python, I wrote the algorithm in c as an external library (dll) and called it with ctypes on the Python side for best performance. The result is a voice which has clearer s sounds while still being smooth. I used 44 KHz instead of 22, because it makes filtering easier. The input itself still consists of the 11 KHz signal, it's my external upsampling which does the trick.
And don't worry, although practically everything was coded using AI, it's been verified and tested by me, and it's working exactly as it should. There are no glitches.
I think the best way is to introduce the sample rate combo box we have in the IBMTTS driver, and instead of the 22 KHz signal, include my smooth 44 KHz option. If we do it like this, it would be great if you could port the sample rate setting over, and then once that's done, I can send you my upsampling code. What do you think?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions