PoC: Speech Enhancement using DeepFilterNet #3819
Open
fkwp wants to merge 17 commits into
Just experimented with this a bit, and it works amazingly well: it completely eliminates keyboard sounds, squeaking chairs, noise from bumping into the mic, etc.
How is the CPU usage? Edit: Tested it myself; it only raised CPU usage by ~1.5% on my i7-10700F.
I didn't watch my CPU usage while testing, but you can try it yourself by clicking on "View Deployment" above :)
Content
The default speech enhancement (mainly noise reduction) algorithms of modern web browsers are speech-model based: they don't consume a lot of CPU cycles, but they are quite limited. At the other end of the CPU cycle spectrum, deep neural networks have evolved with impressive performance. On the FOSS side of things, RNNoise (https://github.com/xiph/rnnoise) and DeepFilterNet (https://github.com/Rikorose/DeepFilterNet) are two prominent examples.
DeepFilterNet is of particular interest, as it carries out not only noise reduction but also dereverberation, which is very beneficial in open setups.
Recently, a LiveKit plugin leveraging WASM has been written (https://github.com/mezonai/mezon-noise-suppression); it is used in this PoC.
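For context on the browser defaults mentioned above: the built-in processing is controlled through the standard `noiseSuppression`, `echoCancellation` and `autoGainControl` audio constraints passed to `getUserMedia`. The following is only a minimal sketch, not code from this PR. The helper name `buildAudioConstraints` and the choice to switch off the browser's own noise suppression when DeepFilterNet is active are assumptions made for illustration.

```typescript
// Hypothetical helper, not taken from this PR.
// Mirrors the standard audio processing constraints accepted by getUserMedia.
interface AudioProcessingConstraints {
  noiseSuppression: boolean;
  echoCancellation: boolean;
  autoGainControl: boolean;
}

function buildAudioConstraints(useDeepFilterNet: boolean): AudioProcessingConstraints {
  return {
    // Assumption: when the neural network handles noise, the browser's
    // built-in suppressor is turned off to avoid processing the signal twice.
    noiseSuppression: !useDeepFilterNet,
    // Echo cancellation and gain control are left to the browser in this sketch.
    echoCancellation: true,
    autoGainControl: true,
  };
}

// In a browser, the result would be passed on like this:
// navigator.mediaDevices.getUserMedia({ audio: buildAudioConstraints(true) });
```

Whether the browser's suppressor should really be disabled alongside DeepFilterNet is a design choice that would need testing; browsers may also ignore individual constraints they do not support.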
This PR adds a new option to Settings -> Audio
