kyutai

moshi Public

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 10.1k 944

pocket-tts Public

A TTS that fits in your CPU (and pocket)

Python 4.1k 455

delayed-streams-modeling Public

Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

Python 2.9k 303

hibiki Public

Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation, where one waits for the end of the source utterance to start translating, H…

Rust 1.5k 117

unmute Public

Make text LLMs listen and speak

Python 1.3k 222

moshi-finetune Public

Python 440 61

