Popular repositories Loading
-
-
delayed-streams-modeling
delayed-streams-modeling PublicKyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.
-
Repositories
- tts_longeval Public
kyutai-labs/tts_longeval’s past year of commit activity - moshi Public
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
kyutai-labs/moshi’s past year of commit activity - moshi-rag Public
MoshiRAG is a compact full-duplex speech language model augmented with asynchronous knowledge retrieval to improve factuality without sacrificing real-time interactivity.
kyutai-labs/moshi-rag’s past year of commit activity - flashy Public
Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fits. It handles checkpointing, logging, distributed, compatibility with Dora, and more!
kyutai-labs/flashy’s past year of commit activity - ovie Public
Official implementation and models for OVIE (One View Is Enough! Monocular Training for In-the-Wild Novel View Generation)
kyutai-labs/ovie’s past year of commit activity - dactory Public
kyutai-labs/dactory’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…