Research foundation for multimodal creative systems: single-stream audio, video, and text modeling for future workflow experiments.
pytorch transformer vae creative-ai multimodal video-generation self-attention audio-generation generative-ai ai-video flash-attention creative-workflows single-stream
-
Updated
Apr 12, 2026 - Python