Replies: 4 comments 7 replies
-
|
My two cents:
Again, this is just about priorities/focus. I'm not against it, but I'd rather have a kick-ass WebGPU framework with no cpu support than an ok-ish webgpu framework with cpu support. Also I think that things like 4-bit gguf and performance improvements are more important. 2,3,5) 👍
|
Beta Was this translation helpful? Give feedback.
-
|
Thanks @FL33TW00D for the huge effort! I think that having a full Webgpu ML framework is awesome 🚀. IMO, the GGUF support alone is amazing. This is one of those quality-of-life improvement that will make people choose ratchet for sure. My one cent on the improvements: |
Beta Was this translation helpful? Give feedback.
-
|
I like what Ratchet achieved so far! The way the GGUF files are loaded is amazing! |
Beta Was this translation helpful? Give feedback.
-
|
Beta Was this translation helpful? Give feedback.

Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Ratchet as a POC has proved pretty good. A few of the highlights that I am proud of to date:
There's a few things we can be less than proud of:
We need an architecture to overcome these limitations. We need to build an abstraction on top of WGSL that provides us the tooling that Apple has with Metal. Easy parameterisation over dtypes, workgroup sizes and more is essential. Furthermore, we need to be able to apply as many sensible fusions as possible.
A WebGPU kernel could be represented in Rust as follows:
Given that we are specialised to WebGPU, we should leverage the fact that
nagaalready parses kernels into a nice AST. Once we have this AST, should be able to handle the following case at a minimum:We intelligently merge bindings, combine globals, and explore the AST for the final
storein the GEMM kernel, and modify the AST to wrap the finalstorewith the GELU call.These are just the ideas that i've had kicking around. Would love input from the kind folks who have been giving me pointers along the way.
@sigma-andex
@ivarflakstad
@philpax
@AmineDiro
Beta Was this translation helpful? Give feedback.
All reactions