CPU offloading / low-VRAM support

This was originally contributed in #97. That was not merged at the time because we had not agreed on what it should entail, but I think we have a clearer picture now:
- Wherever possible, when loading with World Engine or Quark, avoid materialising intermediate results on the GPU.
- Scene Authoring's models should be resident on the CPU wherever possible. I think the primary beneficiaries of this would be the VLM and the text encoder for Flux.
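
As a rough illustration of the second point, here is a minimal sketch of keeping a model resident on the CPU and moving it to the GPU only while it is in use. Everything named here is hypothetical and not taken from the codebase: `on_device` is an illustrative helper, and `StubModel` is a stand-in that only mimics the `.to(device)` method of a `torch.nn.Module` so the sketch runs without a GPU.

```python
from contextlib import contextmanager


class StubModel:
    """Hypothetical stand-in for a torch.nn.Module; only mimics `.to(device)`."""

    def __init__(self, name, device="cpu"):
        self.name = name
        self.device = device

    def to(self, device):
        # A real module would copy its weights here; the stub just records the target.
        self.device = device
        return self


@contextmanager
def on_device(model, device="cuda", home="cpu"):
    """Temporarily move `model` to `device`, then return it to `home`.

    The move back happens in `finally`, so the model is returned to `home`
    even if the body of the `with` block raises.
    """
    model.to(device)
    try:
        yield model
    finally:
        model.to(home)


# Assumed usage: both models start on the CPU and stay there when idle.
text_encoder = StubModel("text_encoder")
vlm = StubModel("vlm")

with on_device(text_encoder) as enc:
    device_during_use = enc.device  # the encoder is on the GPU only inside this block

device_after_use = text_encoder.device  # back on the CPU once the block exits
```

With real PyTorch modules, the same pattern would probably also want a call to `torch.cuda.empty_cache()` after the move back, so that the freed memory is actually returned to the driver. Whether that is worth the cost per call is something to measure rather than assume.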

This should make it possible to use WP-1.5-1B with Scene Authoring fairly reliably on an 8 GB VRAM card.

CPU offloading / low-VRAM support #137

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

CPU offloading / low-VRAM support #137

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions