Skip to content

Ww/support gguf 3bit#36203

Closed
WeldonWangwang wants to merge 2 commits into
openvinotoolkit:masterfrom
WeldonWangwang:ww/support_gguf_3bit
Closed

Ww/support gguf 3bit#36203
WeldonWangwang wants to merge 2 commits into
openvinotoolkit:masterfrom
WeldonWangwang:ww/support_gguf_3bit

Conversation

@WeldonWangwang
Copy link
Copy Markdown
Contributor

Details:

  • item1
  • ...

Tickets:

  • ticket-id

AI Assistance:

  • AI assistance used: no / yes
  • If yes, summarize how AI was used and what human validation was performed (build/tests/manual checks).

- iq3_xxs_linear.hpp: Op declaration in ov::op::internal namespace
  Inputs: activation [M,K] + compressed_weights u8 blob
  Attributes: weight_shape [N,K], block_size=256, bytes_per_block=98
- iq3_xxs_linear.cpp: validate_and_infer_types, visit_attributes, clone
@github-actions github-actions Bot added the category: Core OpenVINO Core (aka ngraph) label Jun 3, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

category: Core OpenVINO Core (aka ngraph)

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant