feat: add ONNX Runtime support and resolve pre-commit compliance issues #105

Open
jieyao-MilestoneHub wants to merge 5 commits into DEV from feat/llm-export

jieyao-MilestoneHub (Collaborator) commented on Nov 15, 2025

Changes

  • update pyproject.toml with ONNX Runtime dependencies
  • add onnxruntime-gpu, tokenizers to base dependencies
  • add llm-export optional dependency group

BREAKLOG: Determine how ONNX models are generated and how they should be used by both developers and end-users.

Related issue

implement: #100

fix: resolve pre-commit compliance issues
- update .pre-commit-config.yaml to ignore GHSA-f83h-ghpp-7wcc

- Restructure README.md with a clear installation comparison table
- Add three usage paths:
  - Option 1: TURU API Mode (NPU acceleration)
  - Option 2: ONNX Runtime (lightweight and fast)
  - Option 3: Full PyTorch installation (GPU / CPU)
- Add models/README.md to explain the model directory layout
- Enhance TURU configuration guide with collapsible sections
- add INT8 dynamic quantization to export_model.py with automatic validation
- create download_onnx_model.py for GitHub Releases distribution
- add release-model.yml workflow with split file support for >2GB models
- update provider_factory.py with auto-detection of any ONNX models
- update documentation with INT8 quantization details
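
The split file support mentioned above can be illustrated with a minimal sketch, assuming a model is cut into fixed-size parts before upload and rejoined after download with a checksum check. The function names, the `.partNNN` suffix, and the use of SHA-256 are assumptions made for illustration; the actual naming and verification used by `release-model.yml` and `download_onnx_model.py` are not shown in this PR description.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large models are never fully loaded into memory."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def split_file(src: Path, part_size: int) -> list[Path]:
    """Write src as numbered parts (hypothetical '<name>.partNNN' scheme).

    Note: each part is read into memory in one call, which keeps the sketch
    short; a production script would stream each part in smaller chunks.
    """
    parts: list[Path] = []
    with src.open("rb") as fh:
        index = 0
        while True:
            data = fh.read(part_size)
            if not data:
                break
            part = src.with_name(f"{src.name}.part{index:03d}")
            part.write_bytes(data)
            parts.append(part)
            index += 1
    return parts


def join_parts(parts: list[Path], dest: Path, expected_sha256: str) -> Path:
    """Concatenate parts in name order and verify the result against a checksum."""
    with dest.open("wb") as out:
        # Zero-padded indices make lexical order equal to numeric order.
        for part in sorted(parts):
            with part.open("rb") as fh:
                shutil.copyfileobj(fh, out)
    actual = sha256_of(dest)
    if actual != expected_sha256:
        dest.unlink()  # do not leave a corrupt model on disk
        raise ValueError(f"checksum mismatch: expected {expected_sha256}, got {actual}")
    return dest
```

In this sketch the uploader would publish the checksum of the original file alongside the parts, and the downloader would pass it to `join_parts` so that a missing or truncated part is detected before the model is loaded.
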

Fixes the bandit B110 security check by marking the exception handling as intentional for split file detection.
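
For context, bandit's B110 check flags `try`/`except`/`pass` blocks. One common way to mark such a block as intentional is a `# nosec B110` comment on the flagged line. The sketch below assumes that approach; the PR description does not say which mechanism was actually used, and `find_split_parts` and the `.part*` naming are hypothetical.

```python
from pathlib import Path


def find_split_parts(model_path: Path) -> list[Path]:
    """Return split parts found next to model_path, or [] if there are none."""
    parts: list[Path] = []
    try:
        parts = sorted(model_path.parent.glob(f"{model_path.name}.part*"))
    except OSError:  # nosec B110
        # Intentional: a directory that cannot be read is treated the same
        # as "no split parts", so the caller falls back to a single file.
        pass
    return parts
```

Keeping the explanation in a separate comment leaves the `nosec` marker itself unambiguous, and the reason for swallowing the exception stays visible to reviewers.
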

jieyao-MilestoneHub added the "enhancement" and "in process" labels on Nov 20, 2025
jieyao-MilestoneHub self-assigned this on Nov 20, 2025