Skip to content

Bundle tokenizer entrypoint as ExecuTorch delegate#20173

Closed
JacobSzwejbka wants to merge 2 commits into
pytorch12from
js/tokenizer-delegate-entrypoint
Closed

Bundle tokenizer entrypoint as ExecuTorch delegate#20173
JacobSzwejbka wants to merge 2 commits into
pytorch12from
js/tokenizer-delegate-entrypoint

Conversation

@JacobSzwejbka

Copy link
Copy Markdown
Contributor

Summary:

  • add a manual ExecuTorch tokenize execution plan backed by TokenizerBackend
  • bundle tokenizer bytes as inline delegate data and wire LLM export to append the entrypoint
  • add runtime buffer loading for tokenizers and string input rebinding

Submodule:

Test Plan:

  • cmake --build cmake-out --target extension_llm_runner -j8
  • python -m py_compile extension/llm/export/tokenizer_delegate.py extension/llm/export/test/test_tokenizer_delegate.py
  • python -m pytest extension/llm/export/test/test_tokenizer_delegate.py -q could not collect in this environment because local torchao imports expect torch.ops.higher_order.templated_attention

Notes:

  • commit hook was bypassed because lintrunner_adapters is missing locally (python: No module named lintrunner_adapters).

@pytorch-bot

pytorch-bot Bot commented Jun 10, 2026

Copy link
Copy Markdown

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20173

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 10, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant