Skip to content

feat: add configurable tokenizer to text splitters #2

Merged
mathieuripert merged 1 commit into
mainfrom
feat-text-splitter-chunk-size-counter
May 25, 2025
Merged

feat: add configurable tokenizer to text splitters #2
mathieuripert merged 1 commit into
mainfrom
feat-text-splitter-chunk-size-counter

Conversation

@mathieuripert
Copy link
Copy Markdown
Owner

No description provided.

…eld to CharacterTextSplitter and RecursiveCharacterTextSplitter - Replace hardcoded String.length() calls with configurable tokenizer function - Default tokenizer remains String.length/1 for backward compatibility - Update documentation to reflect token-based chunk sizing - Add tests demonstrating custom tokenizer functionality
@mathieuripert mathieuripert merged commit 62ab9dc into main May 25, 2025
1 of 2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant