A plain script to compare and count tokens for OpenAI models
OpenAI Token Counter is a simple Python script designed to provide insight into how various languages tokenize compared to English.
To run the script, you need the following Python libraries to be properly installed:

- `tiktoken`
- `rich`
- `tabulate`

To install these prerequisites, execute:

```
pip install tiktoken rich tabulate
```
- Clone the Repository
- Execute the Script:

  ```
  python GPTTokencounter.py
  ```

- Language Selection:
  - The program kicks off with a brief overview of its purpose.
  - You'll then be prompted to either:
    - `[a]` Default (English and Bangla)
    - `[b]` Custom

    Choose the default for a comparison between English and Bangla, or opt for a custom language pair.
- Provide Required Inputs:
  - For the custom option, you'll need to specify the ISO language code, an English word or sentence, and its counterpart in the chosen language.
- Examine the Token Comparison:
  - The next step showcases a table contrasting the tokenization of the English phrase with the selected language, leveraging the `gpt-3.5-turbo` model. For example:
| Language | English | Bangla |
|---|---|---|
| Sentence | I speak Bengali | আমি বাংলায় কথা কই |
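Under the hood, the comparison amounts to encoding each sentence and counting the resulting tokens. The snippet below is a minimal sketch of that idea rather than the script's actual code; the variable names, example phrases, and table layout are illustrative, and it assumes `tiktoken` and `tabulate` are installed as described above.

```python
# Minimal sketch: count tokens for an English/Bangla sentence pair with tiktoken.
# Variable names and phrases are illustrative, not the script's actual code.
import tiktoken
from tabulate import tabulate

model_name = "gpt-3.5-turbo"  # same default model the script uses
encoding = tiktoken.encoding_for_model(model_name)

sentences = {
    "English": "I speak Bengali",
    "Bangla": "আমি বাংলায় কথা কই",
}

rows = []
for language, sentence in sentences.items():
    token_ids = encoding.encode(sentence)  # list of integer token ids
    rows.append([language, sentence, len(token_ids)])

print(tabulate(rows, headers=["Language", "Sentence", "Tokens"], tablefmt="github"))
```

Because the underlying vocabulary is heavily weighted toward English text, the Bangla sentence typically splits into noticeably more tokens than its English counterpart, which is exactly the gap the table highlights. Swapping `model_name` for another OpenAI model changes which encoding `tiktoken` selects.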
- The `tiktoken` library is the backbone, facilitating token counts based on the model.
- The default model in use is `gpt-3.5-turbo`. Should you wish to experiment with others, adjust the `model_name` variable within the `main()` function.
- The `display_word_parameters()` function integrates word wrapping, ensuring legibility for lengthier inputs; a rough sketch of the idea follows below.
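For long sentences, that wrapping keeps table cells readable. The fragment below only sketches the general idea using the standard library, assuming a fixed column width; the helper name, width, and example text are illustrative and not taken from `display_word_parameters()` itself.

```python
# Sketch of wrapping a long cell so the comparison table stays legible.
# This approximates the idea behind display_word_parameters(); it is not its actual code.
import textwrap

def wrap_cell(text: str, width: int = 40) -> str:
    """Break a long sentence into lines no wider than `width` characters."""
    return "\n".join(textwrap.wrap(text, width=width))

print(wrap_cell("This is a deliberately long English sentence used to show how a wide table cell can be wrapped."))
```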
We welcome feedback! If you come across any hiccups, do reach out through GitHub.
