I edited tokenize.py
and in main called
tokenizer=SpacyTokenize()
to use the Spacy Tokenizer for English text. Tho I always end up getting a :
memory error on running on Google Colab.
Thoughts on how I can use the English tokenizer for my dataset? Or for the English dataset dailydialoguttr_lines.txt, how do you run the code for the GSM model? @zll17
I edited tokenize.py
and in main called
to use the Spacy Tokenizer for English text. Tho I always end up getting a :
memory error on running on Google Colab.
Thoughts on how I can use the English tokenizer for my dataset? Or for the English dataset
dailydialoguttr_lines.txt, how do you run the code for the GSM model? @zll17