I'm just wondering what type of data preprocessing for SIF embedding I need to do for the sentences. For example,
- do I need to remove punctuations? In the example, sentences don't have punctuations.
- should I tokenize negations?
- what other preprocessing needs to be done?
Thanks a lot!!
I'm just wondering what type of data preprocessing for SIF embedding I need to do for the sentences. For example,
Thanks a lot!!