data preprocessing

I'm just wondering what type of data preprocessing for SIF embedding I need to do for the sentences. For example,
1) do I need to remove punctuations? In the example, sentences don't have punctuations.
2) should I tokenize negations?
3) what other preprocessing needs to be done?
Thanks a lot!!