4. Basic Text Pre-processing of text data

4.1 Punctuation removal: - it doesn’t add any extra information while treating text data. Therefore, it will help us reduce t size of the training data. (Here, we make use of the ‘startswith’ function because hashtags (or mentions) always appear at the beginning of a word.).in below output, all the punctuation, including ‘#’ ‘@’,emoji, has been removed from the training data.

4.2 Stopwords removal

We know, stop words (or commonly occurring words) should be removed from the text data