4.1 Punctuation removal: - Punctuation doesn’t add any extra information when processing text data, so removing it helps us reduce the size of the training data.
(Here, we make use of the ‘startswith’ function because the ‘#’ of a hashtag, or the ‘@’ of a mention, always appears at the beginning of a word.) In the output below, all the punctuation, including ‘#’, ‘@’ and emoji, has been removed from the training data.
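
As a rough illustration of the idea, here is a minimal sketch of one way this step could be written. It is not necessarily the exact code used to produce the output: the function name `remove_punctuation` and the `drop_tagged` option are hypothetical, and it assumes that words are separated by whitespace and that "punctuation" means any character that is not a letter or a digit (which also covers emoji).

```python
def remove_punctuation(text, drop_tagged=False):
    """Strip punctuation, '#', '@' and emoji from a piece of text.

    Hypothetical helper for illustration only. If drop_tagged is True,
    whole hashtags and mentions are discarded instead of keeping the word.
    """
    cleaned_tokens = []
    for token in text.split():
        # Hashtags and mentions carry their symbol at the start of the word,
        # so startswith is enough to detect them.
        if token.startswith(("#", "@")):
            if drop_tagged:
                continue
            token = token[1:]
        # Keep only letters and digits; this drops punctuation and emoji.
        token = "".join(ch for ch in token if ch.isalnum())
        if token:
            cleaned_tokens.append(token)
    return " ".join(cleaned_tokens)


sample = "Loving the #sunshine today, @anna! 😀"
print(remove_punctuation(sample))                    # Loving the sunshine today anna
print(remove_punctuation(sample, drop_tagged=True))  # Loving the today
```

Whether to keep the word behind a hashtag or drop it entirely depends on the task; the sketch keeps it by default because hashtags often carry useful meaning.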