Challenge: Tokenizing Using Regex
Task
Given a string named message, convert it to lowercase, then tokenize it into words using regular expression tokenization and the corresponding nltk class. A word is a sequence of only alphanumeric characters (letters and numbers). '#Conference2023!', for example, contains one word: Conference2023.
Solution
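The original solution is not reproduced on this page, so the following is only a minimal sketch of one way to meet the task. It assumes that the "corresponding nltk class" is RegexpTokenizer from nltk.tokenize, that "alphanumeric" means ASCII letters and digits, and it uses a made-up sample value for message.

```python
from nltk.tokenize import RegexpTokenizer

# Hypothetical sample input; the exercise supplies its own `message`.
message = "Join us at #Conference2023! Early-bird tickets: 50% off."

# Step 1: convert the text to lowercase.
lowered = message.lower()

# Step 2: build a tokenizer whose pattern matches runs of letters and digits.
# Assumption: ASCII only. A pattern such as r"\w+" would also match underscores.
tokenizer = RegexpTokenizer(r"[A-Za-z0-9]+")

# Step 3: every non-overlapping match of the pattern becomes one token.
tokens = tokenizer.tokenize(lowered)

print(tokens)
# ['join', 'us', 'at', 'conference2023', 'early', 'bird', 'tickets', '50', 'off']
```

Because the pattern describes the tokens themselves rather than the gaps between them, punctuation such as '#', '!', '-' and '%' is dropped, and 'early-bird' is split into two words.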