Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Tokenize Using Regex | Text Preprocessing Fundamentals
Introduction to NLP

Tokenize Using RegexTokenize Using Regex

Task

Given a string named message, convert it lowercase, then tokenize it into words using regular expression tokenization and the corresponding nltk class. A word is a sequence of only alphanumeric characters (letters and numbers). '#Conference2023!', for example, contains one word: Conference2023.

Everything was clear?

Section 1. Chapter 6
toggle bottom row
course content

Course Content

Introduction to NLP

Tokenize Using RegexTokenize Using Regex

Task

Given a string named message, convert it lowercase, then tokenize it into words using regular expression tokenization and the corresponding nltk class. A word is a sequence of only alphanumeric characters (letters and numbers). '#Conference2023!', for example, contains one word: Conference2023.

Everything was clear?

Section 1. Chapter 6
toggle bottom row
some-alt