Introduces course topics: - Extract, transform and select (ETS) - Vocabulary extraction - Sparse representation - Converting a document into a vector Introduces a key tool: - Count vectorizer extractor - Frequency vocabulary size parameter - Choosing between using token counts or token occurrences