18/05/2026
Turn text into numbers in seconds. CountVectorizer learns a vocabulary from your documents, then converts each document into a row of token counts (a document-term matrix). This is the classic bag-of-words representation used by many baseline NLP models like Naive Bayes and linear classifiers.