Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
nlp
count
machine-learning
natural-language-processing
text-mining
practice
article
text-classification
word2vec
gensim
tf-idf
-
Updated
Dec 2, 2020 - Jupyter Notebook


After running
pip install movieboxon a Mac with Python 2.7 I get the following error when trying to run it: