Path: blob/master/data/twitter_sentiment.ipynb
2923 views
Kernel: Python [conda env:ml4t] *
Download Twitter Data for Sentiment Analysis
We use a dataset that contains 1.6 million training and 350 test tweets from 2009 with algorithmically assigned binary positive and negative sentiment scores that are fairly evenly split.
In [9]:
Download and unzip
Download the data from here.
The following code downloads and extracts the content of the compressed file and stores it in 'sentiment140', while renaming the content as follows:
training.1600000.processed.noemoticon.csv
totrain.csv
, andtestdata.manual.2009.06.14.csv
totest.csv
In [2]:
In [3]:
In [5]:
In [8]: