Brendan Nee

Naive Tweets

April 16th, 2012

Naive Bayes Tweet Classifier in Node.js

Naive Tweets is an app that classifies tweets by language using a naive Bayes classifier. Tweets are pulled from the twitter streaming api and classified on the fly. Training the classifier for languages is automated using a list of specified location/language pairs to auto-classify by assumed languaged. For instance, tweets from Surakarta, Indonesia are all autoclassified as Indonesian. This provides a large amount of training data which allows the classifier to predict the language of a stream of incoming tweets.
I also created a front-end for viewing the tweet stream and seeing what each tweet is being classified as.
Visit Naive Tweets | View Source Code