Mandrill Tweet Analysis
Agenda
• Introduction
• Approach
– Removing Extraneous Punctuation
– Splitting on Spaces
– Counting Tokens and Calculating Probabilities
– Model Formulation and Usage
• Advantages of Naïve Bayes
Introduction
• Mandrill is a transactional e-mail
product for software developers
• Sends one-to-one e-mails such as receipts and
password resets
• Can track which e-mails have been
opened and viewed
Advantages
• Easy to implement
• Fast
• If the independence assumption
holds, it works more
efficiently than other algorithms
• Requires less training data
• Highly scalable
• Best suited for text classification
problems
Disadvantages
• Strong assumption that the
features are independent, which
is hardly true in real-life
applications
• Chances of loss of accuracy
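The steps listed in the agenda (removing punctuation, splitting on spaces, counting tokens, calculating probabilities, and using the model) can be sketched roughly as below. This is a minimal illustration, not the deck's actual implementation: the function names, the add-one smoothing, the "app"/"other" labels and the four training tweets are all assumptions made up for the example.

```python
import math
import string
from collections import Counter

def tokenize(tweet):
    # Lowercase, replace punctuation with spaces, then split on whitespace.
    table = str.maketrans(string.punctuation, " " * len(string.punctuation))
    return tweet.lower().translate(table).split()

def train(labeled_tweets):
    # Count tokens per class and tweets per class.
    counts, docs = {}, Counter()
    for text, label in labeled_tweets:
        docs[label] += 1
        counts.setdefault(label, Counter()).update(tokenize(text))
    vocab = {tok for c in counts.values() for tok in c}
    return counts, docs, vocab

def classify(text, model):
    counts, docs, vocab = model
    total_docs = sum(docs.values())
    scores = {}
    for label, tok_counts in counts.items():
        # Add-one smoothing (an assumption) keeps unseen tokens from zeroing a class.
        denom = sum(tok_counts.values()) + len(vocab)
        # Independence assumption: per-token log-probabilities are simply summed.
        score = math.log(docs[label] / total_docs)
        for tok in tokenize(text):
            score += math.log((tok_counts[tok] + 1) / denom)
        scores[label] = score
    return max(scores, key=scores.get)

# Hypothetical training tweets, invented purely for illustration.
training = [
    ("Mandrill API makes sending transactional email easy!", "app"),
    ("Using the Mandrill SMTP integration for password resets.", "app"),
    ("Saw a mandrill at the zoo today, what a colorful monkey", "other"),
    ("The mandrill is the largest monkey species", "other"),
]
model = train(training)
print(classify("mandrill email api", model))
print(classify("colorful monkey at the zoo", model))
```

Working in log space avoids numerical underflow when many small probabilities are multiplied. Summing the per-token terms is exactly where the independence assumption from the Disadvantages list enters.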
THANK YOU