Separate Spam and Ham for WordCloud Visualization

Question

I am performing spam detection and want to visualize spam and ham keywords separately in Wordcloud. Here's my .csv file.

data = pd.read_csv("spam.csv",encoding='latin-1')
data = data.rename(columns = {"v1":"label", "v2":"message"})
data = data.replace({"spam":"1","ham":"0"})

Here's my code for WordCloud. I need help with spam_words. I cannot generate the right graph.

import matplotlib.pyplot as plt
from wordcloud import WordCloud 

spam_words = ' '.join(list(data[data['label'] == 1 ]['message']))
spam_wc = WordCloud(width = 512, height = 512).generate(spam_words)

plt.figure(figsize = (10,8), facecolor = 'k')
plt.imshow(spam_wc)
plt.axis('off')
plt.tight_layout(pad = 0)
plt.show()

Specifically what is wrong with your current output? Also, you posted the names of your csv files, but it would help if you posted the first few lines of the actual data. — Peter Leimbigler, Commented Apr 7, 2018 at 17:22
Hello @Peter, I want the spam_words variable to only take in messages that were labelled spam. Currently, it is taking in all the messages and showing me a combined wordcloud of spam and ham. — Prashant, Commented Apr 7, 2018 at 17:27
@PeterLeimbigler I would like to know if you need more information about the question. — Prashant, Commented Apr 7, 2018 at 18:11

Peter Leimbigler · Accepted Answer · 2018-04-07 18:20:30Z

0

The issue is that the current code replaces "spam" and "ham" with the one-character strings "1" and "0", but you filter the DataFrame based on comparison with the integer 1. Change the replace line to this:

data = data.replace({"spam": 1, "ham": 0})

answered Apr 7, 2018 at 18:20

Peter Leimbigler

11.1k1 gold badge26 silver badges39 bronze badges

Add a comment |

Collectives™ on Stack Overflow

Separate Spam and Ham for WordCloud Visualization

1 Answer 1

Your Answer

Not the answer you're looking for? Browse other questions tagged
python-3.x
pandas
join
spam-prevention
word-cloud
or ask your own question.

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

Your Answer

Sign up or log in

Post as a guest

Not the answer you're looking for? Browse other questions tagged python-3.xpandasjoinspam-preventionword-cloud or ask your own question.

Related

Not the answer you're looking for? Browse other questions tagged
python-3.x
pandas
join
spam-prevention
word-cloud
or ask your own question.