Predicting whether a Marvel character is good or evil using Big Data analytics.

This article uses statistical methods, data mining techniques and Python to build a prediction model for the alignment of superhero characters. We start by choosing a dataset, which is then prepared, cleaned and analyzed.

The cleansed data is then imported into the data mining tool Weka to create a prediction model that predicts a character's alignment from its description. The resulting model is tested, and the different results are discussed. This article assumes you have some familiarity with Python and Pandas, and will therefore not go into much detail on these subjects. If you have any questions please shoot me an email at

Author: Vetle Ottem Frantzvaag
