Alien vs. Predator

Alien vs. Predator

This is a quick visualization of data from the netflix prize. A vertical bar is drawn for every customer rating a movie. Ratings go from 1 to 5 stars (represented top to bottom.) Note how “Alien” (on the left) received many ratings of 4 and 5 stars, but “Predator” (on the right) mostly received ratings of 4 stars. This depicts approximately 50,000 customer ratings.

  • http://www.tom-carden.co.uk/ Tom

    Neat… is it still legible if you use the same colour for both sides? The left side is a little bit brighter I think, could be hiding something?

    Downloading the data now… it’s not exactly instant gratification territory, huh?

  • http://www.shiffman.net Daniel

    Hmmmm. It also may simply be due to the fact that there are fewer ratings for Predator than Alien (~10% less I believe). Oh, also, heh, I noticed a bug in my code which used only about 75% of the ratings for Predator. Need to fix that. . . zoinks.

    Downloading the data is fun, even more fun is inserting it into a database, which took about 6 hours to run on my lowly machine.