Skip to main content

Using Tableau and Letterboxd To Manipulate Movie Data

 


Letterboxd is a personal database to keep track of what movies you've watched, want to watch, and their ratings 🎥

Now, we're looking at my own data. Everything goes from 2016 - 2020, even though some movies are dated for the years they were released. I am keeping the date watched and the year released in the data. I've hidden the Letterboxd URLs to the specific movie pages - It's not necessary - and my own username is not shown.

Spoiler alert: There are movies that are highly acclaimed that...I do not like! Feel free to link to this post and go "How does she like 'Valerian and The City of A Thousand Planets' more than 'Up'?"

Check out the public charts here on Tableau Public, and follow along!


Ratings breakdown:

  • 15 movies are rated 5 (perfect).
  • 25 movies are rated as 4 (Very good)
  • 33 movies are rated 3.5 (fine enough) 
  • 35 movies are rated 3
  •  24 are rated 2.5 (I've seen worse, but barely.)
  • 9 movies are rated 2 (I'd watch if I was bored. Very bored.)
  • 11 movies are rated 1.5 (Goodness me)
  • 3 movies are rated 1 
  • 1 Lone little movie is rated 0.5


Year and Quarter Watched

It's easiest to point out for 2020; I'm not sure I watched any older movies in 2020 (besides 1), so I didn't catalog them on the website.


For the curious, July is "The Personal History of David Copperfield". If 2020 had not been 2020, June would have included A24's "The Green Knight", Disney's "Soul", and Universal's "Candyman".

Q1 -Jan to (and including) March.

Q2 - April to (and including) June

Q3 - July to (and including)  September

Q4 - is October to December

I notice that does not include rewatches of movies. I know I logged a rewatch that would be in Q1 of 2020. 

If you can guess which movie it is (You get one shot at it), post on this LinkedIn post, I'll send you the code to a 5$ gift card to either Starbucks or Dunkin. Your choice!

Hint

 


Look at this graph, and in Q1. You have a 1 in 90 chance of getting it! 

Contest ended November 28th, 2020.

Average Ratings Per Year (The movie came out)

I had to adjust this a bit because the information was spread out over 120 years (The oldest movie was "A Trip To The Moon"), so I adjusted the axis to start at 1960 and go to 2021 for visual clarity. 

 
The year that had the movies with the highest ratings were 2000, with 5. However, how many movies from 2000 did I watch?

Only 1: 

 This is a pro-Wuxia house.

Yep, that'll skew it.


Amount of Movies Watched Per Quarter Over 174 Movies

This is not the Quarter they were released, but the Quarter they were watched in, no matter the year.



I expect Q4 to rise by 1, with the release of "Soul" on Disney+. If "Raya And The Last Dragon" came out as scheduled on Thanksgiving, it would have been added also.

I'm working on a bit more data manipulation - I want to learn how to sort by # of 5 ratings per watch year, or how many 3.5 ratings were in Q2 of 2019?

Don't forget about the competition and the prize 😉 Ends December 20th, 2020. Or until someone gets it.


Comments

Popular posts from this blog

Connecting IoT Devices to a Registration Server (Packet Tracer, Cisco)

In Packet Tracer, a demo software made by Cisco Systems. It certainly has changed a lot since 2016. It's almost an Olympic feat to even get started with it now, but it does look snazzy. This is for the new CCNA, that integrates, among other things, IoT and Automation, which I've worked on here before. Instructions here . I don't know if this is an aspect of "Let's make sure people are paying attention and not simply following blindly", or an oversight - The instructions indicate a Meraki Server, when a regular one is the working option here. I have to enable the IoT service on this server. Also, we assign the server an IPv4 address from a DHCP pool instead of giving it a static one. For something that handles our IoT business, perhaps that's safer; Getting a new IPv4 address every week or so is a minimal step against an intruder, but it is a step. There are no devices associated with this new server; In an earlier lab (not shown), I attached them to 'H

Building, Breaking, and Building A CRM with Retool

 I like no- or low-code solutions to things. I've often wanted to simply push a button or move some GUI around and have the code implement itself.  I've thought about building something that's like a customer relationship management (CRM) system for keeping up with my network better than my little spreadsheet where I click links and then go like something. The general idea in this CRM Development is:  To have a GUI to add people to a NRM (Network Relationship Management).       Attach it to a database (MySQL is what I went with eventually using Amazon Relational Database service, but you can use PostGRES, and probably others).     Make sure components are connected to each other in the retool interface. This video is a good start. Watching the tutorial video, heard some SQL commands and went 'Oh no 😳" before going "Wait I know basic SQL", which is good, because you'll see.  When you get set up, there's a plethora of resources you can use -- Incl

Securing Terraform and You Part 1 -- rego, Tfsec, and Terrascan

9/20: The open source version of Terraform is now  OpenTofu     Sometimes, I write articles even when things don't work. It's about showing a learning process.  Using IaC means consistency, and one thing you don't want to do is have 5 open S3 buckets on AWS that anyone on the internet can reach.  That's where tools such as Terrascan and Tfsec come in, where we can make our own policies and rules to be checked against our code before we init.  As this was contract work, I can't show you the exact code used, but I can tell you that this blog post by Cesar Rodriguez of Cloud Security Musings was quite helpful, as well as this one by Chris Ayers . The issue is using Rego; I found a cool VS Code Extension; Terrascan Rego Editor , as well as several courses on Styra Academy; Policy Authoring and Policy Essentials . The big issue was figuring out how to tell Terrascan to follow a certain policy; I made it, put it in a directory, and ran the program while in that directory