COLUMBIA UNIVERSITY COMS W6998
SYSTEMS FOR HUMAN DATA INTERACTION

A1: Play with vis systems

Due: Sun 9/19 11:59PM EST.

Edits:

  • 9/18 Updated tableau log file name to include “jprotocolserver.log”

In this assignment, you will use three interactive visualization systems to perform visual analysis, and report on the experience. The systems are:

The dataset you will be using is a daily record of the top trending YouTube videos. We select those from US, GB, DE, CA, FR and IN (USA, Great Britain, Germany, Canada, France, and India, respectively) for our assignments.

Tableau

Setup

  1. Download Tableau from their website.

  2. Connect Tableau to the hosted PostgreSQL databasethat we have set up for the course. To do so, create a new Tableau workbook based on the following server:

    • Type: PostgreSQL (you may have to download an additional driver for postgreSQL, here)
    • Hostname: pg-001.db.gcloud.instabase.com
    • Username: columbia
    • Password: B%38Mt5W@M*QU?Ar
    • Database: db_fea10998_f88d_4b6e_8f90_a6cd73bac65c
    • You should use the table called youtube_full

Task 1

Explore the dataset using only Tableau. Create visualizations to answer the following analysis questions. Copy and paste screenshots of the visualizations into your report:

  • Q1: For each country, how many days on average does a video trend?
  • Q2: Is there any relationship between the hour of the day that a video was published and the category of the video?

IMPORTANT

  • Copy of your Tableau log file and send them with your results. The log file we are interested in is called “tabprotosrv.txt” (or “jprotocolserver.log”). It is the one containing the SQL queries that the client issues.
    • On Windows, you may find it in “My Documents/My Tableau Repository/Logs”.
    • On MacOS, it should be in “Documents/My Tableau Repository/Logs”.
    • You may find more info here.
  • Make sure that the logging works before you start engaging in the exploration. To do so, issue a few dummy queries, open the “tabprotosrv.txt” file and make sure that it contains SQL statemements.

Task 2

Analyze and characterize the part of Tableau that you used based on the Multi-level Task Typology paper. Try to follow their terminology. You can also draw the task diagram (computer or hand drawn) and upload a picture in the submission form.

Vega-lite

Setup

Go to the vega-lite editor

Task

Your task is to use vega-lite to create visualizations that can be used to perform the following analysis tasks. You will submit a sharable link that you can access by clicking on “Share” and then “Copy Link to Clipboard” in the vega-lite editor

Vega-lite is browser-based, so you will use a small dataset here. To do so, insert the following into your vega-lite spec to load the dataset as a vega-lite data source:

"data": {
  "url": "https://raw.githubusercontent.com/columbiaviz/columbiaviz.github.io/master/_data/youtube_2020s_w6998.csv"
},
  1. V1: Create a visualization that shows the maximum number of Likes in each category.
  2. V2: Create a visualization that compares the ratio of Likes-Dislikes in different categories.
    • Hint: You’ll want to use data transforms to compute the ratios. Watch out for divide by zeros!

Submission Instructions

Fill out the google form