Data Management and Visualization (2022-23)

Data Management and Visualization (2022-23)

General Information

Lecturers: Daniele Apiletti, Silvia Chiusano, Diego Monti

Teaching staff: Alessandro Fiori, Simone Monaco

SSD: ING-INF/05 – CFU: 8 – course details from the official student guide

Q&A teaching assistance on Piazza: piazza.com/polito.it/fall2022/01txasm/


News

  • 5 December 2022: the Lab practice originally scheduled for December 16th is canceled. It will be held on December 20th in classroom 2P (please bring your laptop): team A from 10:00 to 11:30, team B from 11:30 to 13:00.
  • 10 October 2022: the lecture on Tuesday, October 11, 2022 will be held only online (not in presence) via virtual classroom. The lecture will be recorded.
  • 9 October 2022: the lesson on Monday, October 10, 2022 will be held only online (not in presence) via virtual classroom. The lesson will be recorded.
  • 27 September 2023: the 22-23 edition of the course begins! See the official timetable.
  • We are using Piazza for class discussion, we invite all students to join the course Piazza. Piazza is highly catered to getting help fast and efficiently from both classmates and teachers. Rather than emailing questions to the teaching staff, students are invited to post their questions on Piazza.

Teaching material

Course introduction (slides) – errata corrige on October 21, 2022: the Rector has informed us that teaching activities will be regularly provided on Friday, December 9th, 2022, hence please note that our Lab practice will be held on that day from 8:30 to 11:30, as in the standard weekly timetable of the course

Data Warehousing

  • Introduction (slides)
  • Conceptual and logical design (slides)
  • Data analysis, OLAP, extended SQL (slides)
  • ETL process (slides)
  • Materialized view (slides)
  • Data warehousing in Oracle (slides)
  • Data warehousing: physical design (slides)

Exercises

NoSQL

  • Non-relational databases for data management – introduction (slides)
  • Introduction to MongoDB, collections, create, delete, GUI (slides)
  • MongoDB, querying data, find operator, aggregation pipeline (slides)
  • MongoDB aggregation examples, indexes (slides)
  • Distributed Data Management, replication, and the CAP theorem (slides)
  • MongoDB replica set (slides, updated Nov 18)
  • Distributed transactions (slides)
  • Distributed data processing and Map Reduce (slides)
  • NoSQL design recipe (slides)
  • MongoDB query exercises (slides)
  • MongoDB query exercises IMDB (slides, IMDB database)
  • MongoDB design patterns part 1 (slide)
  • MongoDB design patterns part 2 (slide)
  • MongoDB design pattern exercises 1 (slide)
  • MongoDB design pattern exercises 2 (slide)
  • Additional exercises:
    • MongoDB design pattern exercises (text)
    • MongoDB query exercises (text)

Data Visualization


Laboratory material

Lab practices start on Friday, October 14th, 2022.

Students groupTimeRoom
TEAM A (FROM A TO K)Friday, 8:30 – 10:00 amLAIB4
TEAM B (FROM L TO Z)Friday, 10:00 – 11:30 amLAIB4

For Lab 1 and 2 you need to run Extended SQL on Oracle databases. SQL Developer is already available at LABINF. If you want to practise at home, you can follow one of these options:

Online version [SUGGESTED METHOD] Installing Oracle Database 18c and SQL Developer [NOT RECOMMENDED]
Instead of installing Oracle Database and SQL Developer (not always straightforward to configure), you can consider Oracle Live SQL.
  • You can add tables using SQL scripts
  • A short guide on how to import SQL scripts and query the DB in Oracle Live SQL is available (pdf)
  • To download and install Oracle Express Edition: home page
  • To download and install SQL Developer: home page
  • TutorialInstallation Guide for Windows
  • Installation Guide for Ubuntu
  • Installation Guide for Mac OS
  • Import Database and Tables: Tutorial

Lab 1: Extended SQL

Lab 2: Extended SQL

  • Text – Additional queries (pdf)
  • Solution (pdf)

Lab 3: Looker Studio

Lab 4: MongoDB Compass

  • Draft solution (pdf)

Lab 5: MongoDB replica set

  • Draft solution (pdf)

Lab 6: Visualization analysis

Lab 7: Redesign with Tableau

  • Solution (zip)

Lab 8: Visualization of a dataset

  • Solution (zip)

Lab 9: Intervals and dashboards

  • Solution (zip)

Lab 10: Geographic roles and maps

  • Solution (zip)

Lab 11: Dataviz exam simulation

  • Text (pdf)
  • Visualization (jpg)

Exam

This section will provide the texts and solutions for the exams.

  • Feb 1st, 2021
    • Text + DW and NoSQL solutions (pdf)
    • Data visualization solutions (pdf)
  • Feb 15th, 2021
    • Text + DW and NoSQL solutions (pdf)
    • Data visualization solutions (pdf)
  • June 17th, 2021
    • Text + DW and NoSQL solutions (pdf)
    • Data visualization solution (pdf)
  • September 1st, 2021
    • Text + DW and NoSQL solutions (pdf)
    • Data visualization solutions (pdf)
  • January 28th, 2022
    • Text + DW and NoSQL solutions (pdf)
    • Data visualization solutions (pdf)
  • February 17th, 2022
    • Text + DW and NoSQL solutions (pdf)
    • Data visualization solutions (pdf)
  • February 7th, 2023
    • Text + DW and NoSQL solutions (pdf)
    • Data visualization solutions (pdf)
  • February 22th, 2023
    • Text + DW and NoSQL solutions (pdf)
    • Data visualization solutions (pdf)