Data Management and Visualization (2021-22)

General Information

Lecturers: Daniele Apiletti, Silvia Chiusano, Diego Monti

Teaching staff: Eliana Pastor, Alessandro Fiori, Simone Monaco

SSD: ING-INF/05 – CFU: 8 – course details from the official student guide

Q&A teaching assistance on Piazza: https://piazza.com/polito.it/fall2021/01txasm


News

  • 27 September 2021: the 21-22 edition of the course begins! See the official timetable.
  • We are using Piazza for class discussion, and we invite all students to join the course Piazza. Piazza is designed for getting help quickly and efficiently from both classmates and teachers. Rather than emailing questions to the teaching staff, students are invited to post them on Piazza.

Exam

This section will provide the text and solution of the exams.


Teaching material

Course introduction (slides)

Data Warehousing

  • Introduction (slides)
  • Conceptual and logical design (slides) – slides updated on Oct 1st, 2021 (minor revision)
  • Data analysis, OLAP, extended SQL (slides)
  • ETL process (slides)
  • Materialized view (slides)
  • Data warehousing in Oracle (slides)
  • Data warehousing: physical design (slides)

Exercises

Exercise extended SQL, customers (text)

Exercise extended SQL, rentals (text)

Exercise DW design, hotels (text)

Exercise DW design, storehouses (text)

NoSQL

  • Non-relational databases for data management – introduction (slides)
  • Introduction to MongoDB, collections, create, delete, GUI (slides)
  • MongoDB, querying data, find operator, aggregation pipeline (slides)
  • MongoDB aggregation examples, indexes (slides)
  • Distributed Data Management, replication, and the CAP theorem (slides)
  • MongoDB replica set (slides, updated Nov 18)
  • Distributed transactions (slides)
  • Distributed data processing and Map Reduce (slides)
  • NoSQL design recipe (slides)
  • MongoDB query exercises (slides)
  • MongoDB query exercises IMDB (slides, IMDB database)
  • MongoDB design patterns part 1 (slides)
  • MongoDB design patterns part 2 (slides)
  • MongoDB design pattern exercises 1 (slides)

Data Visualization


Laboratory material

Lab practices start on Tuesday, October 19th, 2021.

LAB SCHEDULE. 
TEAM A (FROM A TO K) on Tuesday from 1 pm to 2:30 pm
TEAM B (FROM L TO Z) on Tuesday from 2:30 pm to 4 pm

Lab 1: Extended SQL

  • Text (pdf)
  • Data warehouse tables in csv format (zip)
  • SQL Developer is already available at LABINF. If you want to practice at home, you can follow these tutorials:
  • Installing Oracle Database 18c Express Edition and SQL Developer
    • To download and install Oracle Express Edition: home page
    • To download and install SQL Developer: home page
    • Tutorial
  • If you want to practice at home and have problems using Oracle Database and SQL Developer, you can use Oracle Live SQL instead.
    • You can add tables using SQL scripts (zip)
    • A short guide on how to import SQL scripts and query the DB in Oracle Live SQL is available (pdf)

Draft solution (star schema, queries)

Lab 2: Extended SQL

  • Text – Additional queries (pdf)
  • Draft solution (pdf)

Lab 3: Data Studio

Lab 4: MongoDB Compass

Lab 5: MongoDB replica set

Lab 6: Visualization analysis

Lab 7: Redesign with Tableau

Lab 8: Visualization of a dataset