Data Management and Visualization (2024-25)

Data Management and Visualization (2024-25)

General Information

Lecturers: Daniele Apiletti, Diego Monti

Teaching staff: Simone Monaco

SSD: ING-INF/05 – CFU: 8 – course details from the official student guide

Q&A teaching assistance on Piazza: piazza.com/polito.it/fall2024/01txasm


📰 News

  • We are using Piazza for class discussion. We invite all students to join the course Piazza. Piazza is highly suggested to get help fast and efficiently from classmates and teachers. Rather than emailing questions to the teaching staff, students are invited to post their questions on Piazza.

📒 Teaching material

Course introduction (slides)

  • Introduction (slides)
  • Conceptual and logical design (slides)
  • Data analysis, OLAP, extended SQL (slides)
  • ETL process (slides)
  • Data Lakes and ELT (slides)
  • Materialized view (slides)
  • Data warehousing in Oracle (slides)
  • Data warehousing: physical design (slides)
  • Conceptual schema: textual formalism (slides)
  • Non-relational databases for data management – introduction (slides)
  • Introduction to MongoDB, collections, create, delete, GUI (slides)
  • MongoDB, querying data, find operator, aggregation pipeline (slides)
  • MongoDB aggregation examples, indexes (slides)
  • Distributed Data Management, replication, and the CAP theorem (slides)
  • MongoDB replica set (slides, updated Nov 18)
  • Distributed transactions (slides)
  • Distributed data processing and Map Reduce (slides)
  • NoSQL design recipe (slides)
  • MongoDB design patterns part 1 (slide)
  • MongoDB design patterns part 2 (slide)

🗒️ Exercises


💻 Laboratory material

Students groupTimeRoom
TEAM 1 (FROM A TO K)Tuesday, 13:00 – 14:30LAIB3B
TEAM 2 (FROM L TO Z)Tuesday, 14:30 – 16:00LAIB3B

For Labs 1 and 2, you must run Extended SQL on Oracle databases. The suggested tool for these Labs is Oracle Live SQL. To configure the environment you can add tables using SQL scripts. A short guide on how to import them and query the DB is available (pdf)

Lab 1: Extended SQL

Text (📄pdf)
Data warehouse tables (🗃️CSV, 🗃️SQL)
Solution (📄star schema, 📄queries)

Lab 2: Extended SQL

Text – Additional queries (📄pdf)
Solution (📄pdf)

Lab 3: Looker Studio

Text (📄pdf)

Lab 3: NoSQL with PyMongo

Text (📄pdf)
Data (🗂️link)
Notebook (📒colab)

Solution (📒colab)

Lab 5: MongoDB replica set

Text (📄pdf)
Notebook (📒colab)
Docker Compose (📜file)

⭐ Solution (📒colab)

Lab 6: Visualization analysis

Text (📄pdf)
Data (🗂️xlsx)

Solution (🗂️xlsx)

Lab 7: Redesign with Tableau

Text (📄pdf)
Data (🗂️xlsx)

Solution (🗃️zip)

Lab 8: Visualization of a dataset

Text (📄pdf)
Data (🗃️zip;📄txt)

Solution (🗃️zip)

Lab 9: Intervals and dashboards

Text (📄pdf)
Data (🗃️zip)

Solution (🗃️zip)

Lab 10: Geographic roles and maps

Text (📄pdf)
Data (🗃️zip)

Solution (🗃️zip)

Lab 11: Dataviz exam simulation

Text (📄pdf)
Visualization (📊jpg)

Solution (📄pdf🗃️zip)


📄 Past Exams

More exams are available on moodle.polito.it.

TitleTextSolutions
NEW March 13th, 2024📝Text and Solution
September 16th, 2024📝Text + DW and NoSQL solutions
September 20th, 2023📝Text + DW and NoSQL solutions
June 21th, 2023📝Text + DW and NoSQL solutions
February 22th, 2023📝Text + DW and NoSQL solutions📑Data visualization solutions
📑Conceptual design
February 7th, 2023📝Text + DW and NoSQL solutions📑Data visualization solutions
📑Conceptual design
June 29th, 2022📝Text + DW and NoSQL solutions
February 17th, 2022📝Text + DW and NoSQL solutions📑Data visualization solutions
📑Conceptual design
January 28th, 2022📝Text + DW and NoSQL solutions📑Data visualization solutions