General Information
Lecturers: Daniele Apiletti, Diego Monti
Teaching staff: Alessandro Fiori, Simone Monaco
SSD: ING-INF/05 – CFU: 8 – course details from the official student guide
Q&A teaching assistance on Piazza: piazza.com/polito.it/fall2023/01txasm
đź“° News
- We are using Piazza for class discussion. We invite all students to join the course Piazza. Piazza is highly suggested to get help fast and efficiently from classmates and teachers. Rather than emailing questions to the teaching staff, students are invited to post their questions on Piazza.
- Seminar announcement – Data science and retail: how Data Management and Machine Learning can improve customer experience
- When: Monday, January 15th, 14:30-16:00, classroom 1P
- Who: Marco Stella, Data Science Manager, Miroglio Group
- What: Miroglio Group ambition is to become a Data and Customer centric company. How to propose the best personalized product for each customer? Can new technologies facilitate this goal? In this seminar we will explain how we are implementing a Cloud Data Platform to know and understand better our customers. We will go through a use case where Machine Learning algorithms are applied to customer segmentation, hence facilitating, pushing and harmonizing marketing product campaigns.
- Material: slides
- Seminar announcement – PowerBI: a free tool for Big Data Management and Advanced Visualization
- When: Tuesday, January 16th, 11:30-13:00, classroom R3
- Who: Victor Rivas, Evergrow BI
- What: Unlock the full potential of Power BI, a premier, yet free tool for Windows users. The seminar will guide you through the strategic use of Power BI for managing and visualizing Big Data using innovative features like Composite Models and Direct Lake in Fabric. Glean insights into the comparative advantages of tabular versus multidimensional models, and learn to craft visual narratives that make data speak volumes.
- Material: intro by Adam Saxon, Program Manager, Microsoft.
- Seminar announcement – The power of Data Modeling in Business Analytics
- When: Tuesday, January 16th, 13:00-14:30, classroom R3
- Who: Marco Russo, SQLBI
- What: This seminar focuses on different data models used in business analytics. Using the example of “Tickit”, a fictional website for buying and selling event tickets, we’ll examine how data models work in real-world scenarios. We’ll look at typical raw data (like denormalized files and OLTP data source, and then compare different analytical models like Inmon (data warehouse) and Kimball (dimensional modeling), and then introduce the additional features of semantic models to create reports in an interactive way and write shorter queries.
- Material: slides and demo
đź“’ Teaching material
Course introduction (slides)
- Non-relational databases for data management – introduction (slides)
- Introduction to MongoDB, collections, create, delete, GUI (slides)
- MongoDB, querying data, find operator, aggregation pipeline (slides)
- MongoDB aggregation examples, indexes (slides)
- Distributed Data Management, replication, and the CAP theorem (slides)
- MongoDB replica set (slides, updated Nov 18)
- Distributed transactions (slides)
- Distributed data processing and Map Reduce (slides)
- NoSQL design recipe (slides)
- MongoDB design patterns part 1 (slide)
- MongoDB design patterns part 2 (slide)
🗒️ Exercises
- Exercise extended SQL, customers (text, solution)
- Exercise extended SQL, rentals (text, solution)
- Exercise DW design, hotels (text, solution)
- Exercise DW design, storehouses (text, solution)
- Exercise DW design, rentals (text, solution)
- Exercise DW design, parcels (text, solution)
- Exercise DW design, computer (text, solution)
- Exercise DW design, vehicle rental (text, solution)
- Exercise DW design, versioning services (text, solution)
- Exercise DW design, craft beer (text, solution)
- Exercise DW design, tickets (text, solution)
- Exercise DW design, food delivery (text, solution)
- MongoDB query exercises (slides)
- MongoDB query exercises IMDB (slides, IMDB database)
- MongoDB design pattern exercises 1 (slide)
- MongoDB design pattern exercises 2 (slide)
đź’» Laboratory material
The first lab is scheduled for Thursday, October 19
Students group | Time | Room |
TEAM A (FROM A TO K) | Thursday, 16:00 – 17:30 | LAIB2B |
TEAM B (FROM L TO Z) | Thursday, 17:30 – 19:00 | LAIB2B |
For Labs 1 and 2, you need to run Extended SQL on Oracle databases. SQL Developer is already available on the devices. If you want to practise at home (or on PoliTO devices as well), we suggest you configure its online version on Oracle Live SQL, in particular:
- You can add tables using SQL scripts
- A short guide on how to import SQL scripts and query the DB in Oracle Live SQL is available (pdf)
Lab 1: Extended SQL
- Text (pdf)
- Data warehouse tables (zip (CSV), zip (SQL))
- Solution (star schema, queries)
Lab 2: Extended SQL
Lab 3: Looker Studio
- Text (pdf)
Lab 4: MongoDB Compass
Lab 5: MongoDB replica set
Lab 6: Visualization analysis
- Solution (xlsx)
Lab 7: Redesign with Tableau
- Text (pdf)
- Solution (zip)
Lab 8: Visualization of a dataset
- Solution (zip)
Lab 9: Intervals and dashboards
- Solution (zip)
Lab 10: Geographic roles and maps
- Solution (zip)
Lab 11: Dataviz exam simulation