General Information
Lecturers: Daniele Apiletti, Silvia Chiusano, Diego Monti
Teaching staff: Eliana Pastor, Alessandro Fiori, Simone Monaco
SSD: ING-INF/05 – CFU: 8 – course details from the official student guide
Q&A teaching assistance on Piazza: https://piazza.com/polito.it/fall2021/01txasm
News
- 27 September 2021: the 21-22 edition of the course begins! See the official timetable.
- We are using Piazza for class discussion, we invite all students to join the course Piazza. Piazza is highly catered to getting help fast and efficiently from both classmates and teachers. Rather than emailing questions to the teaching staff, students are invited to post their questions on Piazza.
Exam
This section will provide the text and solution of the exams.
- Feb 1st, 2021
- Feb 15th, 2021
- June 17th, 2021
- September 1st, 2021
- January 28th, 2022
- February 17th, 2022
- Text + DW and NoSQL solutions (available soon)
- Data visualization solutions (pdf)
Teaching material
Course introduction (slides)
Data Warehousing
- Introduction (slides)
- Conceptual and logical design (slides) – slides updated on Oct, 1st, 2021 (minor revision)
- Data analysis, OLAP, extended SQL (slides)
- ETL process (slides)
- Materialized view (slides)
- Data warehousing in Oracle (slides)
- Data warehousing: physical design (slides)
Exercises
Exercise extended SQL, customers (text, draft solution)
Exercise extended SQL, rentals (text, draft solution)
Exercise DW design, hotels (text, draft solution)
Exercise DW design, storehouses (text, draft solution)
Exercise DW design, rentals (text, draft solution)
Exercise DW design, parcels (text, draft solution)
Exercise DW design, computer (text, draft solution)
Additional exercises:
- Vehicle rental (text, draft solution)
- Versioning services (text, draft solution)
- Craft beer (text, draft solution)
- Tickets (text, draft solution)
- Food delivery (text, draft solution)
NoSQL
- Non-relational databases for data management – introduction (slides)
- Introduction to MongoDB, collections, create, delete, GUI (slides)
- MongoDB, querying data, find operator, aggregation pipeline (slides)
- MongoDB aggregation examples, indexes (slides)
- Distributed Data Management, replication, and the CAP theorem (slides)
- MongoDB replica set (slides, updated Nov 18)
- Distributed transactions (slides)
- Distributed data processing and Map Reduce (slides)
- NoSQL design recipe (slides)
- MongoDB query exercises (slides)
- MongoDB query exercises IMDB (slides, IMDB database) – updated Dec 13, 2021 with solutions
- MongoDB design patterns part 1 (slide)
- MongoDB design patterns part 2 (slide)
- MongoDB design pattern exercises 1 (slide)
- MongoDB design pattern exercises 2 (slide)
Additional exercises:
Data Visualization
- What is visualization (slides)
- Visual perception (slides)
- Graph construction (slides)
- Data quality (slides)
- Past exams (slides)
Seminars
Docsity data lineage (slides)
Laboratory material
Lab practices start on Tuesday, October 19th, 2021.
LAB SCHEDULE.
TEAM A (FROM A TO K) on Tuesday from 1 pm to 2:30 pm
TEAM B (FROM L TO Z) on Tuesday from 2:30 pm to 4pm
Lab 1: Extended SQL
- Text (pdf)
- Data warehouse tables in csv format (zip)
- SQL Developer is already available at LABINF. If you want to practise at home, you can follow these tutorials
- Installing Oracle Database 18c Express Edition and SQL Developer
- Import Database and Tables: Tutorial
- In the case, you want to practice at home and you have problems in using Oracle Database and SQL Developer, you can consider Oracle Live SQL.
Draft solution (star schema, queries)
Lab 2: Extended SQL
Lab 3: Data Studio
- Text (pdf)
Lab 4: MongoDB Compass
Lab 5: MongoDB replica set
Lab 6: Visualization analysis
Lab 7: Redesign with Tableau
Lab 8: Visualization of a dataset
Lab 9: Intervals and dashboards
Lab 10: Geographic roles and maps
Lab 11: Dataviz exam simulation