Data Science And Database Technology (2025/2026)

Data Science And Database Technology (2025/2026)

General Information

SSD CFUProfessorTeaching Assistants
ING-INF/058Silvia ChiusanoAlessandro Fiori, Davide Napolitano

📰 Announcements [dd/mm/yyyy]

  • Laboratories start on October 13, 2025

📅Weekly schedule (13/10/2025 – 17/10/2025)

MondayTuesdayWednesdayThursdayFriday
8:30-10:00
10:00-11:30[ROOM 10A]: Data preprocessing
11:30-13:00[TEAM B: Lab on room LAIB1T] Exercises on extended SQL [ROOM R1]: Exercises on Data warehouse design and extended SQL [TEAM C: Lab on room LAIB4] Exercises on extended SQL
13:00-14:30[TEAM A: Lab on room 12IStudents must bring their own PC] Exercises on extended SQL [ROOM R1]: Exercises on Data warehouse design and extended SQL
14:30-16:00[ROOM 5S]: Lesson: ETL; Data lakes; data mining process
16:00-17:30
17:30-19:00

📒Teaching Material

  • Course introduction: pdf
  • Introduction to Data Science (slides)
  • Data warehouse: introduction (slides)
  • Data warehouse: design (slides)
  • Data warehouse: analysis (slides)
  • Data warehouse: materialized view, physical design, ETL (slides)
  • Data mining process (slides)
  • Data preparation (slides)
  • Data mining: association rules (slides)
  • Data mining: classification (slides)
  • Data mining: clustering (slides)
  • Introduction to DBMS (slides)
  • Buffer Manager (slides)
  • Physical access to data (slides)
  • Query optimization (slides)
  • Physical Design (slides)
  • Concurrency Control (slides)
  • Reliability management (slides)
  • Distributed databases (slides)
  • NoSQL, beyond relational databases (slides)
  • Introduction to MongoDB (slides)
  • ElasticSearch (slides)

🗒️Exercise


💻Laboratory

  • The laboratory practices will start from the fourth week.
  • It is recommended to respect the division into teams to allow the laboratories to take place.
  • Laboratory practices are mainly based on web-based applications – they do not require the installation of ad hoc software
  • Laboratories are NOT mandatory, but they allow you to practice with exercises similar to those that you will face in the homework (max 2 points) and in the Exam.
LAB TEAMS (Division into teams for Surname) WHEN HOURWHERE
TEAM A: from AAA to DOL (Students must bring their own PC)Monday 13:00-14:3012I
TEAM B: from DON to KOCWednesday11:30-13:00LAIB1T
TEAM C: from KOL to ZZZFriday11:30-13:00LAIB4
SUBJECTTEAM ATEAM BTEAM CTEXTSOLUTIONSOFTWARE
Lab 1: Extended SQLMonday 13/10/2025Wednesday 15/10/2025Friday 17/10/2025TextDW SQLFiles
Lab 2: Data StudioMonday 27/10/2025Wednesday 29/10/2025Friday 31/10/2025
Lab 3: Materialize viewsMonday 03/11/2025Wednesday 05/11/2025Friday 07/11/2025
Lab 4: Data mining with PythonMonday 17/11/2025Wednesday 19/11/2025Friday 21/11/2025
Lab 5: Oracle optimizerMonday 24/11/2025Wednesday 26/11/2025Friday 28/11/2025
Lab 6: MongoDBMonday 08/12/2025Wednesday 10/12/2025Friday 12/12/2025

📗Homeworks

To obtain the points associated with the Homeworks, students have to observe the following terms:

  • Complete all the points of the exercises in the homework text.
  • All exercises must be computer-written (e.g. Conceptual Schema, Logical Schema, SQL queries, Triggers, etc…).
  • Prepare one file in PDF format with the solution of the homework.
  • Name the file as: HomeworkN_Surname_Name_StudentId.pdf where
    • StudentId, Surname and Name should be substituted with student information (place all names/surnames that are used in your PoliTo account, attached to each other)
    • the N character following Homework should be substituted with the number of the submitted homework
    • Since uploaded files are automatically processed, using the wrong format or naming the file with a wrong name implies the cancellation of the related homework submission.
    • For example, for homework 1 and extension pdf, the student with name Luigi Maria, surname De Rossi and id s123456 will upload Homework1_DeRossi_LuigiMaria_s123456.pdf
  • Load the file on the didactic portal (Portale della didattica) in the section Work Submission (Elaborati) before the deadline.
    • Multiple loadings for the same student and/or for the same homework are not allowed.
    • The upload date shown on the didactic portal is considered for the evaluation.
    • Since uploaded files are automatically processed, uploading after the deadline implies canceling the related homework submission.
  • During the upload procedure a description (“Descrizione”) field is requested. Insert the same name of the file according to the rules described above (w/ or w/o extension are both fine).
  • Only the students without the access to the course page on the didactic portal can submit the homework before the deadline by sending an email with the PDF to the assistant lecturer (davide.napolitano@polito.it)
  • Discuss the homework with a positive evaluation on the fixed date (announcement will be published).

Homework Info:

  • Homoworks are NOT mandatory
  • Each Homework provides at most 0.5pt, resulting in a max score of 2pt
  • Each Homework is evaluated between 0 and 30, with the final score scaled into the [0, 0.5] pt range.

Homework Discussion:

Homework Schedule:

HomeworkTextFilesUploadDeadline
Homework1: DW, Extended SQL and MVuploaded before the end of November 11th, 2025 to be delivered by November 19th, 2025 at 11.59 PM (UTC/GMT+1)
Homework2: Data mininguploaded before the end of November 24th, 2025to be delivered by December 2nd, 2025 at 11.59 PM (UTC/GMT+1)
Homework3: The Optimizeruploaded before the end of December 1st, 2025to be delivered by December 8th, 2025 at 11.59 PM (UTC/GMT+1)
Homework4: MongoDBuploaded before the end of December 15th, 2025to be delivered by Decemeber 23rd, 2025 at 11.59 PM (UTC/GMT+1)