Data Science And Database Technology (2024/2025)

General Information

SSD | CFU | Professor | Teaching Assistants
ING-INF/05 | 8 | Silvia Chiusano | Alessandro Fiori, Davide Napolitano

📰 Announcements [dd/mm/yyyy]

  • [02/12/2024] Homework 2 published
  • [22/11/2024] Homework 1 deadline updated
  • [18/11/2024] Homework 1 published
  • [10/10/2024] Lab 1 will take place next week (14/10/2024 – 18/10/2024)

📅Weekly schedule (2/12/2024 – 6/12/2024)

Monday | Tuesday | Wednesday | Thursday | Friday
8:30-10:00 [TEAM B: LAIB2B] Oracle optimizer
10:00-11:30 [ROOM R2] Concurrency control
10:00-11:30 [ROOM R1] Solution of Homework #1 on DW, Extended SQL and MV; exercises on query optimization
11:30-13:00 [ROOM R1] Concurrency control
13:00-14:30 [TEAM A: LAIB3] Oracle optimizer
14:30-16:00 [ROOM R2] Exercises on query optimization
16:00-17:30 [TEAM C: LAIB2B] Oracle optimizer
17:30-19:00

📒Teaching Material

Course Introduction
  • Course introduction: pdf
Introduction to Data Science
  • Introduction to Data Science (slides)
Data warehouse
  • Data warehouse: introduction (slides)
  • Data warehouse: design (slides)
  • Data warehouse: analysis (slides)
  • Data warehouse: materialized view, physical design, ETL (slides)
Data lakes
  • Data lakes (slides)
Data mining
  • Data mining process (slides)
  • Data preparation (slides)
  • Data mining: association rules (slides)
  • Data mining: classification (slides)
  • Data mining: clustering (slides)
DBMS
  • Introduction to DBMS (slides)
  • Buffer Manager (slides)
  • Physical access to data (slides)
  • Query optimization (slides)
  • Physical Design (slides)
  • Oracle: Oracle optimizer (slides), Oracle Hints (slides)
  • Concurrency Control (slides)
  • Reliability management (slides)
  • Distributed databases (slides)
  • NoSQL, beyond relational databases (slides)
  • Introduction to MongoDB (slides)
  • ElasticSearch (slides)


🗒️Exercise

Extended SQL, materialized view, triggers
  • Extended SQL and materialized view in Oracle (2 slides per page, 6 slides per page)
  • Exercise 1 on extended SQL (text)
  • Materialized views and triggers (text, draft solution)
  • Supporting material: Introduction to triggers (slides)
Data Warehouse
  • Storehouses (text, draft solution)
  • Italian wines (text, draft solution)
  • Remote heating (text, draft solution)
  • Scientific publications (text)
  • Parcels (text)
Query optimization
  • Fine (text)
  • Students (text)
  • Athletes (text)
  • Tourist village (text)


💻Laboratory

  • Laboratory sessions start in the fourth week.
  • The subdivision into teams may be subject to change following the close of the registration period.
  • Please respect the division into teams so that the laboratories can run properly.
  • Remember to bring your laptop to work on the lab exercises.
LAB TEAMS (division into teams by surname) | WHEN | HOUR | WHERE
TEAM A: from AAA to GZZ | Monday (except Lab 6: Tuesday 17/12/2024 – 17:30-19:00 Laib2B) | 13:00-14:30 | LAIB3
TEAM B: from HAA to OZZ | Friday | 8:30-11:30 | LAIB2B
TEAM C: from PAA to ZZZ | Tuesday | 16:00-17:30 | LAIB2B
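
The division by surname above can be sketched in a few lines of Python. This is only an illustration, not an official tool: the function name `lab_team` is hypothetical, and it assumes that the first letter of the surname is enough to decide the team (A to G, H to O, P to Z) and that the surname starts with an unaccented Latin letter.

```python
def lab_team(surname: str) -> str:
    """Return the lab team for a surname, based on its first letter.

    Assumption: only the first alphabetic character matters, which matches
    the ranges AAA-GZZ, HAA-OZZ and PAA-ZZZ.
    """
    # Take the first alphabetic character, ignoring case, spaces and apostrophes.
    first = next((ch for ch in surname.upper() if ch.isalpha()), None)
    if first is None or not ("A" <= first <= "Z"):
        raise ValueError(f"cannot determine the team for surname {surname!r}")
    if first <= "G":
        return "TEAM A"
    if first <= "O":
        return "TEAM B"
    return "TEAM C"
```

For example, `lab_team("De Rossi")` returns `"TEAM A"`. The official team lists remain the reference, since the division may change after the registration period closes.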
SUBJECT | TEAM A | TEAM B | TEAM C | TEXT | SOLUTION | SOFTWARE
Lab 1: Extended SQL | Monday 21/10/2024 | Friday 18/10/2024 | Tuesday 22/10/2024 | Text | DW SQL | Files
Lab 2: Data Studio | Monday 28/10/2024 | Friday 25/10/2024 | Tuesday 29/10/2024 | Text | - | -
Lab 3: Materialized views | Monday 11/11/2024 | Friday 15/11/2024 | Tuesday 12/11/2024 | Text | Sol | -
Lab 4: Data mining with Python | Monday 18/11/2024 | Friday 22/11/2024 | Tuesday 19/11/2024 | Text | - | Files
Lab 5: Oracle optimizer | Monday 02/12/2024 | Friday 06/12/2024 | Tuesday 03/12/2024 | Text | - | Files
Lab 6: MongoDB | Tuesday 17/12/2024 (17:30-19:00 Laib2B) | Friday 20/12/2024 | Tuesday 17/12/2024 | - | - | -

📗Homeworks

To obtain the points associated with the homeworks, students must comply with the following rules:

  • Complete all the points of the exercises in the homework text.
  • All exercises must be typed on a computer, not handwritten (e.g. Conceptual Schema, Logical Schema, SQL queries, Triggers, etc.).
  • Prepare one file in PDF format with the solution of the homework.
  • Name the file as: HomeworkN_Surname_Name_StudentId.pdf where
    • StudentId, Surname and Name should be replaced with the student's information (include all names and surnames registered in your PoliTo account, joined together without spaces)
    • the N character following Homework should be replaced with the number of the submitted homework
    • Since uploaded files are automatically processed, using the wrong format or the wrong file name results in the cancellation of the related homework submission.
    • For example, for homework 1 and extension pdf, the student with name Luigi Maria, surname De Rossi and id s123456 will upload Homework1_DeRossi_LuigiMaria_s123456.pdf
  • Upload the file to the didactic portal (Portale della didattica), in the section Work Submission (Elaborati), before the deadline.
    • Multiple uploads by the same student and/or for the same homework are not allowed.
    • The upload date shown on the didactic portal is the one considered for the evaluation.
    • Since uploaded files are automatically processed, uploading after the deadline results in the cancellation of the related homework submission.
  • During the upload procedure, a description (“Descrizione”) field is requested. Enter the file name, built according to the rules described above (with or without the extension).
  • Only students without access to the course page on the didactic portal may submit the homework by email, sending the PDF to the assistant lecturer (davide.napolitano@polito.it) before the deadline.
  • Homeworks with a positive evaluation must be discussed on the scheduled date (an announcement will be published).
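
Since a wrongly named file cancels the submission, it may help to build and check the file name programmatically. The following is a minimal sketch of the naming rule in the list above, not the portal's actual check: the names `homework_filename`, `looks_valid` and `FILENAME_PATTERN` are hypothetical, and the pattern assumes that student ids have the form `s` followed by digits and that homeworks are numbered 1 to 4.

```python
import re


def homework_filename(number: int, surname: str, name: str, student_id: str) -> str:
    """Build HomeworkN_Surname_Name_StudentId.pdf from the student's data."""
    # Multi-part surnames and names are joined without spaces,
    # e.g. "De Rossi" -> "DeRossi" and "Luigi Maria" -> "LuigiMaria".
    surname_part = "".join(surname.split())
    name_part = "".join(name.split())
    return f"Homework{number}_{surname_part}_{name_part}_{student_id}.pdf"


# Assumed pattern: homework number 1-4, no spaces or underscores inside the
# surname and name parts, student id written as "s" followed by digits.
FILENAME_PATTERN = re.compile(r"Homework[1-4]_[^_\s]+_[^_\s]+_s\d+\.pdf")


def looks_valid(filename: str) -> bool:
    """Return True if the file name matches the assumed pattern."""
    return FILENAME_PATTERN.fullmatch(filename) is not None
```

With the example from the rules, `homework_filename(1, "De Rossi", "Luigi Maria", "s123456")` returns `Homework1_DeRossi_LuigiMaria_s123456.pdf`, which `looks_valid` accepts.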

Homework Info:

  • Homeworks are not mandatory
  • Each Homework is worth at most 0.5 pt, for a maximum total of 2 pt
  • Each Homework is evaluated between 0 and 30, with the final score scaled into the [0, 0.5] pt range.
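
As a worked example of the scoring rule above, the sketch below converts a mark out of 30 into homework points. It assumes the scaling is linear, which the rule does not state explicitly; the function and constant names are hypothetical.

```python
MAX_RAW_SCORE = 30             # each homework is evaluated between 0 and 30
MAX_POINTS_PER_HOMEWORK = 0.5  # each homework is worth at most 0.5 pt


def homework_points(raw_score: float) -> float:
    """Scale a 0-30 mark into the [0, 0.5] pt range (assumed linear)."""
    if not 0 <= raw_score <= MAX_RAW_SCORE:
        raise ValueError(f"score must be between 0 and {MAX_RAW_SCORE}")
    return raw_score / MAX_RAW_SCORE * MAX_POINTS_PER_HOMEWORK


def total_bonus(raw_scores: list[float]) -> float:
    """Sum the points of all submitted homeworks (at most four)."""
    return sum(homework_points(score) for score in raw_scores)
```

Under this assumption, a mark of 24/30 yields about 0.4 pt, and four homeworks marked 30/30 reach the 2 pt maximum.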

Homework Discussion:

Homework Schedule:

Homework | Text | Files | Upload | Deadline
Homework1: DW, Extended SQL and MV | Text, Draft solution | - | uploaded before the end of November 18th, 2024 | to be delivered by November 26th, 2024 at 11.59 PM (UTC/GMT+1)
Homework2: Data mining | Text | Files | uploaded before the end of December 2nd, 2024 | to be delivered by December 11th, 2024 at 11.59 PM (UTC/GMT+1)
Homework3: The Optimizer | - | - | uploaded before the end of December 9th, 2024 | to be delivered by December 18th, 2024 at 11.59 PM (UTC/GMT+1)
Homework4: MongoDB | - | - | uploaded before the end of December 23rd, 2024 | to be delivered by January 5th, 2025 at 11.59 PM (UTC/GMT+1)