Data mining: concepts and algorithms
Lecturer
- Danilo Giordano (email: name dot surname at polito dot it)
Lectures
- Introduction to the course and introduction to Data Science, overview of data mining approaches
- Mon, Jan 11, 2021 14:30-17:30 – Virtual Classroom
- Association Rules: Introduction, Algorithms and Performance metrics
- Thu, Jan 14, 2021 14:30-17:30 – Virtual Classroom
- Classification, supervised learning: Introduction and Algorithms
- Mon, Jan 18, 2021 14:30-17:30 – Virtual Classroom
- Classification, supervised learning: Algorithms, Validation and Performance metrics
- Thu, Jan 21, 2021 14:30-17:30 – Virtual Classroom
- Clustering, unsupervised learning: Introduction, Algorithms and Performance metrics
- Mon, Jan 25, 2021 14:30-17:30 – Virtual Classroom
- Seminar on: Big Data and Advanced Machine Learning techniques
- Thu, Jan 28, 2021 14:30-17:30 – Virtual Classroom
- Hands on the data with RapidMiner and Exam
-
- Mon, Feb 01, 2021 14:30-16:30 – Virtual Classroom
-
Given the COVID-19 emergency the entire course will be given online by using Politecnico Virtual Classroom.
For the exam, as in the past years, it consists in a data analysis task by using RapidMiner plus an oral exam about the data analysis results and the course program.
Please be sure to download and install RapidMiner on your computer before the exam day. The free version is sufficient for the exam.
The latest RapidMiner Studio is available for download from: https://rapidminer.com/educational-program/ (you can apply for the educational licence).
Slides
- Data mining: Introduction (6 slides per page,2 slides per page)
- Data mining: Preprocessing (6 slides per page,2 slides per page)
- Summary Lecture 1 (Summary Lecture 1)
- Association Rules (6 slides per page,2 slides per page)
- Summary Lecture 2 (Summary Lecture 2)
- Classification (6 slides per page, 2 slides per page)
- Summary Lecture 3 (Summary Lecture 3)
- Summary Lecture 4 (Summary Lecture 4)
- Clustering (6 slides per page,2 slides per page)
- Introduction to Big Data (6 slides per page,2 slides per page)
- Introduction to Advanced Machine Learning techniques (6 slides per page, 2 slides per page)
Practice
- Data mining – Practice Text (new)
- Dataset: Users
- RapidMiner version 5.0 is available at the LABINF PCs – Manual: download
- Introduction to RapidMiner: 2 slides per page, 3 slides per page, 6 slides per page
- The latest RapidMiner Studio is available for download from: https://rapidminer.com/educational-program/ (you can apply for the educational licence)
Resources
- DatasetsUCI.zip (download)
- UCI Machine Learning Repository