Research Activities
The DataBase and Data Mining Group is a research group of the Department of Control and Computer Engineering of the Politecnico di Torino. The interests of the group span over all aspects of Data Science and Machine Learning.
Relevant topics for the group include, but are not limited to, the following areas: Explainability and Fairness in Machine Learning, Finance and Quantitative Trading, Natural Language Processing, Concept Drift Detection, Unsupervised Learning, Time series analytics, and stream processing, Sensor-based data analytics, Smart Cities, Big Data Processing and Analytics, Data Warehousing and Data Mining.
Use the buttons below to discover more.
The group is also involved in several research activities conducted within funded project. Use the button below to jump to the dedicated section.
Explainability, Fairness and Bias
Recent data mining and machine learning models are often considered black-boxes, as the process that led to certain output is undisclosed. The group puts effort into developing novel techniques to unveil the “reasoning” behind the models’ decisions for both structured (tables, records, tabular data) and unstructured (images, texts) data.
Finance and Quantitative Trading
Short Bio of the Area…
Natural Language Processing
Short Bio of the Area…
Concept Drift Detection
Automatically detection of prediction-quality degradation of machine learning models due to class-based concept drift.
Publications
- Ventura et Al. – A new unsupervised predictive-model self-assessment approach that SCALEs.
- Cerquitelli et Al. – Towards a real-time unsupervised estimation of predictive model degradation.
- Cerquitelli et Al. – Automating concept-drift detection by self-evaluating predictive model degradation.
Data Science in Academy
We understand the importance of exposing students as early as possible to Data Science and Machine Learning topics. We thus propose several initiatives that let students learn by doing.
Machine Learning @ PolitTO
MAchine Learning At poliTO (MALTO) is a student team with the goal to take part in international data science shared tasks, projects, and competitions.
For more details, visit the team’s website.
Data Science Lab Environment
Data Science Lab Environment (DSLE) is a web platform to host data science competitions. It is currently used in the course Data Science Lab: process and methods to assess students’ abilities in solving classification, regression tasks.
Research Bites
Research Bites, a series of short research talks and seminaries held by PhD students and international faculty members for students of the course Data Science Lab: process and methods. The goal of RB is to disseminate cutting-edge research topics, in short, high-level pills.
Funded Projects
The group is focused on bringing high-quality research into funded projects of national and international nature.
SERENA – EU Project
A verSatilE plug-and-play platform enabling remote pREdictive mainteNAnce
Publications
- Proto et Al. – PREMISES, a scalable data-driven service to predict alarms in slowly-degrading multi-cycle industrial processes.
- Panicucci et Al. – A Cloud-to-Edge Approach to Support Predictive Analytics.
- Apiletti et Al. – iSTEP, an integrated Self-Tuning Engine for Predictive maintenance in Industry 4.0.
- Ventura et Al. – A new unsupervised predictive-model self-assessment approach that SCALEs.
Previous Research
The following is a comprehensive list of research and projects carried on during the years.
- Expressive generalized itemset mining
- Bioinformatics
- Large-scale itemset mining
- Sports data analysis
- Analysis of physiological data
- Analysis of sensor network data
- Index support for Itemset Mining in a Relational DBMS
- Associative classification
- Network traffic analysis
- Document summarization
- Social network analysis and mining
- Recommendation systems
- infrequent itemset mining
- ONTIC
- Discovering profitable stocks for intraday trading