{"id":2608,"date":"2011-11-17T10:36:07","date_gmt":"2011-11-17T09:36:07","guid":{"rendered":"http:\/\/dbdmg.polito.it\/wordpress\/?page_id=2608"},"modified":"2015-12-14T23:59:54","modified_gmt":"2015-12-14T22:59:54","slug":"data-mining-algorithms","status":"publish","type":"page","link":"https:\/\/dbdmg.polito.it\/wordpress\/theses\/data-mining-algorithms\/","title":{"rendered":"Data Mining Algorithms"},"content":{"rendered":"<h3>Disk-based algorithms to scale-up data mining task<\/h3>\n<h4>Tutors<\/h4>\n<p><a href=\"https:\/\/dbdmg.polito.it\/wordpress\/people\/tania-cerquitelli\/\">Tania Cerquitelli<\/a>, <a href=\"https:\/\/dbdmg.polito.it\/wordpress\/people\/silvia-chiusano\/\">Silvia Chiusano<\/a><\/p>\n<h3>\u00a0<span style=\"font-size: 1em; line-height: 19px;\">Issues<\/span><\/h3>\n<div>\n<ul>\n<li>Huge amount of data<\/li>\n<li>Most algorithms exploit ad-hoc main memory data structures to efficiently perform data mining\n<ul>\n<li>These approaches rely on the available physical memory and may run out of memory when the analysis is performed on very large databases<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<h4>Goal<\/h4>\n<\/div>\n<div>\n<ul>\n<li>design of disk-based structures and algorithms to efficiently support clustering algorithms<\/li>\n<li>data mining optimizer to automatically select appropriate\u00a0 algorithms and data access methods for frequent\u00a0 itemset mining<\/li>\n<li>text mining by exploiting data mining techniques\u00a0\u00a0 (e.g., clustering, association rules) in different application domain<\/li>\n<li>study of parallel and distributed algorithms to scale itemset mining<\/li>\n<\/ul>\n<\/div>\n\n<h3><!--nextpage--><\/h3>\n<h3>Generalized association rule mining with constraints<\/h3>\n<h4>Tutor<\/h4>\n<p><a href=\"https:\/\/dbdmg.polito.it\/wordpress\/people\/luca-cagliero\/\">Luca Cagliero<\/a>, <a href=\"https:\/\/dbdmg.polito.it\/wordpress\/people\/tania-cerquitelli\/\">Tania Cerquitelli<\/a> (email: name dot surname at polito dot it)<\/p>\n<p>&nbsp;<\/p>\n<h4>Description<\/h4>\n<ul>\n<li>Generalized association rule mining: extension of the traditional association rule mining problem in the presence of taxonomies (aggregation hierarchies) built over data items\n<ul>\n<li>Discovery of correlations among data at different abstraction levels<\/li>\n<li>High number of mined rules -&gt; high computational complexity<\/li>\n<\/ul>\n<\/li>\n<li>Constraints restrict the extracted rules to a subset of interest to ease domain expert analysis<\/li>\n<\/ul>\n<p><em>Applications:<\/em><\/p>\n<ul>\n<li>Study and implementation of novel generalized association rule mining algorithms with constraints<\/li>\n<li>Design of applications, based on generalized rule mining with constraints, to support knowledge discovery data coming different application contexts (e.g., network traffic data, mobile data, social network data)<\/li>\n<\/ul>\n<h3><!--nextpage--><\/h3>\n<p>&nbsp;<\/p>\n<h3><span style=\"font-size: 1.17em; line-height: 19px;\">An optimizer to support data mining activities \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0<\/span><\/h3>\n<h3><img loading=\"lazy\" decoding=\"async\" class=\"alignright\" style=\"border-style: initial; border-color: initial; vertical-align: middle;\" src=\"http:\/\/dbdmg.polito.it\/wordpress\/wp-content\/uploads\/2011\/11\/opt.png\" alt=\"\" width=\"94\" height=\"91\" \/><\/h3>\n<h4>Tutor<\/h4>\n<a href=\"https:\/\/dbdmg.polito.it\/wordpress\/people\/tania-cerquitelli\/\">Tania Cerquitelli<\/a>\n<h4>Description<\/h4>\n<ul>\n<li>\u00a0Association rule extraction\n<ul>\n<li>Frequent itemset extraction -&gt; computationally intensive<\/li>\n<li>Association rule generation from frequent itemsets<\/li>\n<\/ul>\n<\/li>\n<li>Research activity usually focuses on defining efficient algorithms for itemset extraction<\/li>\n<li>Different algorithms are suitable for different data distribution<\/li>\n<li>Some algorithms have been integrated into a DBMS Open Source kernel<\/li>\n<\/ul>\n<p><em>Applications:<\/em><\/p>\n<ul style=\"list-style-type: square;\">\n<li>Data mining optimizer to automatically select appropriate algorithms and data access methods for frequent itemset mining<\/li>\n<li>Design and develop a module (i.e., an optimizer), in case integrated into a DBMS Open Source kernel (e.g., PostgreSQL), which is able to select for each mining process the best algorithm for the current data distribution<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<br class=\"fixfloat\" \/>","protected":false},"excerpt":{"rendered":"<p>Disk-based algorithms to scale-up data mining task Tutors , \u00a0Issues Huge amount of data Most algorithms exploit ad-hoc main memory data structures to efficiently perform data mining These approaches rely on the available physical memory and may run out of memory when the analysis is performed on very large databases Goal design of disk-based structures<a href=\"https:\/\/dbdmg.polito.it\/wordpress\/theses\/data-mining-algorithms\/\">[&#8230;]<\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"parent":2369,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-2608","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/dbdmg.polito.it\/wordpress\/wp-json\/wp\/v2\/pages\/2608","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dbdmg.polito.it\/wordpress\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/dbdmg.polito.it\/wordpress\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/dbdmg.polito.it\/wordpress\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/dbdmg.polito.it\/wordpress\/wp-json\/wp\/v2\/comments?post=2608"}],"version-history":[{"count":37,"href":"https:\/\/dbdmg.polito.it\/wordpress\/wp-json\/wp\/v2\/pages\/2608\/revisions"}],"predecessor-version":[{"id":8784,"href":"https:\/\/dbdmg.polito.it\/wordpress\/wp-json\/wp\/v2\/pages\/2608\/revisions\/8784"}],"up":[{"embeddable":true,"href":"https:\/\/dbdmg.polito.it\/wordpress\/wp-json\/wp\/v2\/pages\/2369"}],"wp:attachment":[{"href":"https:\/\/dbdmg.polito.it\/wordpress\/wp-json\/wp\/v2\/media?parent=2608"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}