Introduction to data mining


Academic year 2018/2019

Course ID
MAT0051
Teaching staff
Roberto Esposito
Prof. Rosa Meo
Year
2nd year
Teaching period
First semester
Type
D.M. 270 TAF C - Related or integrative
Credits/Recognition
6
Course disciplinary sector (SSD)
INF/01 - Computer Science
Delivery
Formal authority
Language
English
Attendance
Optional
Type of examination
Written
Prerequisites
Databases and Algorithms, Programming

Course summary


Course objectives

The course aims to introduce students to the field of Data Mining and Machine Learning, which combines competencies from statistics and computer science.

The course will teach the difference between tasks and models and will introduce students to some popular topics in Machine Learning: binary classification and related tasks, the transformation of a binary classification model into a multi-class model, concept learning by means of logical formulas, tree models and their purposes, rule models, subgroup discovery, linear models (least squares, regression), the perceptron, Support Vector Machines, and kernel methods.

The course will also introduce the algorithms used to train the models.

The laboratory part of the course will introduce students to a practical open software suite that implements the learning algorithms for the models seen during the course (and many more).


Learning outcomes

Students are expected to master some of the main concepts in Data Mining and Machine Learning and to apply them using a practical open software suite for data analysis and machine learning.


Course delivery

The course lessons will be both theoretical and practical (laboratory sessions).

The classes in this course (6 CFU) are borrowed from the first part of a larger course (9 CFU), named Apprendimento Automatico (Machine Learning), at the Computer Science Department.
They comprise all the classes in the initial part of the larger course (overall, the first 24 classes of the complete course).

These classes are divided into 18 theoretical classes and 6 laboratory sessions.


Learning assessment methods

The final exam will be oral: students will be asked to show that they have mastered the theoretical lessons (knowledge of the models and of their purposes) and that they can use the practical software suite (Weka) for data analysis in some use cases.


Support activities

Machine learning experiments in the laboratory with a software suite for Data Mining.

The laboratory will provide practical support for the theoretical lessons through data analysis assignments on public datasets (UCI KDD Archive and Kaggle, a platform for Data Science challenges).


Program

Tasks and models; binary classification and related tasks; beyond binary classification (transformation of a binary classification model into a multi-class model); concept learning by means of logical formulas; version space; learning hypotheses by means of Horn clauses; tree models (decision trees, regression trees, feature trees, ranking trees); rule models (rule lists and rule sets); subgroup discovery; linear models (least squares, regression); perceptron; Support Vector Machines; kernel methods.

Suggested readings and bibliography





Class schedule


Note

This course is borrowed from Machine Learning and will be delivered at Apprendimento Automatico / Introduction to Data Mining, academic year 2018/19.

This course (6 CFU) includes the classes offered in the first part of a larger course (9 CFU), named Apprendimento Automatico (Machine Learning), held at the Computer Science Department.
It includes all the classes in the first part of the larger course (the first 24 classes of the complete course). The classes are divided into 18 theoretical classes and 6 laboratory sessions.

Last update: 10/05/2019 17:04