Vai al contenuto principale
Oggetto:
Oggetto:

Statistical machine learning

Oggetto:

Statistical machine learning

Oggetto:

Academic year 2016/2017

Course ID
MAT0043
Teacher
Prof. Antonio Canale
Year
2nd year
Teaching period
First semester
Type
D.M. 270 TAF C - Related or integrative
Credits/Recognition
6
Course disciplinary sector (SSD)
SECS-S/01 - statistica
Delivery
Class Lecture
Language
English
Attendance
Optional
Type of examination
Mixed
Oggetto:

Sommario del corso

Oggetto:

Course objectives

The course introduces methods and models to extract important patterns and trends from big amount of data, and presents basic concepts of machine learning and data mining from a statistical perspective. All the methods will be introduced from a theoretical point of view and implemented on real datasets in the R language.

Oggetto:

Results of learning outcomes

Knowledge and understanding 

  • Advances knowledge of parametric and nonparametric models for prediction and classification

Applying knowledge and understanding

  • Ability to convert various problems and data into statistical models to perform several type of  prediction/classification.

Making judgements

  • Students will be able to discern the different aspects of statistical learning in modern settings.

Communication skills

  • Students will properly use statistical language to comunicate the results of their findings.

Learning skills

  • The skills acquired will give students the opportunity of improving and deepening their knowledge of statistical modeling.
Oggetto:

Course delivery

Half of the lectures are devoted to the theorerical aspects of statistical machine learning and the remaining half to their practical implemetation in the R software considering both the related numerical and computational issues. Exercises will be assigned during  lectures and lab sessions. 

Oggetto:

Learning assessment methods

The exam consists of three parts: the first part is a written exam on theory; the second part is a practical session with R; the last part is an oral discussion.

Oggetto:

Program

Introduction

  • Context and motivations;
  • Trade-off between goodness-of-fit and model complexity (i.e. variance and bias);
  • Model selection techiniques (AIC, BIC, cross validation);
  • Training and test set; 

Regression

  • Variable selection and shrinkage
  • Elements of nonparametric regression
  • Structured nonparametric regression

Classification:

  • Logistic and multilogit regression;
  • Elements of nonparametric classification
  • Ensable techniques (bagging, boosting, random forest);

Miscellanea:

  • Tools for data visualization;
  • Computational tools (parallel computing, recursive estimations);

 

Suggested readings and bibliography

Oggetto:

  • AZZALINI, SCARPA. Data analysis and data mining . Oxford University Press
  • HASTIE, TIBSHIRANI AND FRIEDMAN. The elements of statistical learning: data mining, inference and prediction. Springer-Verlag.


Oggetto:

Class schedule

DaysTimeClassroom
Wednesday16:00 - 18:00Aula 12 - Edificio Storico Polo di Management ed Economia
Thursday11:15 - 13:15Aula 13 - Edificio Storico Polo di Management ed Economia
Thursday16:00 - 18:00Aula 10 - Edificio Storico Polo di Management ed Economia

Lessons: dal 27/09/2016 to 09/12/2016

Oggetto:

Note

This course will be delivered at the ESOMAS Department.

Oggetto:
Last update: 26/11/2016 16:07
Non cliccare qui!