Statistical Foundations of Learning (CIT4230004)

This course replaces the module IN2378, which will no longer be offered.

This course introduces students to a statistical perspective on machine learning and provides the mathematical tools to analyze the performance of machine learning algorithms.

The first part of the course introduces concepts from statistical learning theory and other foundational results for machine learning. In particular, the following topics are covered:

  • Risk minimization, Bayes risk, and empirical risk (see the definitions sketched after this list)
  • Statistical consistency and universal consistency
  • Vapnik-Chervonenkis (VC) theory of generalization
  • PAC learning and the no-free-lunch theorem
  • Algorithmic stability
  • Universal approximation theorem
  • Boosting
  • The above techniques will be used to study the generalization of the nearest-neighbor rule, support vector machines, and neural networks
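
To fix notation for the first bullet, the standard definitions read roughly as follows (the notation here is ours, not necessarily the course's). Given a loss \ell and data (X, Y) \sim P, the risk of a predictor f, the Bayes risk, and the empirical risk on a sample (X_1, Y_1), \dots, (X_n, Y_n) are

  R(f) = \mathbb{E}_{(X,Y) \sim P}[\ell(f(X), Y)], \quad R^{*} = \inf_{f} R(f), \quad \hat{R}_{n}(f) = \frac{1}{n} \sum_{i=1}^{n} \ell(f(X_i), Y_i).

Empirical risk minimization selects \hat{f}_n minimizing \hat{R}_n over a hypothesis class, and much of the first part concerns how close R(\hat{f}_n) gets to R^{*}.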

The second part of the course covers some advanced and recent topics in the theory of learning:

  • Approximation bounds for clustering
  • Theory of over-parametrized models
  • Training dynamics of neural networks, including the neural tangent kernel (sketched after this list)
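
As a minimal sketch of the last item (again in our own notation): for a network f(x; \theta) with parameters \theta, the neural tangent kernel is

  \Theta(x, x') = \langle \nabla_{\theta} f(x; \theta), \nabla_{\theta} f(x'; \theta) \rangle,

and in the infinite-width limit this kernel remains essentially constant during gradient-descent training, so the dynamics reduce to kernel regression with \Theta (the setting of the Arora et al. paper listed below).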

A concluding lecture will present some current challenges and recent work in the statistical foundations of learning.

Previous Knowledge Expected

Machine learning (IN2064 or equivalent); Discrete probability theory (IN0018); Analysis for informatics (MA0902)

Some background in statistics (MA2402 or equivalent) could be helpful.

Languages of Instruction

English

Recommended Reading

The first part of the course will mainly follow parts of the following textbooks:

  • Shai Shalev-Shwartz and Shai Ben-David. Understanding machine learning: From theory to algorithms. Cambridge University Press, 2014.
  • Luc Devroye, László Györfi, and Gábor Lugosi. A probabilistic theory of pattern recognition. Springer Science & Business Media, 2013.

The second part will be based on recent papers, such as:

  • David Arthur and Sergei Vassilvitskii. "k-means++: The advantages of careful seeding." Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), 2007.
  • Vincent Cohen-Addad, Varun Kanade, Frederik Mallmann-Trenn, and Claire Mathieu. "Hierarchical clustering: Objective functions and algorithms." Journal of the ACM 66(4):1-42, 2019.
  • Peter L. Bartlett, Andrea Montanari, and Alexander Rakhlin. "Deep learning: A statistical viewpoint." Acta Numerica 30:87-201, 2021.
  • Trevor Hastie, Andrea Montanari, Saharon Rosset, and Ryan J. Tibshirani. "Surprises in high-dimensional ridgeless least squares interpolation." The Annals of Statistics 50(2):949-986, 2022.
  • Sanjeev Arora, Simon S. Du, Wei Hu, Zhiyuan Li, Russ R. Salakhutdinov, and Ruosong Wang. "On exact computation with an infinitely wide neural net." Advances in Neural Information Processing Systems 32, 2019.

Additional references, if needed, will be provided during the lectures.