School of Engineering
ICT FOR INTERNET AND MULTIMEDIA
Course unit
PHYSICS DATA ANALYSIS
INP9086804, A.Y. 2019/20

Information concerning the students who enrolled in A.Y. 2019/20

Information on the course unit
Degree course Second cycle degree in
ICT FOR INTERNET AND MULTIMEDIA (Ord. 2019)
IN2371, Degree course structure A.Y. 2019/20, A.Y. 2019/20
Degree course track PHOTONICS [003PD]
Number of ECTS credits allocated 6.0
Type of assessment Mark
Course unit English denomination PHYSICS DATA ANALYSIS
Department of reference Department of Information Engineering
Mandatory attendance No
Language of instruction English
Branch PADOVA
Single Course unit The Course unit can be attended under the option Single Course unit attendance
Optional Course unit The Course unit can be chosen as Optional Course unit

Lecturers
Teacher in charge MARCO BAIESI FIS/02
Other lecturers GIOVANNI CARRARO FIS/05
MICHELA MAPELLI FIS/05
MARCO ZANETTI FIS/01

Mutuating
Course unit code | Course unit name | Teacher in charge | Degree course code
SCP8082526 | LABORATORY OF COMPUTATIONAL PHYSICS (MOD. B) | MARCO BAIESI | SC2443

ECTS: details
Type | Scientific-Disciplinary Sector | Credits allocated
Educational activities in elective or integrative disciplines | FIS/01 Experimental Physics | 6.0

Course unit organization
Period Second semester
Year 1st Year
Teaching method Face-to-face

Type of hours | Credits | Teaching hours | Hours of individual study | Shifts
Lecture | 6.0 | 48 | 102.0 | No shifts

Calendar
Start of activities 02/03/2020
End of activities 12/06/2020

Examination board
Examination board not defined

Syllabus

Common characteristics of the Integrated Course unit

Prerequisites: Although not strictly required, the course assumes that students have attended at least two physics laboratory classes during their bachelor's degree.
Target skills and knowledge: The didactic objective of this class is to teach the main data analysis techniques and their application to concrete physics problems.
The lectures will review the main methods to extract information from complex physics datasets. The students will be able to gather, summarise and visualise the statistically relevant features of a dataset; furthermore they will learn how to qualitatively and critically compare theoretical predictions with the experimental data.
This knowledge will be put into practice in lab sessions devoted to the analysis of datasets from various scientific areas, e.g. biophysics, astronomy and high energy physics.
Examination methods: To verify the students' proficiency in the subjects covered by this course, the written reports on the lab experiences will be evaluated; this evaluation will have to be confirmed by an oral exam, during which the students will also be questioned on what was taught during the lectures.
The oral exam will be split into two parts, each relevant to one of the two modules the class consists of.
Assessment criteria: The written reports on the lab experiences will have to meet the standards of a scientific publication. The data analysis will have to be tailored to the actual scientific problem being tackled and will have to demonstrate originality and mastery of the established methodology. During the oral exam, in addition to the critical review of the written reports, the comprehension of the fundamental concepts will be tested.

Specific characteristics of the Module

Course unit contents: 1. Introduction. Bias-Variance decomposition
2. Gradient descent methods
3. Linear regression: Ridge and LASSO
4. Logistic regression
5. Combining models: bagging, boosting, and random forests
6. Feed-forward deep neural networks: basics
7. Deep neural networks: regularization
8. Deep neural networks: examples
9. Clustering
10. Energy-based models
11. Restricted Boltzmann machines
12. Concluding examples
Planned learning activities and teaching methods: The aim of this course is to expose the students to modern tools for classifying data and machine learning techniques, so that they can apply those methods in lab experiences with computers. The first half of the course is reserved for this purpose of learning general principles via applications, while the second half of the course allows the students, in small groups, to develop a deeper understanding of one specific subject by carrying out a small project.

Each lesson of the first half of the course will consist of two phases: first a theoretical explanation of a key procedure for data analysis or of a class of algorithms, then a hands-on phase in which the students apply the new ideas on computers. This learning by practical experience is expected to improve the understanding of the theoretical tools and is in line with the classic methodology of lab teaching. The numerical analysis involves either adopting and modifying pre-built software or sketching simple algorithms from scratch.
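
As an illustration of what sketching a simple algorithm from scratch could look like for items 2 and 3 of the course contents, the following is a minimal Python sketch of plain gradient descent applied to ridge regression. It is not taken from the course material: the function names, the synthetic data, the learning rate and the regularization strength are all assumptions chosen for the example.

```python
import numpy as np


def ridge_gradient_descent(X, y, lam=0.1, lr=0.05, n_steps=5000):
    """Minimise (1/n) * ||X w - y||^2 + lam * ||w||^2 by plain gradient descent.

    lam, lr and n_steps are illustrative defaults, not recommended values.
    """
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(n_steps):
        # Gradient of the data term plus gradient of the L2 penalty.
        grad = (2.0 / n) * X.T @ (X @ w - y) + 2.0 * lam * w
        w -= lr * grad
    return w


def ridge_closed_form(X, y, lam=0.1):
    """Exact minimiser of the same cost, used only as a cross-check."""
    n, d = X.shape
    return np.linalg.solve(X.T @ X / n + lam * np.eye(d), X.T @ y / n)


# Synthetic data (hypothetical): a linear model with small Gaussian noise.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
w_true = np.array([1.5, -2.0, 0.5])
y = X @ w_true + 0.1 * rng.normal(size=200)

w_gd = ridge_gradient_descent(X, y)
w_exact = ridge_closed_form(X, y)
print("gradient descent:", w_gd)
print("closed form:     ", w_exact)
```

Comparing the iterative result with the closed-form solution is a simple way to check that the step size is small enough for the iteration to converge.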

The text mainly followed in the course is an open access review on the arXiv:
“A high-bias, low-variance introduction to Machine Learning for physicists” by Pankaj Mehta et al., arXiv:1803.08823.
This review also provides useful Python notebooks for analyzing data and is connected to tools such as the scikit-learn package.
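
To give a flavour of working with pre-built software, the following minimal sketch uses scikit-learn to compare Ridge and LASSO regression (item 3 of the course contents) on synthetic data. It is an illustrative assumption, not an excerpt from the review's notebooks: the dataset, the `alpha` values and the variable names are made up for the example.

```python
import numpy as np
from sklearn.linear_model import Lasso, Ridge

# Synthetic data (hypothetical): only the first 3 of 10 features matter.
rng = np.random.default_rng(1)
n_samples, n_features = 100, 10
X = rng.normal(size=(n_samples, n_features))
w_true = np.zeros(n_features)
w_true[:3] = [2.0, -1.0, 0.5]
y = X @ w_true + 0.1 * rng.normal(size=n_samples)

# Fit both models; the alpha values are arbitrary illustrative choices.
ridge = Ridge(alpha=1.0).fit(X, y)
lasso = Lasso(alpha=0.1).fit(X, y)

# Count coefficients that were set exactly to zero by each model.
n_zero_ridge = int(np.sum(ridge.coef_ == 0.0))
n_zero_lasso = int(np.sum(lasso.coef_ == 0.0))
print("ridge coefficients:", np.round(ridge.coef_, 3))
print("lasso coefficients:", np.round(lasso.coef_, 3))
print("exact zeros (ridge, lasso):", n_zero_ridge, n_zero_lasso)
```

The L1 penalty of LASSO tends to drive irrelevant coefficients exactly to zero, while the L2 penalty of Ridge only shrinks them, which is why the two zero counts typically differ.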
Textbooks (and optional supplementary readings)
  • Pankaj Mehta, Marin Bukov, Ching-Hao Wang, Alexandre G.R. Day, Clint Richardson, Charles K. Fisher, “A high-bias, low-variance introduction to Machine Learning for physicists”. Review available open access online: https://arxiv.org/abs/1803.08823

Innovative teaching methods: Teaching and learning strategies
  • Lecturing
  • Laboratory
  • Problem based learning
  • Working in group
  • Problem solving

Innovative teaching methods: Software or applications used
  • Moodle (files, quizzes, workshops, ...)
  • Python