- The course will take place on October 11-15 2021, from 10-12 and 14-16
- Course schedule here
- This repository contains material (notebooks and slides) for the INFN CCR Course 'Big Data and ML'
- Basic knowledge of python
- The course will take place remotely due to COVID restrictions.
- We will use zoom
- Please try it before the course starts, so that you're setup.
- Every day we will have a mixture of theoretical and hands-on sessions.
- please read this first
- point your browser to the link we provided you. It looks something like https://jhub.some.ip.address.myip.cloud.infn.it/
- on the first day, open a terminal and clone this repository:
git clone https://github.com/leggerf/MLCourse-INFN-2021.git
- The Jupyter notebook documentation can be found here
- We will be using python, in particular I advise you to get familiar with Jupyter notebooks, numpy and pandas before the course starts. No expert knowledge is required, but doing a couple of tutorials on these topics (easily found on the web) is highly recommended.
- The hands-on will be done on the INFN cloud. Please follow the instructions below and let us know if you have issues.
- if you are already an INFN cloud user, let us know. We need to add you to the resource group for this course
- if you never used INFN cloud, you need to register:
- Please follow these steps.
- in the notes, specify that you're asking for access to participate in the Big Data and ML course
- You will receive by email a link to your Jhub. The url looks something like https://jhub.some.ip.address.myip.cloud.infn.it/
- Click on the link, and accept any security exceptions you might get
- you will need to authenticate with IAM
- you will get to a page that says “Authentication required for …”, click on “Authorize”
- When you get to the server creation page, chooose the option: “Large - 4CPU 8GB”
- It might take a few minutes for the server to start
- in case you get timeout errors, and server creation fails, simply reload the link and try againL
Lunedi 11 ottobre
10:00 - 11:00 Introduction to big data science (Legger)
11:00 - 12:00 Feature extraction (Monaco)
14:00 - 15:00 Smart workflows and distributed computing beyond the grid paradigm (Vallero)
15:00 - 16:00 Esercitazioni (Legger, Vallero, Antonacci, Valentini)
Martedi 12 ottobre
10:00 - 11:00 Introduction to Machine Learning: Supervised models (Legger)
11:00 - 12:00 Introduction to Machine Learning: Unsupervised models (Monaco)
14:00 - 15:00 Distributed data beyond the grid paradigm (Vallero)
15:00 - 16:00 Esercitazioni (Legger, Vallero, Antonacci, Valentini)
Mercoledi 13 ottobre
10:00 - 12:00 Introduction to Deep Learning and neural networks (Legger)
14:00 - 15:00 Distributed infrastructures beyond the grid paradigm (Vallero)
15:00 - 16:00 Esercitazioni (Legger, Vallero, Antonacci, Valentini)
Giovedi 12 ottobre
10:00 - 12:00 Deep Learning architectures: CNN, Autoencoders, RNN (Diacono)
14:00 - 16:00 Esercitazioni (Diacono, Legger, Vallero)
Venerdi 15 ottobre
10:00 - 11:00 Introduction to INFN Cloud (Donvito)
11:00 - 12:00 Introduction to INFN Cloud services (Antonacci)
14:00 - 16:00 Discussione degli esercizi e lightning talks (Legger, Vallero, Antonacci, Valentini)