From Machine Learning to Deep Learning: A concise introduction

This HLRS course is aimed at students, data scientists, and researchers who would like an introduction to Machine Learning and Deep Learning methods for solving challenging, future-oriented problems. It presents Machine Learning and Deep Learning methods and examples, as well as a method for data compression. Examples are worked through in hands-on sessions on an HLRS cluster. Please be aware, however, that this course is not a beginner-to-advanced lecture series on the theoretical aspects of AI.

The first part will be an introduction to basic methods in Machine Learning, including pre-processing and supervised learning using Apache Spark. The course will then move on to elements of supervised Deep Learning on real data to classify annotated images of waste in the wild. Given the deluge of information needed to power machine and deep learning methods, it is imperative to think about effective data processing strategies. Therefore, the course will conclude with an introduction to data compression using the BigWhoop library (developed within EXCELLERAT P2). As an efficient data reduction tool, BigWhoop can be applied to generic numerical datasets to minimize I/O bottlenecks and optimize data storage. The lectures are interleaved with many hands-on sessions using Jupyter Notebooks and scripts on HLRS systems.
In addition, a guest lecture from the IAG will show how Deep Learning can be applied to problems in computational fluid dynamics.

 

Location

Online course
Organizer: HLRS, University of Stuttgart, Germany

Start

05. Jun 2023
08:45

End

07. Jun 2023
15:00

Language

English

Entry level

Basic

Subject areas

Data in HPC / Deep Learning / Machine Learning

Topics

Artificial Intelligence

Big Data

Deep Learning

Machine Learning

Scientific Machine Learning


Prerequisites and content levels

Prerequisites
  • Familiarity with Linux operating systems, including Linux shell (some parts of the training will use a cluster).
  • Access to an SSH client for remote access for the interactive portions of the training.
  • Technical background and basic understanding of machine learning concepts will be helpful.
  • Prior experience with Python is required. Because the course uses Python, the linked tutorial can be used to learn the syntax.
  • For the second day, familiarity with TensorFlow will be a plus, as all hands-on sessions will use TensorFlow. For those who would like to try TensorFlow in advance, the TensorFlow tutorial will be a great help.
Content levels
  • Basic: 10:30 hours
  • Intermediate: 2:30 hours
  • Advanced: 1 hour
  • Community: 4 hours

Learn more about course curricula and content levels.

Learning outcomes

After this course, participants will

  • have a basic understanding of classical Machine Learning and Deep Learning (DL) concepts and methods,
  • have gained practical experience in applying these methods,
  • and know how to use HLRS's systems for certain ML or DL tasks.

Instructors

Dr. Khatuna Kakhiani, Patrick Vogler and Dr.-Ing. Lorenzo Zanon (HLRS), and Anna Schwarz (IAG).

Agenda

(preliminary)

08:45 - 09:00 every day: drop in to Zoom

Day 1: Focus on Pre-processing, Feature Engineering and Machine Learning (9:00 - 17:00, Dr.-Ing. Lorenzo Zanon)

The first day will be based on the “Stuttgart S-Bahn Example” (originally developed by Dennis Hoppe, HLRS) to provide an introduction to Machine Learning. The focus is on data preparation and on classification and regression algorithms in supervised learning: Can these tools help to improve the travel experience on the Stuttgart S-Bahn, and what are their limits? Apache Spark will be used for the hands-on sessions, both in Jupyter Notebooks and in scripts run as interactive jobs. Finally, we will also touch upon the visualisation of results.
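As a rough illustration of the data preparation and regression steps mentioned above, the following minimal sketch standardizes one feature and fits a straight line by ordinary least squares. It is written in plain Python rather than Apache Spark, the numbers are invented toy values (not taken from the S-Bahn dataset), and the names `standardize`, `fit_line`, and `predict` are hypothetical.

```python
from statistics import mean


def standardize(xs):
    """Scale a feature to zero mean and unit variance (a common pre-processing step)."""
    mu = mean(xs)
    sigma = (sum((x - mu) ** 2 for x in xs) / len(xs)) ** 0.5
    return [(x - mu) / sigma for x in xs], mu, sigma


def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    mx, my = mean(xs), mean(ys)
    covariance = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    variance = sum((x - mx) ** 2 for x in xs)
    slope = covariance / variance
    return slope, my - slope * mx


# Invented toy data: passengers waiting on a platform vs. train delay in minutes.
passengers = [50, 100, 150, 200, 250]
delay_min = [1.0, 2.0, 3.0, 4.0, 5.0]

# Pre-processing: standardize the feature, then train the regression model.
x_std, mu, sigma = standardize(passengers)
slope, intercept = fit_line(x_std, delay_min)


def predict(n_passengers):
    """Apply the same scaling as in training, then evaluate the fitted line."""
    return slope * (n_passengers - mu) / sigma + intercept


print(f"predicted delay for 300 passengers: {predict(300):.2f} min")
```

The scaling parameters `mu` and `sigma` are kept from training and reused at prediction time, which is the usual way to avoid leaking information from new data into the pre-processing step.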

Day 2: Focus on data processing, ANN models, and supervised Deep Learning to classify images of waste in the wild (9:00 - 17:30, Dr. Khatuna Kakhiani)

During this day, participants will explore how Deep Learning can be used to classify waste in the wild. After a brief introduction to Deep Learning and to the basic concepts and building blocks of Deep Neural Networks, participants will learn how to:

  • Implement a common deep learning workflow for image classification
  • Process data, experiment with network structure and training parameters
  • Deploy a neural network to classify images
  • Visualize results

Upon completion, participants will be able to solve classification problems with CNNs on other custom datasets. The hands-on training uses Jupyter Notebooks, scripts run as interactive jobs, and TensorFlow.

Day 3:

  • Guest Lecture: Towards Data-Driven Computational Fluid Dynamics (9:00 - 11:30, Anna Schwarz, IAG)
  • Data Compression of numerical data sets with the BigWhoop library (12:30-15:00, Patrick Vogler, HLRS)

On the third day we start with the guest lecture "Towards Data-Driven Computational Fluid Dynamics". It will be given by Anna Schwarz, Institute of Aerodynamics and Gas Dynamics, University of Stuttgart.

We will conclude the half-day with an introduction to data compression, focusing on the various methods available to us for the efficient size reduction of our training data. Special attention will be paid to which approaches are best suited for different data types and what impact the different approaches and compression rates have on the quality of the datasets. The compression library BigWhoop and its accompanying command line tool will be made available for the hands-on sessions.
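The trade-off between compression rate and data quality described above can be illustrated with a minimal sketch. It does not use BigWhoop or its API. Instead, it assumes a much simpler stand-in technique (uniform quantization followed by lossless zlib compression) and a synthetic sine signal in place of real numerical data. The names `compress`, `decompress`, and `step` are hypothetical.

```python
import math
import struct
import zlib


def compress(values, step):
    """Lossy step: round each value to a multiple of `step`; then zlib the integer codes."""
    codes = [round(v / step) for v in values]
    raw = struct.pack(f"<{len(codes)}i", *codes)  # 32-bit little-endian integers
    return zlib.compress(raw, 9)


def decompress(blob, step):
    """Invert compress(): unpack the integer codes and rescale them."""
    raw = zlib.decompress(blob)
    codes = struct.unpack(f"<{len(raw) // 4}i", raw)
    return [c * step for c in codes]


# Synthetic smooth signal standing in for a numerical dataset (an assumption).
signal = [math.sin(0.01 * i) for i in range(10_000)]
original_size = len(signal) * 8  # bytes if stored as 64-bit floats

results = []
for step in (1e-4, 1e-3, 1e-2):
    blob = compress(signal, step)
    restored = decompress(blob, step)
    max_error = max(abs(a - b) for a, b in zip(signal, restored))
    ratio = original_size / len(blob)
    results.append((step, ratio, max_error))
    print(f"step={step:g}  ratio={ratio:.1f}  max_error={max_error:.2e}")
```

In this scheme the quantization step bounds the reconstruction error by half a step, so a coarser step typically buys a higher compression ratio at the cost of accuracy. Which error level is acceptable depends on the data type and on the downstream use of the dataset.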

The lunch break will be from 13:00 to 14:00 on the first two days.

Exercises

The exercises will be carried out on HLRS's systems using Jupyter Notebooks.

Registration information

Register via the button at the top of this page.
We encourage you to register for the waiting list if the course is full. Places might become available.

Registration closes on May 22, 2023.

Late registrations after that date might still be possible, depending on the course capacity.

Fees

Students without a Diploma/Master's degree: 30 EUR
PhD students or employees at a German university or public research institute: 60 EUR
PhD students or employees at a university or public research institute in an EU, EU-associated or PRACE country other than Germany: 120 EUR
PhD students or employees at a university or public research institute outside of EU, EU-associated or PRACE countries: 240 EUR
Other participants, e.g., from industry, other public service providers, or government: 600 EUR

Our course fee includes coffee breaks (in classroom courses only).

For lists of EU and EU-associated countries and of PRACE countries, see the Horizon Europe and PRACE websites.

HLRS Training Collaborations in HPC

HLRS is part of the Gauss Centre for Supercomputing (GCS), together with JSC in Jülich and LRZ in Garching near Munich. EuroCC@GCS is the German National Competence Centre (NCC) for High-Performance Computing. HLRS is also a member of the Baden-Württemberg initiative bwHPC.

This course is provided within the framework of the bwHPC training program.

EXCELLERAT P2

This course is partly realised in cooperation with the Centre of Excellence EXCELLERAT P2 (funded by the European Union, grant agreement No 101092621). See also the EXCELLERAT Service Portal for more information.

Contact

Tobias Haas, phone 0711 685 87223, tobias.haas(at)hlrs.de

Further courses

See the training overview and the Supercomputing Academy pages.