In this tutorial, we will introduce R users to the advantages of working on R on a High Performance Computing cluster. We will provide an overview of the most common Machine Learning methods and then look into how exactly we can explore their parallelization for the purposes of speeding up the run time. We will also show how some of the benchmarking packages in R work. In the end, the participants will have the opportunity to do it all themselves in the Hands-on Session.
Title: R on HPC – Introduction
Last Session: New dates will be announced soon.
Target Group: HPC Basics / HPC User
Format: Tutorial, online
- Accessing R and RStudio on our HPC system
- Overview of some of the main Machine Learning models (e.g. Linear and Logistic regression, Random Forest, etc.)
- Introduction to model benchmarking in R
- Introduction to parallelization in R: data-based and model-based
- Hands-on Session: Exercises
The course material (slides, sample application) will be available.
Participants should have an understanding of Machine Learning methods and basic experience in using R. We recommend attending ML-HPC-B NHR Tutorial in advance or familiarize with Taurus and its compendium page.
Participants will understand the application of main Machine Learning methods in R and be aware of corresponding issues. Further, they will know more about the implementation of parallelization and benchmarking of Machine Learning models in R on an HPC cluster using specific examples.
Check out the other trainings by ScaDS.AI Dresden/Leipzig.