This tutorial focuses on accessing and running R on a High Performance Computing (HPC) system. Since the motivation to switch to an HPC system can be manifold, e.g. due to large memory requirements, GPU usage or increase of computation speed, this training will introduce R users to working in the HPC environment. We also provide an overview of selected Machine Learning methods and show how to work interactively or submit batch jobs. In the end, the participants will have the opportunity to do it all themselves in the Hands-On Session.
Course Details
Title: R on HPC – Introduction
Speakers: Dr. Iryna Okhrin, Neringa Jurenaite
Next Session: 28.04.2023, 10 a.m. – 3 p.m.
Target Group: HPC Basics / HPC User
Language: English
Format: Online Tutorial. The room link will be announced after registration.
Registration: https://events.scads.ai/e/r_hpc
Participation is free of charge.
Add this event to your calendar (iCal).
Agenda
- Accessing R and RStudio on our HPC system
- Overview of some of the main Machine Learning models (e.g. Linear and Logistic regression, Random Forest, etc.)
- Introduction to model benchmarking in R
- Introduction to parallelization in R: data-based and model-based
- Hands-on Session: Exercises
Handouts
The course material (slides, sample application) will be available.
Prerequisites
Participants should have an understanding of Machine Learning methods and basic experience in using R. We recommend attending our Machine Learning on HPC – Introduction tutorial in advance or familiarize with Taurus and its compendium page.
Learning Outcomes
Participants will understand the application of main Machine Learning methods in R and be aware of corresponding issues. Further, they will know more about the implementation of parallelization and benchmarking of Machine Learning models in R on an HPC cluster using specific examples.
Do you have any questions about this tutorial? Don’t hesitate to contact our team!
Check out the other trainings by ScaDS.AI Dresden/Leipzig.