06: High-Performance Computing (HPC)

Although today’s handy laptops perform many advanced and computationally intensive tasks, projects involving Big Data require significantly more resources. That need is satisfied by the HPC infrastructure, built from a network of computing clusters combined with immense memory. Access to these resources is remote, so job submission and data preview occurs through an interface on any local computing machine from any (allowed) geolocation. The HPC infrastructure is a shared community space, so you might want to familiarize yourself with the usage policy to avoid disrupting peer work.

6. Introduction to GNU Parallel

7. Introduction to Containers

7.1 Apptainer
7.2 Docker

Homepage Prior Section Next Section

06: High-Performance Computing (HPC)

Aleksandra Badaczewska

Table of contents

1. Introduction to HPC infrastructure

2. Remote Access to HPC Resources

3. Setting up Your Home Directory for Data Analysis

4. Software Available on HPC

5. Introduction to Job Scheduling

6. Introduction to GNU Parallel

7. Introduction to Containers