Browse the workbook contents

Collection size: 202

Structured List

article

Data Science Principles

article

Applications of Computational Science

intro

Computational Biology

intro

Geospatial Analytics

intro

Machine Learning

article

Operating System (OS) installation

tutorial

Unix command line tools on macOS

tutorial

Unix command line tools on Windows

tutorial

Configuring your Linux system

intro

Must-have software

article

Basic office software

article

Basic developer tools

article

Basic developer libraries

intro

Various methods of software installation

tutorial

Installations on MacBook Pro

tutorial

Installations on Windows

tutorial

Installations on Linux

tutorial

Terminal - an interface for command-line operations

tutorial

Introduction to UNIX Shell: configuration, variables, home dir

tutorial

Basic Commands: navigation, file creation & preview

tutorial

Command Line text files editors: nano, vim

tutorial

System info and access permissions

tutorial

Superuser command: sudo

tutorial

Getting started with UNIX + video + exercises

intro

Useful text manipulation programs

tutorial

GREP – simple search for regular expressions

tutorial

SED – replace pattern in stream text

tutorial

AWK – advanced text processing

tutorial

BIOAWK – biological data manipulation

tutorial

DIFF, COMM, CMP – comparing and collating two files with Unix

cheatsheet

UNIX Commands

intro

Integrated & Interactive Development Environment (IDE)

tutorial

Visual Studio Code: multi-language integrated DE (VSC)

tutorial

Jupyter: interactive web-based multi-kernel DE

tutorial

Getting started with JupyterLab on a local machine

tutorial

Getting started with Jupyter Notebook on HPC systems

tutorial

Sharing Jupyter-based reproducible pipline via MyBinder

intro

Python programming environment(s)

tutorial

Online Console: simple Python code in a browser (without installation)

tutorial

Online Notebook: interactive Python on Try-jupyter and Colab (no installation)

tutorial

Local Python setup on your computing machine (installation needed)

tutorial

Shell & IDLE: Python code in a terminal or simple IDE (beginner)

tutorial

Text editors: create Python code in terminal text files (intermediate)

tutorial

Jupyter Lab: create an interactive Python notebook (advanced)

tutorial

PyCharm: IDE for professional Python developers (professional)

intro

R programming environment(s)

tutorial

RStudio: integrated environment for R programming

tutorial

Setting up RStudio

intro

Basics of Algorithm Structure

tutorial

Introduction to Bash scripting

tutorial

Process multiple files

tutorial

Introduction to regular expression

tutorial

Introduction to Python programming

tutorial

Open-Close & Read-Write of data files

tutorial

JSON module - encoding & decoding JSON data

tutorial

Math module - various mathematical functions

tutorial

Pandas library - data structure manipulation tool

tutorial

NumPy library - multi-dimensional arrays parser

tutorial

SciPy library - algorithms for scientific computing

tutorial

Introduction to R programming

tutorial

Base R - core functionality for statistical computing

tutorial

Base R Graphics - traditional statistical graphing

tutorial

ggplot2 - customizable graphs and charts

tutorial

dplyr - data manipulation and transformation

tutorial

tidyverse - advanced data manipulation, exploration and visualization

tutorial

data.table - aggregation and manipulation of large data sets

tutorial

Introduction to Julia programming

tutorial

Julia setup: installation, environments and Jupyter integration

intro

Introduction to HPC infrastructure

intro

ACCESS - an advanced computing and data resource program

tutorial

XSEDE supercell storage

intro

SCINet Scientific Computing: HPC, high-speed networking and training

tutorial

SCINet: Atlas computing cluster

tutorial

SCINet: Ceres computing cluster

tutorial

SCINet: Juno storage

intro

ISU HPC Network

tutorial

ISU HPC: Condo computing cluster

tutorial

ISU HPC: Nova computing cluster

tutorial

ISU HPC: LSS system

intro

Remote Access to HPC resources

tutorial

Virtual Private Network (VPN) Connection (command line)

tutorial

Secure Shell Connection (SSH) (command line)

tutorial

SSH shortcuts and password-less login

tutorial

Open On Demand (OOD) Connection (web-based GUI)

tutorial

Setting up your home directory for data analysis

tutorial

Example .bashrc file configuration

tutorial

Software Available on HPC

tutorial

Accessing pre-installed modules

tutorial

Accessing software via package manager

tutorial

Installing custom programs in user space

tutorial

Introduction to job scheduling

tutorial

SLURM: basics of workload manager

tutorial

Introduction to SLURM

cheatsheet

SLURM commands

tutorial

Creating SLURM job submission scripts

tutorial

Submitting dependency jobs using SLURM

tutorial

PBS: Portable Batch System

cheatsheet

PBS commands

tutorial

Creating PBS job submission scripts

tutorial

Submitting dependency jobs using PBS

tutorial

Introduction to GNU parallel

intro

Introduction to containers

tutorial

Apptainer: the container system for secure HPC (singularity rebranding)

tutorial

Singularity (archival)

tutorial

Creating containers using Singularity

tutorial

Modifying existing containers

tutorial

Singularity on your Mac via Vagrant

tutorial

Docker - accelerated container application development

tutorial

Remote data access

intro

Remote data transfer

tutorial

Copying Data using Graphical Interface: Globus

tutorial

Copying Data via SSH using Command Line: scp, rsync

tutorial

File transfer using irods

intro

Remote data download

tutorial

Downloading Online Data using WGET

tutorial

Downloading online data using API

tutorial

Downloading online data using Python-based web scraping

tutorial

Downloading online repos using GIT: [GitHub, Bitbucket, SourceForge]

tutorial

Downloading a single folder or file from GitHub

tutorial

Remote data preview (without downloading)

tutorial

Viewing text files using UNIX commands

tutorial

Viewing PDF and PNG files using X11 SSH connection

tutorial

Viewing graphics in a terminal as the text-based ASCII art

tutorial

Mounting remote folder on a local machine

intro

Data manipulation

intro

Manipulating Excel data sheets

tutorial

Create worksheet from multiple text files

tutorial

Export multiple worksheets as separate text files

tutorial

Create index for all worksheets

tutorial

Merge two spreadsheets using a common column

intro

Manipulating text files with Python

tutorial

Read, write, split, select data

tutorial

Data wrangling: use ready-made apps

tutorial

Merge files by common column (python)

tutorial

Aggregate data over slicing variations (python)

tutorial

Split data or create data chunks (python)

intro

Introduction to scientific graphic design

tutorial

Raster graphics tools

tutorial

Vector graphics tools

tutorial

Adobe creative cloud

tutorial

Template-based web tools

intro

Introduction to scientific graphing

tutorial

Gnuplot: creating plots in the UNIX Shell

tutorial

Gnuplot: variables, loops, conditionals

tutorial

Gnuplot: filled curves

tutorial

Plotly-Dash: interactive plotting with Python

tutorial

Introduction to Plotly (python)

tutorial

Introduction to Dash (python)

tutorial

Plotly graphing - interactive examples in the JupyterLab

tutorial

Creating XY scatter plot

tutorial

Creating 1D volcano plot

tutorial

Creating heatmap

tutorial

Creating dendrogram

tutorial

Creating clustergram

tutorial

RStudio – data processing & plotting with R

tutorial

Creating boxplots in R

tutorial

Creating heatmaps in R

tutorial

Creating heatmaps in R using ComplexHeatmap

article

Introduction to Project Management

intro

Overview of essential tools for modern research

article

Project strategy and risk management

tutorial

Project Design vs Project Framework

article

Collaboration & Communication

intro

Team communication tools

tutorial

Introduction to Slack

intro

Resource Management

intro

Data Management

tutorial

Open Science policy

tutorial

FAIR principles

tutorial

CARE principles

tutorial

TRUST principles

tutorial

Data Management Plan (hands-on)

intro

Code Development & Management

intro

Storage & Version Control

tutorial

GIT - a distributed version control system

cheatsheet

GIT Commands

intro

Online hosting platforms for GIT repositories

tutorial

Introduction to GitHub

tutorial

GitHub - getting started

tutorial

GitHub - getting started (class exercise)

tutorial

Introduction to BitBucket

intro

Documentation improvement tools

tutorial

A Practical guide to managing research documentation

tutorial

Tools for research documentation

tutorial

Tools for code documentation

tutorial

Introduction to Markdown

article

Quality Assurance

tutorial

Monitoring & Evaluation

tutorial

Enhancing productivity in research

tutorial

Reproducibility in research

tutorial

Research reproducibility guideline

intro

Project Closing

tutorial

Research Publication

Alphabetical List

01: Introduction to Data Science
02: Setting Up Computing Machine
03: Introduction to Command Line
04: Development Environment
05: Introduction to Programming
06: High-Performance Computing (HPC)
07: Data Acquisition and Wrangling
08: Data Visualization
09: Project Management
Code Development & Management
Data Management
Documentation improvement tools
Storage & Version Control
A Practical guide to managing research documentation
ACCESS - an advanced computing and data resource program
AWK – advanced text processing
Accessing pre-installed modules
Accessing software via package manager
Adobe creative cloud
Aggregate data over slicing variations
Applications of Computational Science
Apptainer: the container system for secure HPC
BIOAWK – biological data manipulation
Base R - core functionality for statistical computing
Base R Graphics - traditional statistical graphing
Basic Commands: navigation, file creation & preview
Basic developer libraries
Basic developer tools
Basic office software
Basics of Algorithm Structure
CARE principles
Collaboration & Communication
Command Line text files editors: nano, vim
Computational Biology
Configuring your Linux system
Copying Data using Graphical Interface: Globus
Copying Data via SSH using Command Line: scp, rsync
Create index for all worksheets
Create worksheet from multiple text files
Creating 1D volcano plot
Creating PBS job submission scripts
Creating SLURM job submission scripts
Creating XY scatter plot
Creating boxplots in R
Creating clustergram
Creating containers using Singularity
Creating dendrogram
Creating heatmap
Creating heatmaps in R
Creating heatmaps in R using ComplexHeatmap
DIFF, COMM, CMP – comparing and collating two files with Unix
Data Management Plan
Data Science Principles
Data manipulation
Data wrangling: use ready-made apps
Docker - accelerated container application development
Downloading Online Data using WGET
Downloading a single folder or file from GitHub
Downloading online data using API
Downloading online data using Python-based web scraping
Downloading online repos using GIT: [GitHub, Bitbucket, SourceForge]
Enhancing productivity in research
Example .bashrc file configuration
Export multiple worksheets as separate text files
FAIR principles
File transfer using irods
GIT - a distributed version control system
GIT Commands
GREP – simple search for regular expressions
Geospatial Analytics
Getting started with Jupyter Notebook on HPC systems
Getting started with JupyterLab on a local machine
Getting started with UNIX + video + exercises
GitHub - getting started
GitHub - getting started (class exercise)
Gnuplot: creating plots in the UNIX Shell
Gnuplot: filled curves
Gnuplot: variables, loops, conditionals
ISU HPC Network
ISU HPC: Condo computing cluster
ISU HPC: LSS system
ISU HPC: Nova computing cluster
Installations on Linux
Installations on MacBook Pro
Installations on Windows
Installing custom programs in user space
Integrated & Interactive Development Environment (IDE)
Introduction to Bash scripting
Introduction to BitBucket
Introduction to Dash
Introduction to GNU parallel
Introduction to GitHub
Introduction to HPC infrastructure
Introduction to Julia programming
Introduction to Markdown
Introduction to Plotly
Introduction to Project Management
Introduction to Python programming
Introduction to R programming
Introduction to SLURM
Introduction to Slack
Introduction to UNIX Shell: configuration, variables, home dir
Introduction to containers
Introduction to job scheduling
Introduction to regular expression
Introduction to scientific graphic design
Introduction to scientific graphing
JSON module - encoding & decoding JSON data
Julia setup: installation, environments and Jupyter integration
Jupyter Lab: create an interactive Python notebook
Jupyter: interactive web-based multi-kernel DE
Local Python setup on your computing machine
Machine Learning
Manipulating Excel data sheets
Manipulating text files with Python
Math module - various mathematical functions
Merge files by common column
Merge two spreadsheets using a common column
Modifying existing containers
Monitoring & Evaluation
Mounting remote folder on a local machine
Must-have software
NumPy library - multi-dimensional arrays parser
Online Console: simple Python code in a browser
Online Notebook: interactive Python on Try-jupyter and Colab
Online hosting platforms for GIT repositories
Open On Demand (OOD) Connection
Open Science policy
Open-Close & Read-Write of data files
Operating System (OS) installation
Overview of essential tools for modern research
PBS commands
PBS: Portable Batch System
Pandas library - data structure manipulation tool
Plotly graphing - interactive examples in the JupyterLab
Plotly-Dash: interactive plotting with Python
Process multiple files
Project Closing
Project Design vs Project Framework
Project strategy and risk management
PyCharm: IDE for professional Python developers
Python programming environment(s)
Quality Assurance
R programming environment(s)
RStudio – data processing & plotting with R
RStudio: integrated environment for R programming
Raster graphics tools
Read, write, split, select data
Remote Access to HPC resources
Remote data access
Remote data download
Remote data preview
Remote data transfer
Reproducibility in research
Research Publication
Research reproducibility guideline
Resource Management
SCINet Scientific Computing: HPC, high-speed networking and training
SCINet: Atlas computing cluster
SCINet: Ceres computing cluster
SCINet: Juno storage
SED – replace pattern in stream text
SLURM commands
SLURM: basics of workload manager
SSH shortcuts and password-less login
SciPy library - algorithms for scientific computing
Secure Shell Connection (SSH)
Setting up RStudio
Setting up your home directory for data analysis
Sharing Jupyter-based reproducible pipline via MyBinder
Shell & IDLE: Python code in a terminal or simple IDE
Singularity
Singularity on your Mac via Vagrant
Software Available on HPC
Split data or create data chunks
Submitting dependency jobs using PBS
Submitting dependency jobs using SLURM
Superuser command: sudo
System info and access permissions
TRUST principles
Team communication tools
Template-based web tools
Terminal - an interface for command-line operations
Text editors: create Python code in terminal text files
Tools for code documentation
Tools for research documentation
UNIX Commands
Unix command line tools on Windows
Unix command line tools on macOS
Useful text manipulation programs
Various methods of software installation
Vector graphics tools
Viewing PDF and PNG files using X11 SSH connection
Viewing graphics in a terminal as the text-based ASCII art
Viewing text files using UNIX commands
Virtual Private Network (VPN) Connection
Visual Studio Code: multi-language integrated DE (VSC)
XSEDE supercell storage
data.table - aggregation and manipulation of large data sets
dplyr - data manipulation and transformation
ggplot2 - customizable graphs and charts
tidyverse - advanced data manipulation, exploration and visualization

Code Development & Management
Storage & Version Control
GIT - a distributed version control system
GIT Commands
Online hosting platforms for GIT repositories
Introduction to GitHub
GitHub - getting started
GitHub - getting started (class exercise)
Introduction to BitBucket
Documentation improvement tools
A Practical guide to managing research documentation
Tools for research documentation
Tools for code documentation
Quality Assurance
Monitoring & Evaluation
Enhancing productivity in research
Reproducibility in research
Research reproducibility guideline
Project Closing
Research Publication

Must-have software
Basic office software
Various methods of software installation
Collaboration & Communication
Team communication tools
Introduction to Slack

Unix command line tools on macOS
Unix command line tools on Windows
Configuring your Linux system
03: Introduction to Command Line
Terminal - an interface for command-line operations
Introduction to UNIX Shell: configuration, variables, home dir
Basic Commands: navigation, file creation & preview
Command Line text files editors: nano, vim
System info and access permissions
Superuser command: sudo
Getting started with UNIX + video + exercises
Useful text manipulation programs
GREP – simple search for regular expressions
SED – replace pattern in stream text
AWK – advanced text processing
BIOAWK – biological data manipulation
DIFF, COMM, CMP – comparing and collating two files with Unix
UNIX Commands
Shell & IDLE: Python code in a terminal or simple IDE
Text editors: create Python code in terminal text files
Basics of Algorithm Structure
Introduction to Bash scripting
Process multiple files
Introduction to regular expression
Julia setup: installation, environments and Jupyter integration
Introduction to HPC infrastructure
Secure Shell Connection (SSH)
SSH shortcuts and password-less login
Open On Demand (OOD) Connection
Setting up your home directory for data analysis
Example .bashrc file configuration
Software Available on HPC
Accessing pre-installed modules
Accessing software via package manager
Installing custom programs in user space
Introduction to job scheduling
SLURM: basics of workload manager
Introduction to SLURM
SLURM commands
Creating SLURM job submission scripts
Submitting dependency jobs using SLURM
PBS: Portable Batch System
PBS commands
Creating PBS job submission scripts
Submitting dependency jobs using PBS
Introduction to GNU parallel
Apptainer: the container system for secure HPC
Singularity
Creating containers using Singularity
Modifying existing containers
Singularity on your Mac via Vagrant
07: Data Acquisition and Wrangling
Remote data access
Remote data transfer
Copying Data using Graphical Interface: Globus
Copying Data via SSH using Command Line: scp, rsync
File transfer using irods
Remote data download
Downloading Online Data using WGET
Downloading online data using API
Downloading online data using Python-based web scraping
Remote data preview
Viewing text files using UNIX commands
Viewing PDF and PNG files using X11 SSH connection
Viewing graphics in a terminal as the text-based ASCII art
Mounting remote folder on a local machine
Data manipulation
Manipulating text files with Python
Read, write, split, select data
Data wrangling: use ready-made apps
Merge files by common column
Aggregate data over slicing variations
Split data or create data chunks

Must-have software
Various methods of software installation
Introduction to HPC infrastructure
ACCESS - an advanced computing and data resource program
XSEDE supercell storage
SCINet Scientific Computing: HPC, high-speed networking and training
SCINet: Atlas computing cluster
SCINet: Ceres computing cluster
SCINet: Juno storage
ISU HPC Network
ISU HPC: Condo computing cluster
ISU HPC: Nova computing cluster
ISU HPC: LSS system
Remote Access to HPC resources
Virtual Private Network (VPN) Connection
Secure Shell Connection (SSH)
SSH shortcuts and password-less login
Open On Demand (OOD) Connection
Software Available on HPC
Accessing pre-installed modules
Accessing software via package manager
Installing custom programs in user space
Introduction to job scheduling
SLURM: basics of workload manager
Introduction to SLURM
SLURM commands
Creating SLURM job submission scripts
Submitting dependency jobs using SLURM
PBS: Portable Batch System
PBS commands
Creating PBS job submission scripts
Submitting dependency jobs using PBS
Introduction to GNU parallel
Introduction to containers
Apptainer: the container system for secure HPC
Singularity
Creating containers using Singularity
Modifying existing containers
Singularity on your Mac via Vagrant
Docker - accelerated container application development
Remote data access
Remote data transfer
Copying Data using Graphical Interface: Globus
Copying Data via SSH using Command Line: scp, rsync
File transfer using irods
Remote data download
Downloading Online Data using WGET
Downloading online data using API
Downloading online data using Python-based web scraping
Downloading online repos using GIT: [GitHub, Bitbucket, SourceForge]
Downloading a single folder or file from GitHub
Remote data preview
Viewing text files using UNIX commands
Viewing PDF and PNG files using X11 SSH connection
Viewing graphics in a terminal as the text-based ASCII art
Mounting remote folder on a local machine
Enhancing productivity in research
Reproducibility in research
Research reproducibility guideline

Introduction to containers
Apptainer: the container system for secure HPC
Singularity
Creating containers using Singularity
Modifying existing containers
Singularity on your Mac via Vagrant
Docker - accelerated container application development

07: Data Acquisition and Wrangling
Remote data access
Remote data download
Downloading Online Data using WGET
Downloading online data using API
Downloading online data using Python-based web scraping
Downloading online repos using GIT: [GitHub, Bitbucket, SourceForge]
Downloading a single folder or file from GitHub

Data Management
Open Science policy
FAIR principles
CARE principles
TRUST principles
Data Management Plan
Documentation improvement tools
A Practical guide to managing research documentation
Tools for research documentation
Quality Assurance
Monitoring & Evaluation
Enhancing productivity in research
Reproducibility in research
Research reproducibility guideline
Project Closing
Research Publication

Jupyter Lab: create an interactive Python notebook
Open-Close & Read-Write of data files
JSON module - encoding & decoding JSON data
Math module - various mathematical functions
Pandas library - data structure manipulation tool
NumPy library - multi-dimensional arrays parser
SciPy library - algorithms for scientific computing
Introduction to R programming
Base R - core functionality for statistical computing
Base R Graphics - traditional statistical graphing
ggplot2 - customizable graphs and charts
dplyr - data manipulation and transformation
tidyverse - advanced data manipulation, exploration and visualization
data.table - aggregation and manipulation of large data sets
07: Data Acquisition and Wrangling
Data manipulation
Manipulating Excel data sheets
Create worksheet from multiple text files
Export multiple worksheets as separate text files
Create index for all worksheets
Merge two spreadsheets using a common column
Manipulating text files with Python
Read, write, split, select data
Data wrangling: use ready-made apps
Merge files by common column
Aggregate data over slicing variations
Split data or create data chunks

Remote data access
Remote data preview
Viewing text files using UNIX commands
Viewing PDF and PNG files using X11 SSH connection
Viewing graphics in a terminal as the text-based ASCII art
Mounting remote folder on a local machine
Manipulating Excel data sheets

01: Introduction to Data Science
Data Science Principles
Applications of Computational Science
Computational Biology
Geospatial Analytics
Machine Learning
07: Data Acquisition and Wrangling

SCINet: Juno storage
ISU HPC: LSS system
Data Management Plan

XSEDE supercell storage
SCINet: Atlas computing cluster
SCINet: Ceres computing cluster
SCINet: Juno storage
ISU HPC: Condo computing cluster
ISU HPC: Nova computing cluster
ISU HPC: LSS system
Software Available on HPC
Remote data access
Remote data transfer
Copying Data using Graphical Interface: Globus
Copying Data via SSH using Command Line: scp, rsync
File transfer using irods

Data Science Principles
BIOAWK – biological data manipulation
Introduction to Python programming
Open-Close & Read-Write of data files
JSON module - encoding & decoding JSON data
Math module - various mathematical functions
Pandas library - data structure manipulation tool
NumPy library - multi-dimensional arrays parser
SciPy library - algorithms for scientific computing
Introduction to R programming
Introduction to Julia programming
Julia setup: installation, environments and Jupyter integration
Downloading online data using API
Remote data preview
Viewing text files using UNIX commands
Viewing PDF and PNG files using X11 SSH connection
Viewing graphics in a terminal as the text-based ASCII art
Raster graphics tools
Vector graphics tools
Adobe creative cloud
Template-based web tools

Must-have software
Basic developer tools
Basic developer libraries
Various methods of software installation
Installations on MacBook Pro
Installations on Windows
Installations on Linux
04: Development Environment
Integrated & Interactive Development Environment (IDE)
Visual Studio Code: multi-language integrated DE (VSC)
Jupyter: interactive web-based multi-kernel DE
Getting started with JupyterLab on a local machine
Getting started with Jupyter Notebook on HPC systems
Sharing Jupyter-based reproducible pipline via MyBinder
Python programming environment(s)
Online Console: simple Python code in a browser
Online Notebook: interactive Python on Try-jupyter and Colab
Local Python setup on your computing machine
Shell & IDLE: Python code in a terminal or simple IDE
Text editors: create Python code in terminal text files
Jupyter Lab: create an interactive Python notebook
PyCharm: IDE for professional Python developers
R programming environment(s)
RStudio: integrated environment for R programming
Setting up RStudio
Julia setup: installation, environments and Jupyter integration
Storage & Version Control
GIT - a distributed version control system
GIT Commands
Online hosting platforms for GIT repositories
Introduction to GitHub
GitHub - getting started
GitHub - getting started (class exercise)
Introduction to BitBucket

TRUST principles

Must-have software
Getting started with JupyterLab on a local machine
Getting started with Jupyter Notebook on HPC systems
Sharing Jupyter-based reproducible pipline via MyBinder
SCINet Scientific Computing: HPC, high-speed networking and training
SCINet: Atlas computing cluster
SCINet: Ceres computing cluster
SCINet: Juno storage
ISU HPC Network
ISU HPC: Condo computing cluster
ISU HPC: Nova computing cluster
ISU HPC: LSS system
Overview of essential tools for modern research
Introduction to Slack
Data Management
Data Management Plan
Code Development & Management
Storage & Version Control
GIT - a distributed version control system
GIT Commands
Online hosting platforms for GIT repositories
Introduction to GitHub
GitHub - getting started
GitHub - getting started (class exercise)
Introduction to BitBucket
Documentation improvement tools
A Practical guide to managing research documentation
Tools for research documentation
Tools for code documentation
Introduction to Markdown
Quality Assurance
Monitoring & Evaluation
Enhancing productivity in research
Reproducibility in research
Research reproducibility guideline
Project Closing
Research Publication

Must-have software
Basic office software
Various methods of software installation
08: Data Visualization
Introduction to scientific graphic design
Raster graphics tools
Vector graphics tools
Adobe creative cloud
Template-based web tools
Introduction to scientific graphing
Gnuplot: creating plots in the UNIX Shell
Gnuplot: variables, loops, conditionals
Gnuplot: filled curves
Plotly-Dash: interactive plotting with Python
Introduction to Plotly
Introduction to Dash
Plotly graphing - interactive examples in the JupyterLab
Creating XY scatter plot
Creating 1D volcano plot
Creating heatmap
Creating dendrogram
Creating clustergram
RStudio – data processing & plotting with R
Creating boxplots in R
Creating heatmaps in R
Creating heatmaps in R using ComplexHeatmap

Jupyter: interactive web-based multi-kernel DE
Getting started with Jupyter Notebook on HPC systems
RStudio: integrated environment for R programming
Setting up RStudio
Julia setup: installation, environments and Jupyter integration
06: High-Performance Computing (HPC)
Introduction to HPC infrastructure
ACCESS - an advanced computing and data resource program
XSEDE supercell storage
SCINet Scientific Computing: HPC, high-speed networking and training
SCINet: Atlas computing cluster
SCINet: Ceres computing cluster
SCINet: Juno storage
ISU HPC Network
ISU HPC: Condo computing cluster
ISU HPC: Nova computing cluster
ISU HPC: LSS system
Remote Access to HPC resources
Virtual Private Network (VPN) Connection
Secure Shell Connection (SSH)
SSH shortcuts and password-less login
Open On Demand (OOD) Connection
Setting up your home directory for data analysis
Example .bashrc file configuration
Software Available on HPC
Accessing pre-installed modules
Accessing software via package manager
Installing custom programs in user space
Introduction to job scheduling
SLURM: basics of workload manager
Introduction to SLURM
SLURM commands
Creating SLURM job submission scripts
Submitting dependency jobs using SLURM
PBS: Portable Batch System
PBS commands
Creating PBS job submission scripts
Submitting dependency jobs using PBS
Introduction to GNU parallel
Introduction to containers
Apptainer: the container system for secure HPC
Singularity
Creating containers using Singularity
Modifying existing containers
Singularity on your Mac via Vagrant
07: Data Acquisition and Wrangling
Remote data access
Remote data transfer
Copying Data using Graphical Interface: Globus
Copying Data via SSH using Command Line: scp, rsync
Remote data preview
Viewing text files using UNIX commands
Viewing PDF and PNG files using X11 SSH connection
Viewing graphics in a terminal as the text-based ASCII art
Mounting remote folder on a local machine

ISU HPC: Condo computing cluster
ISU HPC: Nova computing cluster
Software Available on HPC
Accessing pre-installed modules
Introduction to job scheduling
SLURM: basics of workload manager
Introduction to SLURM
SLURM commands
Creating SLURM job submission scripts
Submitting dependency jobs using SLURM
PBS: Portable Batch System
PBS commands
Creating PBS job submission scripts
Submitting dependency jobs using PBS

Basic developer libraries
Online Notebook: interactive Python on Try-jupyter and Colab
Local Python setup on your computing machine
Jupyter Lab: create an interactive Python notebook
Setting up RStudio
Introduction to Python programming
JSON module - encoding & decoding JSON data
Math module - various mathematical functions
Pandas library - data structure manipulation tool
NumPy library - multi-dimensional arrays parser
SciPy library - algorithms for scientific computing
Introduction to R programming
Base R - core functionality for statistical computing
Base R Graphics - traditional statistical graphing
ggplot2 - customizable graphs and charts
dplyr - data manipulation and transformation
tidyverse - advanced data manipulation, exploration and visualization
data.table - aggregation and manipulation of large data sets
Software Available on HPC
Accessing pre-installed modules
Accessing software via package manager
Installing custom programs in user space
Introduction to GNU parallel
Apptainer: the container system for secure HPC
Singularity
Creating containers using Singularity
Modifying existing containers
Singularity on your Mac via Vagrant
Downloading online data using Python-based web scraping
Manipulating text files with Python
Read, write, split, select data
Data wrangling: use ready-made apps
Merge files by common column
Aggregate data over slicing variations
Split data or create data chunks
Plotly-Dash: interactive plotting with Python
Introduction to Plotly
Introduction to Dash
Plotly graphing - interactive examples in the JupyterLab
Creating XY scatter plot
Creating 1D volcano plot
Creating heatmap
Creating dendrogram
Creating clustergram
RStudio – data processing & plotting with R
Creating boxplots in R
Creating heatmaps in R
Creating heatmaps in R using ComplexHeatmap

Must-have software
Basic office software
Various methods of software installation

Introduction to HPC infrastructure
Introduction to GNU parallel

Jupyter: interactive web-based multi-kernel DE
Online Notebook: interactive Python on Try-jupyter and Colab
Jupyter Lab: create an interactive Python notebook
RStudio: integrated environment for R programming
Setting up RStudio
Introduction to R programming
Base R Graphics - traditional statistical graphing
ggplot2 - customizable graphs and charts
Accessing pre-installed modules
Manipulating Excel data sheets
08: Data Visualization
Introduction to scientific graphing
Gnuplot: creating plots in the UNIX Shell
Gnuplot: variables, loops, conditionals
Gnuplot: filled curves
Plotly-Dash: interactive plotting with Python
Introduction to Plotly
Introduction to Dash
Plotly graphing - interactive examples in the JupyterLab
Creating XY scatter plot
Creating 1D volcano plot
Creating heatmap
Creating dendrogram
Creating clustergram
RStudio – data processing & plotting with R
Creating boxplots in R
Creating heatmaps in R
Creating heatmaps in R using ComplexHeatmap

Must-have software
Basic developer libraries
04: Development Environment
Visual Studio Code: multi-language integrated DE (VSC)
Jupyter: interactive web-based multi-kernel DE
Getting started with JupyterLab on a local machine
Getting started with Jupyter Notebook on HPC systems
Sharing Jupyter-based reproducible pipline via MyBinder
Python programming environment(s)
Online Console: simple Python code in a browser
Online Notebook: interactive Python on Try-jupyter and Colab
Local Python setup on your computing machine
Shell & IDLE: Python code in a terminal or simple IDE
Text editors: create Python code in terminal text files
Jupyter Lab: create an interactive Python notebook
PyCharm: IDE for professional Python developers
R programming environment(s)
RStudio: integrated environment for R programming
Setting up RStudio
05: Introduction to Programming
Basics of Algorithm Structure
Introduction to Python programming
Open-Close & Read-Write of data files
JSON module - encoding & decoding JSON data
Math module - various mathematical functions
Pandas library - data structure manipulation tool
NumPy library - multi-dimensional arrays parser
SciPy library - algorithms for scientific computing
Introduction to R programming
Base R - core functionality for statistical computing
Base R Graphics - traditional statistical graphing
ggplot2 - customizable graphs and charts
dplyr - data manipulation and transformation
tidyverse - advanced data manipulation, exploration and visualization
data.table - aggregation and manipulation of large data sets
Introduction to Julia programming
Julia setup: installation, environments and Jupyter integration
Software Available on HPC
Accessing pre-installed modules
Accessing software via package manager
Installing custom programs in user space
Downloading online data using Python-based web scraping
Data manipulation
Manipulating text files with Python
Read, write, split, select data
Data wrangling: use ready-made apps
Merge files by common column
Aggregate data over slicing variations
Split data or create data chunks
Introduction to scientific graphing
Plotly-Dash: interactive plotting with Python
Introduction to Plotly
Introduction to Dash
Plotly graphing - interactive examples in the JupyterLab
Creating XY scatter plot
Creating 1D volcano plot
Creating heatmap
Creating dendrogram
Creating clustergram
RStudio – data processing & plotting with R
Creating boxplots in R
Creating heatmaps in R
Creating heatmaps in R using ComplexHeatmap

Must-have software
09: Project Management
Introduction to Project Management
Overview of essential tools for modern research
Project strategy and risk management
Project Design vs Project Framework
Collaboration & Communication
Team communication tools
Introduction to Slack
Resource Management
Data Management
Open Science policy
FAIR principles
CARE principles
TRUST principles
Data Management Plan
Code Development & Management
Storage & Version Control
GIT - a distributed version control system
GIT Commands
Online hosting platforms for GIT repositories
Introduction to GitHub
GitHub - getting started
GitHub - getting started (class exercise)
Introduction to BitBucket
Documentation improvement tools
A Practical guide to managing research documentation
Tools for research documentation
Tools for code documentation
Introduction to Markdown
Quality Assurance
Monitoring & Evaluation
Enhancing productivity in research
Reproducibility in research
Research reproducibility guideline
Project Closing
Research Publication

Must-have software
SCINet: Atlas computing cluster
SCINet: Ceres computing cluster
SCINet: Juno storage
ISU HPC: Condo computing cluster
ISU HPC: Nova computing cluster
ISU HPC: LSS system
Remote Access to HPC resources
Virtual Private Network (VPN) Connection
Secure Shell Connection (SSH)
SSH shortcuts and password-less login
Open On Demand (OOD) Connection
Software Available on HPC
07: Data Acquisition and Wrangling
Remote data access
File transfer using irods
Remote data download
Downloading Online Data using WGET
Downloading online data using API
Downloading online data using Python-based web scraping
Downloading online repos using GIT: [GitHub, Bitbucket, SourceForge]
Downloading a single folder or file from GitHub
Remote data preview
Viewing text files using UNIX commands
Viewing PDF and PNG files using X11 SSH connection
Viewing graphics in a terminal as the text-based ASCII art
Mounting remote folder on a local machine

Setting up your home directory for data analysis
Example .bashrc file configuration
File transfer using irods
Remote data download
Downloading Online Data using WGET
Downloading online data using API
Downloading online data using Python-based web scraping
Downloading online repos using GIT: [GitHub, Bitbucket, SourceForge]
Downloading a single folder or file from GitHub
Resource Management
Data Management
Open Science policy
FAIR principles
CARE principles
TRUST principles
Data Management Plan
Code Development & Management
Storage & Version Control
GIT - a distributed version control system
GIT Commands
Online hosting platforms for GIT repositories
Introduction to GitHub
GitHub - getting started
GitHub - getting started (class exercise)
Introduction to BitBucket
Documentation improvement tools
A Practical guide to managing research documentation
Tools for research documentation
Tools for code documentation
Introduction to Markdown

02: Setting Up Computing Machine
Operating System (OS) installation
Unix command line tools on macOS
Unix command line tools on Windows
Configuring your Linux system
Various methods of software installation
Installations on MacBook Pro
Installations on Windows
Installations on Linux
Introduction to UNIX Shell: configuration, variables, home dir
System info and access permissions
Superuser command: sudo
Setting up your home directory for data analysis
Example .bashrc file configuration

Command Line text files editors: nano, vim
Getting started with UNIX + video + exercises
Useful text manipulation programs
GREP – simple search for regular expressions
SED – replace pattern in stream text
AWK – advanced text processing
BIOAWK – biological data manipulation
DIFF, COMM, CMP – comparing and collating two files with Unix
UNIX Commands
Open-Close & Read-Write of data files
JSON module - encoding & decoding JSON data
Read, write, split, select data
Data wrangling: use ready-made apps
Merge files by common column
Aggregate data over slicing variations
Split data or create data chunks

Basic developer tools
Integrated & Interactive Development Environment (IDE)
Visual Studio Code: multi-language integrated DE (VSC)
PyCharm: IDE for professional Python developers
Code Development & Management
Storage & Version Control
GIT - a distributed version control system
GIT Commands
Online hosting platforms for GIT repositories
Introduction to GitHub
GitHub - getting started
GitHub - getting started (class exercise)
Introduction to BitBucket
Documentation improvement tools
A Practical guide to managing research documentation
Tools for research documentation
Tools for code documentation
Monitoring & Evaluation
Enhancing productivity in research
Reproducibility in research
Research reproducibility guideline
Project Closing
Research Publication

Basic developer libraries
Local Python setup on your computing machine
Jupyter Lab: create an interactive Python notebook
Julia setup: installation, environments and Jupyter integration
Installing custom programs in user space
Introduction to containers
Apptainer: the container system for secure HPC
Singularity
Creating containers using Singularity
Modifying existing containers
Singularity on your Mac via Vagrant
Docker - accelerated container application development
Data wrangling: use ready-made apps
Merge files by common column
Aggregate data over slicing variations
Split data or create data chunks
Plotly graphing - interactive examples in the JupyterLab
Creating XY scatter plot
Creating 1D volcano plot
Creating heatmap
Creating dendrogram
Creating clustergram

Jupyter: interactive web-based multi-kernel DE
Online Notebook: interactive Python on Try-jupyter and Colab
Jupyter Lab: create an interactive Python notebook
RStudio: integrated environment for R programming
Setting up RStudio
Viewing graphics in a terminal as the text-based ASCII art
Mounting remote folder on a local machine
Manipulating Excel data sheets
08: Data Visualization
Introduction to scientific graphic design
Raster graphics tools
Vector graphics tools
Adobe creative cloud
Template-based web tools
Introduction to scientific graphing
Gnuplot: creating plots in the UNIX Shell
Gnuplot: variables, loops, conditionals
Gnuplot: filled curves
Plotly-Dash: interactive plotting with Python
Introduction to Plotly
Introduction to Dash
Plotly graphing - interactive examples in the JupyterLab
Creating XY scatter plot
Creating 1D volcano plot
Creating heatmap
Creating dendrogram
Creating clustergram
RStudio – data processing & plotting with R
Creating boxplots in R
Creating heatmaps in R
Creating heatmaps in R using ComplexHeatmap

Browse the workbook contents

Structured List

Alphabetical List

Filter by Category: