Introduction

This is a guide for setup of a windows machine for bioinformatics data analysis. I list here some important tools useful for most bioinformatics analyses and the process to install them. Check out the Project Management guide.

General guide to software installation

Installing any software on Windows usually involves some common steps:

  • Download the installation file(s)
  • Open the downloaded setup file (usually an .exe file). The file will be located in downloads folder by default
  • Accept the default setting (click next, next, next..)
  • Finish installation

Tip: Just google ‘[software name] for windows’ to find installation files. Works for most programs.

Reset an old laptop

  • Get password from previous account holder to login
  • Reset Windows
    • Go to settings
    • Search: Reset this PC
    • Click ‘Get started’ or ‘Reset’ under ‘Reset this PC’
    • You can choose to keep the files or delete all data
  • Log in with new account
  • Download a better internet browser

Developer tools

Atom editor (now archived)

Try VSCode.

Atom is a text editor with git version control integrated in it and is compatible with Markdown syntax. It makes working with Git and GitHub more efficient.

Download ⤴ and install Atom following general guide to installation, and then install packages to improve function. To install packages on Atom, use the package manager in settings (File -> Settings -> Install).

Settings

There are many options in settings, for installing new packages click on install option:

Install Packages

Recommended packages:

  • markdown-folding
  • language-swift-89
  • markdown-pdf
  • minimap
  • wordcount
  • drag-relative-path
  • markdown-scroll-sync
  • autocomplete-python
  • autocomplete-swift
  • autocomplete-R

When installing a package, if you get the following error, just try installing the package again. Click here ⤴ for more info.

Request for package information failed: Application error

Important command

ctrl-shift-p opens command palette, where you can search all Atom commands

Git for Windows

Install Git ⤴

  • Click on the right version to download setup file, I selected 64-bit Git for Windows Setup

setup

  • Open downloaded file and install Git (with default options)
  • To set up a GitHub account check the GitHub tutorial

Linux on Windows

Windows Subsystem for Linux (WSL) is a compatibility layer for running Linux binary executables natively on Windows. Run either PowerShell or Command Prompt as an administrator. For example, you can find Command Prompt from the start menu and click on Run as an administrator.

cmd

or search it from the search box on lower left corner of screen on windows. The options also appear on right (without right click) if you use the search box.

startmenu

Type the following command and press enter to install WSL:

wsl --install

installwsl

Restart your computer to finish installation. This will install Ubuntu distribution by default but can be changed. It is possible to run multiple Linux distributions with WSL. For setting up the Linux system and more options, check Microsoft documentation here ⤴.

Programming Languages and IDE

If you are not familiar with command line, the best way to install and manage IDEs and programming languages like Python and R is through Anaconda Navigator. First, install Anaconda and Anaconda Navigator ⤴ on your computer. You can now install and then use different IDEs through Anaconda Navigator (search anaconda navigator from search bar). Alternatively, see following sections to download IDEs and programming languages independently (you may need to set path variables).

IDE

An Integrated Development Environment (IDE) is a program developed for coding/programming with your language of choice. There are several good IDE options available:

Python

Python is a high-level, general-purpose programming language. It is one of the best languages to learn for a beginner in coding.

R

R is a programming language for statistical computing and graphics. It is one of the most popular tools used for data mining, data modeling and generating graphics.

Install R here ⤴ locally and check out the tutorials for in this workbook for more details including installing R libraries.

Use RStudio and plot to test if R is working. Open RStudio, type following command in console and press enter.

plot(1:50)
On the right side, you should see a scatter plot like this:

Java

Java is also a High-level programming language.
Download Java ⤴

Note: If you are using Windows Subsystem for Linux (WSL), java installation on the Linux system will be different.
Type java in command line (Linux) to get installation commands such as following:

sudo apt install default-jre                 # version 2:1.11-72, or
sudo apt install openjdk-11-jre-headless     # version 11.0.7+10-3ubuntu1
sudo apt install openjdk-8-jre-headless      # version 8u252-b09-1ubuntu1

Perl

Perl is a High-level programming language.
Install Perl ⤴

Conda

Conda is an open-source package and environment manager. Miniforge ⤴ is a minimalistic installer for conda.

Docker

Docker is a platform used to deliver software in packages called containers. Check out the tutorials Docker basics and Intro to Containers for more details.

NextFlow

Check out the Intro to NextFlow ⤴ in our Bioinformatics Workbook for detailed information.

When using nextflow if you get following error:

nextflow: command not found

move nextflow in a directory into $PATH or use ./nextflow if it is in your current path.

More useful tools

Great tools to have on a computer

  • Notepad++ ⤴ is a text and source code editor for Windows
  • Windowgrid ⤴ allows the user to quickly and easily layout their windows on screen
  • Greenshot ⤴ is a screenshot tool with useful features