Introduction
This is a guide for setup of a windows machine for bioinformatics data analysis. I list here some important tools useful for most bioinformatics analyses and the process to install them. Check out the Project Management guide.
General guide to software installation
Installing any software on Windows usually involves some common steps:
- Download the installation file(s)
- Open the downloaded setup file (usually an .exe file). The file will be located in downloads folder by default
- Accept the default setting (click next, next, next..)
- Finish installation
Tip:
Just google ‘[software name] for windows’ to find installation files. Works for most programs.
Reset an old laptop
- Get password from previous account holder to login
- Reset Windows
- Go to settings
- Search: Reset this PC
- Click ‘Get started’ or ‘Reset’ under ‘Reset this PC’
- You can choose to keep the files or delete all data
- Log in with new account
- Download a better internet browser
Developer tools
Atom editor (now archived)
Try VSCode.
Atom is a text editor with git
version control integrated in it and is compatible with Markdown syntax. It makes working with Git and GitHub more efficient.
Download ⤴ and install Atom following general guide to installation, and then install packages to improve function. To install packages on Atom, use the package manager in settings (File
-> Settings
-> Install
).
There are many options in settings, for installing new packages click on install option:
Recommended packages:
- markdown-folding
- language-swift-89
- markdown-pdf
- minimap
- wordcount
- drag-relative-path
- markdown-scroll-sync
- autocomplete-python
- autocomplete-swift
- autocomplete-R
When installing a package, if you get the following error, just try installing the package again. Click here ⤴ for more info.
Request for package information failed: Application error
Important command
ctrl-shift-p
opens command palette, where you can search all Atom commands
Git for Windows
- Click on the right version to download setup file, I selected 64-bit Git for Windows Setup
- Open downloaded file and install Git (with default options)
- To set up a GitHub account check the GitHub tutorial
Linux on Windows
Windows Subsystem for Linux (WSL) is a compatibility layer for running Linux binary executables natively on Windows. Run either PowerShell or Command Prompt as an administrator. For example, you can find Command Prompt from the start menu and click on Run as an administrator.
or search it from the search box on lower left corner of screen on windows. The options also appear on right (without right click) if you use the search box.
Type the following command and press enter to install WSL:
wsl --install
Restart your computer to finish installation. This will install Ubuntu distribution by default but can be changed. It is possible to run multiple Linux distributions with WSL. For setting up the Linux system and more options, check Microsoft documentation here ⤴.
Programming Languages and IDE
If you are not familiar with command line, the best way to install and manage IDEs and programming languages like Python and R is through Anaconda Navigator. First, install Anaconda and Anaconda Navigator ⤴ on your computer. You can now install and then use different IDEs through Anaconda Navigator (search anaconda navigator from search bar). Alternatively, see following sections to download IDEs and programming languages independently (you may need to set path variables).
IDE
An Integrated Development Environment (IDE) is a program developed for coding/programming with your language of choice. There are several good IDE options available:
- Spyder ⤴ is used for scientific programming in python. It is a user friendly IDE.
- RStudio is an IDE for programming in R. There are two options for using RStudio; RStudio Server ⤴ is online version and RStudio Desktop ⤴ is a desktop application run locally. Check out the RStudio tutorial for more details.
- Jupyter ⤴ supports multiple programming languages. Check out the Jupyter tutorial for more details.
Python
Python is a high-level, general-purpose programming language. It is one of the best languages to learn for a beginner in coding.
- Install Python ⤴ locally.
- Follow general guidelines for installation.
- Check out the and Intro to Python pages of this workbook for more details.
R
R is a programming language for statistical computing and graphics. It is one of the most popular tools used for data mining, data modeling and generating graphics.
Install R here ⤴ locally and check out the tutorials for in this workbook for more details including installing R libraries.
Use RStudio and plot to test if R is working. Open RStudio, type following command in console and press enter.
plot(1:50)
On the right side, you should see a scatter plot like this: |
Java
Java is also a High-level programming language.
Download Java ⤴
Note:
If you are using Windows Subsystem for Linux (WSL), java installation on the Linux system will be different.
Type java
in command line (Linux) to get installation commands such as following:
sudo apt install default-jre # version 2:1.11-72, or
sudo apt install openjdk-11-jre-headless # version 11.0.7+10-3ubuntu1
sudo apt install openjdk-8-jre-headless # version 8u252-b09-1ubuntu1
Perl
Perl is a High-level programming language.
Install Perl ⤴
Conda
Conda is an open-source package and environment manager. Miniforge ⤴ is a minimalistic installer for conda.
Docker
Docker is a platform used to deliver software in packages called containers. Check out the tutorials Docker basics and Intro to Containers for more details.
NextFlow
Check out the Intro to NextFlow ⤴ in our Bioinformatics Workbook for detailed information.
When using nextflow
if you get following error:
nextflow: command not found
move nextflow in a directory into $PATH or use ./nextflow
if it is in your current path.
More useful tools
- Jbrowse ⤴ genome browser
- Connect to campus network from off-campus location: VPN instructions ⤴ from Iowa State University
- For transferring data between HPC clusters and your laptop: Globus instructions ⤴ from ISU
- Slack ⤴ messaging app for groups
- Zoom ⤴ video platform for meetings, webinars, and online events
- Webex ⤴ another video platform
Great tools to have on a computer
- Notepad++ ⤴ is a text and source code editor for Windows
- Windowgrid ⤴ allows the user to quickly and easily layout their windows on screen
- Greenshot ⤴ is a screenshot tool with useful features