In Data Science, everyone more-or-less is a developer. Whether you write simple scripts in Bash, submit jobs to a queuing system on an HPC infrastructure, or develop advanced software and web applications in any programming language, an integrated development environment (IDE) makes your daily work easier. Among useful features, there are built-in browsing of the file system, autocomplete of syntax, a preview of available attributes, error detection, customized formatting, and even source code rendering (e.g., markdown or HTML). Even if you’re only using open-source pipelines, it’s worth familiarizing yourself with common editors such as Atom or Jupyter to get the most out of them.
Table of Contents
1. Integrated & Interactive Development Environment (IDE)1.1 Visual Studio Code: multi-language integrated DE (VSC)
1.2 Jupyter: interactive web-based multi-kernel DE
1.2.1 Getting started with JupyterLab on a local machine
1.2.2 Getting started with Jupyter Notebook on HPC systems
1.2.3 Sharing Jupyter-based reproducible pipline via MyBinder
2. Python programming environment(s)
2.1 Online Console: simple Python code in a browser (without installation)
2.2 Online Notebook: interactive Python on Try-jupyter and Colab (no installation)
2.3 Local Python setup on your computing machine (installation needed)
2.3.1 Shell & IDLE: Python code in a terminal or simple IDE (beginner)
2.3.2 Text editors: create Python code in terminal text files (intermediate)
2.3.3 Jupyter Lab: create an interactive Python notebook (advanced)
2.3.4 PyCharm: IDE for professional Python developers (professional)
3. R programming environment(s)
3.1 RStudio: integrated environment for R programming
3.1.1 Setting up RStudio