What is the Binder Project?

The Binder project offers an easy place to share computing environments to everyone. It allows users to specify custom environments and share them with a single link. Use cases involve workshops, scientific workflows and streamline sharing among teams.

The Binder Project builds tools that reward best practices in reproducible data science by utilizing community-developed standards for reproducibility. When repositories follow these best practices and are hosted in an online repository, then Binder automatically builds a linkable environment anybody can access.

What is Binder used for?

Guiding Principles of the Binder Project

The Binder Project uses the following principles in designing its technology and the practices around it. This is a non-exhaustive list.

  • Repositories should be human and machine readable.
  • Use existing specifications and standards if they exist. Adopt new specifications in consultation with the communities that Binder serves.
  • Support many languages and interfaces. Be as workflow-agnostic as possible.
  • Be lightweight and tightly-scoped, but extendable to new workflows, platforms, cloud vendors, etc.

In short, if you follow best-practices in computational science, your repository should work with Binder.

What is in the Binder stack?

Binder is entirely powered by an open-source infrastructure stack. Its main two tools are BinderHub, which is an open-source tool that deploys the Binder service in the cloud, and repo2docker, which generates reproducible Docker images from a git repository. The Binder team also runs a public BinderHub deployment at mybinder.org as a free public service for the community.

repo2docker

Here’s a link to the repo2docker repository

repo2docker is a lightweight tool that converts code repositories into reproducible Docker images. It defines the Reproducible Execution Environment Specification, which is used to define rules for converting repository configuration files into Docker images.

repo2docker is used extensively by BinderHub, but can also be run as an independent command line-based tool for generating your own reproducible Docker images that are run with a Jupyter server.

To learn more about repo2docker, see the repo2docker documentation.

BinderHub

Here’s a link to the BinderHub repository

BinderHub is a web application that allows users to create sharable, interactive, reproducible environments from code repositories. It uses repo2docker to generate Docker images for each environment, and JupyterHub to provide interactive user sessions from those images.

BinderHub is a web application built on Kubernetes, another open-source tool for managing cloud infrastructure. It is cloud- and hardware-agnostic, making it scalable and flexible and able to accomodate many use-cases and communities. One example of a BinderHub deployment lives at mybinder.org.

To learn how to deploy your own BinderHub, see the BinderHub documentation.

mybinder.org

Here’s a link to mybinder.org.

The BinderHub deployment at mybinder.org is a free public service that the Binder Community manages for the community. It is actually a federation of many BinderHub deployments that is run as an experiment in open, community-led infrastructure. We run mybinder.org as a radically transparent public service, and as such there is a lot of information out there about the deployment. Here are a few useful resources in case you’re interested:

Join or connect with the Binder community

Like all Project Jupyter efforts, the Binder Project is an open-source and community-driven project. We’d love for you to join our community and contribute code, time, comments, or appreciation.