Ports and URLs

Name       Port   URL
Notebooks  38889


In this video, I’ll show you how to use Python notebooks and Apache Spark to perform a simple analysis of the Back to the Future transcript.

This tutorial uses a Docker image that I created, which can be found at:


The Docker image contains Apache Spark 2.0.0-preview, pre-built for Hadoop 2.7. It also includes Python 3.5 and Anaconda for running Python notebooks.
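
As a rough sketch of the kind of analysis this setup allows, the snippet below counts the most frequent words in a text file using Spark's RDD API. It is not taken from the video: the helper names `tokenize` and `top_words`, the tokenizing rule, and the file name `transcript.txt` are assumptions made for illustration, and `sc` is assumed to be a SparkContext that already exists in the notebook.

```python
import re
from operator import add


def tokenize(line):
    """Lowercase a line and split it into words (letters and apostrophes only)."""
    return re.findall(r"[a-z']+", line.lower())


def top_words(sc, path, n=10):
    """Return the n most frequent words in the text file at `path`.

    `sc` is an existing pyspark SparkContext; `path` is a hypothetical
    location of the transcript file.
    """
    counts = (
        sc.textFile(path)                  # one record per line of the file
          .flatMap(tokenize)               # one record per word
          .map(lambda word: (word, 1))     # pair each word with a count of 1
          .reduceByKey(add)                # sum the counts for each word
    )
    # Sort by descending count and bring only the top n back to the driver.
    return counts.takeOrdered(n, key=lambda pair: -pair[1])
```

In a notebook, this could be called with something like `sc = SparkContext.getOrCreate()` (after `from pyspark import SparkContext`) followed by `top_words(sc, "transcript.txt")`, where the path points to wherever the transcript has been saved. The result is a list of `(word, count)` pairs.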

The Dockerfile can be found at: