How to use this box with Vagrant:

Vagrant.configure("2") do |config|
  config.vm.box = "paulovn/spark-base64"
  config.vm.box_version = "0.9.7"
end
vagrant init paulovn/spark-base64 \
  --box-version 0.9.7
vagrant up

This version was created over 8 years ago.

Version updated for Spark 1.6.0. Contains software installed on top of CentOS 6.7:

  • Apache Spark 1.6.0
  • Python 2.7.5 from the Software Collections
  • A virtualenv for Python 2.7.5 with a scientific Python stack (scipy, numpy, matplotplib, pandas, statmodels, gensim, networkx, scikit-learn) plus IPython 4 + Jupyter notebook
  • R 3.2.2 with a few libraries installed (rmarkdown, magrittr, dplyr, tidyr, data.table, ggplot2)
  • Spark notebook Kernels for Scala (Spark Kernel) and R (IRKernel)
  • A couple of small notebook extensions
  • A notebook start script with facilities to configure Spark execution mode

Note this is a base box, in particular neither Spark nor Spark notebook are fully configured. A complementary Vagrantfile builds on this base box to provide a fully functional Spark environment

1 provider for this version.
  • virtualbox
    unknown Hosted by Vagrant Cloud (2.25 GB)