Most of my focus is now on building Hadoop clusters in Docker. See https://github.com/timveil/docker-hadoop. This repository is no longer maintained.
This is a Vagrant box that can serve as a sensible base for installing the Hortonworks Data Platform (HDP). The box is based on CentOS 7 (https://atlas.hashicorp.com/centos/boxes/7) and includes the following modifications, consistent with the Hortonworks documentation (http://docs.hortonworks.com/):
- installation of the following packages
  - scp
  - curl
  - unzip
  - tar
  - wget
  - ntp
  - openssl-devel
  - gcc (required for VirtualBox Guest Additions)
  - kernel-devel (required for VirtualBox Guest Additions)
  - java-1.8.0-openjdk-devel.x86_64 (Java to be used by HDP, etc.)
- setting JAVA_HOME
- enabling yum fastcache
- updating all yum packages and cleaning yum history
- enabling and starting ntp
- disabling firewalld
- disabling transparent huge pages
- disabling selinux
- updating timezone to New York
- increasing ulimit
- updating umask
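The modifications listed above can be spot-checked from inside a running box. The following is a minimal, read-only sketch, not part of this repository. It assumes the ntp package runs as the `ntpd` service and that the raised limit is the open-files limit (`ulimit -n`); neither is stated in this README. The helper names `report` and `thp_is_disabled` are hypothetical.

```shell
#!/bin/sh
# Read-only sanity checks; nothing here changes the system.
# Intended to be run inside the guest, e.g. after `vagrant ssh`.

# Succeeds when the given transparent-huge-pages setting has "never" selected.
# The kernel marks the active value with brackets, e.g. "always madvise [never]".
thp_is_disabled() {
  case "$1" in
    *"[never]"*) return 0 ;;
    *) return 1 ;;
  esac
}

# Prints "label: output" for a command, or "label: unavailable" when the
# command is missing or prints nothing. Runs in a subshell to keep variables local.
report() (
  label="$1"; shift
  value="$("$@" 2>/dev/null)" || true
  printf '%s: %s\n' "$label" "${value:-unavailable}"
)

report "JAVA_HOME" printenv JAVA_HOME
report "selinux" getenforce
report "firewalld" systemctl is-enabled firewalld
report "ntpd" systemctl is-active ntpd      # assumption: service name is ntpd
report "timezone" date +%Z
report "open files limit" ulimit -n         # assumption: this is the limit that was raised
report "umask" umask

thp_file=/sys/kernel/mm/transparent_hugepage/enabled
if [ -r "$thp_file" ] && thp_is_disabled "$(cat "$thp_file")"; then
  echo "transparent huge pages: disabled"
else
  echo "transparent huge pages: not disabled (or not readable)"
fi
```

Because `systemctl is-enabled` exits non-zero for a disabled unit, `report` deliberately ignores exit codes and only looks at the printed output.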
To run, execute the following:
vagrant up --provider virtualbox
The latest release is v1.0.15. For detailed release information see https://app.vagrantup.com/timveil/boxes/centos7-hdp-base
Before packaging the box, make sure to change both the vagrant and root passwords to vagrant:
sudo su -
# change root password
passwd
# change vagrant password
passwd vagrant
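If the interactive prompts are inconvenient, the same change could be scripted with `chpasswd`, which reads `user:password` lines from standard input. This is only a sketch of that alternative, not something this repository does; the helper name `chpasswd_lines` is hypothetical, and the actual `chpasswd` call is left as a comment because it must run as root inside the guest.

```shell
#!/bin/sh
# Prints one "user:password" line per user, in the format chpasswd expects.
# Usage: chpasswd_lines PASSWORD USER [USER...]
chpasswd_lines() (
  password="$1"; shift
  for user in "$@"; do
    printf '%s:%s\n' "$user" "$password"
  done
)

# Inside the guest, as root, this would set both passwords to "vagrant":
#   chpasswd_lines vagrant root vagrant | chpasswd
```

Building the input separately makes it easy to inspect exactly what will be sent to `chpasswd` before running it.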
To package and deploy the box (see https://scotch.io/tutorials/how-to-create-a-vagrant-base-box-from-an-existing-one for background):
vagrant package --output centos7-hdp-base.box
vagrant box add centos7-hdp-base centos7-hdp-base.box --force