Skip to content
This repository was archived by the owner on May 7, 2025. It is now read-only.

Latest commit

 

History

History
40 lines (25 loc) · 1.34 KB

File metadata and controls

40 lines (25 loc) · 1.34 KB

MrGeo Development Environment

This repo contains the build scripts for setting up pseudo-distributed MrGeo development environment. Specifically, it builds a Ubuntu 16.04 Vagrant VM loaded with a "single-node" Hadoop distributed file system and the Apache Spark big data processing framework. The addition of HBase is under development.

Getting Up and Running

Clone this repo and at the root, boot up the VM:

mrgeo-dev-env$ vagrant up

This will download the base Ubuntu 16.04 box, pre-install the Ansible DevOps framework, and provision the server. For provisioning details please review the Ansible playbook found in shared/devops/playbook.

You can ssh into the VM with the following Vagrant command:

mrgeo-dev-env$ vagrant ssh

Now you are at the VM's command line interface.

The Hadoop web-UI ports have been forwarded to the host machine, so you may view the Hadoop dashboard at http://localhost:50070/ once the provisioning is complete.

Working with MrGeo

In order to work with MrGeo command line, you need to become the hadoop user

vagrant@vagrant:~$ sudo su hadoop

The MrGeo binary is located here for now (will add symbolic link later):

hadoop@vagrant:/home/vagrant$ cd /srv/mrgeo/mrgeo-cmd/src/main/scripts

hadoop@vagrant:/home/vagrant$ ./mrgeo --help