Skip to content

loum/hadoop-pseudo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

39 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Hadoop: Pseudo Distributed Container Image

Overview

Quick and easy way to get Hadoop running in pseudo-distributed mode using Docker.

See Hadoop docs for more information.

top

Quick Links

top

Quick Start

Impatient, and just want Hadoop quickly?:

docker run --rm -ti --name hadoop-pseudo loum/hadoop-pseudo:latest

NOTE: More at https://hub.docker.com/r/loum/hadoop-pseudo

top

Prerequisites

top

Getting Started

Get the code and change into the top level git project directory:

git clone https://github.com/loum/hadoop-pseudo.git && cd hadoop-pseudo

NOTE: Run all commands from the top-level directory of the git repository.

For first-time setup, prime the Makester project:

git submodule update --init

Keep Makester project up-to-date with:

make submodule-update

Setup the environment:

make init

top

Getting Help

There should be a make target to get most things done. Check the help for more information:

make help

top

Docker Image Management

NOTE: See Makester's docker subsystem for more detailed container image operations.

Build the container image locally:

make image-build

Search for built container image:

make image-search

Delete the container image:

make image-rm

top

Interact with Hadoop

Configuration

Every Hadoop configuration setting can be overridden during container startup by targeting the setting name and prepending the configuration file context as per the following:

Container runtime

To start the container and wait for all Hadoop services to initiate:

make controlled-run

Get the Hadoop version:

make hadoop-version

To drop into the container runtime's shell and interact with hdfs:

make container-bash

NOTE: The Hadoop Command Reference details the full command suite.

Stop the running container image:

make container-stop

Web interfaces

The following web interfaces are available to view configurations and logs:

top

About

Hadoop pseudo-distributed container image

Topics

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages