ansible playbook to deploy cloudera hadoop components to the cluster
-
Updated
Sep 8, 2018 - Shell
ansible playbook to deploy cloudera hadoop components to the cluster
Docker image for Cloudera Hadoop components (CDH5)
A quick and dirty CDH cluster skeleton using Docker for Testing
Getting Started with Hadoop and Big Data
💂♂️ Hadoop/MapReduce Streaming
Spark Benchmark suite to evaluate cluster configuration and compare the performance with other big data frameworks.
Otto-von-Guericke Universität Magdeburg - Big Data SoSe 2017
This is my final project for Data Engineer Expert course at Naya College.
This project creates a small local Hadoop cluster using Cloudera CDH and CentOS.
This repository contains the TF-IDF score calculation for the documents in the Canterbury dataset for a user given search query
The goal of this programming assignment is to compute the PageRanks of an input set of hyperlinked Wikipedia documents using Hadoop MapReduce. The PageRank score of a web page serves as an indicator of the importance of the page. Many web search engines (e.g., Google) use PageRank scores in some form to rank user-submitted queries. The goals of …
chatbot for hipchat (cloud or onpremise) that enables you to talk to your cloudera manager
This repository includes two versions of hadoop management tools
Navigator is a data service that prepares the content for travel agencies, ready for exploration in EWNS (East-West-North-South) direction and hence allows them to render content to the end-user based on their desire to travel.
fundamental-hadoop is basically for introduction about Apache Hadoop and it's ecosystem.
Add a description, image, and links to the cloudera-hadoop topic page so that developers can more easily learn about it.
To associate your repository with the cloudera-hadoop topic, visit your repo's landing page and select "manage topics."