Skip to content
This repository has been archived by the owner on May 3, 2023. It is now read-only.

Latest commit

 

History

History
30 lines (22 loc) · 1.32 KB

clojure-for-bigdata-processing.md

File metadata and controls

30 lines (22 loc) · 1.32 KB

Title: Clojure for Big Data processing

Hadoop is a great tool for large scale data processing still it brings a lot of complexity to the table. From Hardware requirements to the mental model required for writing jobs and day to day tasks like testing and deployment.

Using Clojure and Amazon EMR offers a great path to overcome these challenges, In this talk will cover:

  • Main motivation, Why Clojure + Hadoop will make you work faster.
  • Clojure Hadoop library, cutting off boilerplate.
  • Amazon EMR, intro and main benefits.
  • Using Lemure for job launching.
  • Performance tuning and benchmarking (using Critirium).
  • Cascalog, declarative query engine.
  • Main pitfalls and tips.

Ronen Narkis

A Programming language geek and Github aficionado armed with Clojure, Ruby and Groovy under his belt. Iv been breathing JVM for the past 9 years, from enterprise scale to multi TB data munging map reduce jobs. I strive to practice development as an holistic beast, mastering clean coding, build, packaging, deployment, monitoring and cultural aspects.

Currently I am a Freelance Architect/Consultant, see my linkedin for past positions.

Speaker References