Tags
Language
Tags
March 2024
Su Mo Tu We Th Fr Sa
25 26 27 28 29 1 2
3 4 5 6 7 8 9
10 11 12 13 14 15 16
17 18 19 20 21 22 23
24 25 26 27 28 29 30
31 1 2 3 4 5 6

Programming MapReduce with Scalding

Posted By: AlenMiler
Programming MapReduce with Scalding

Programming MapReduce with Scalding by Antonios Chalkiopoulos
English | June 24, 2014 | ISBN: 1783287012 | 107 Pages | EPUB/MOBI/PDF (True) | 11 MB
With: Code Files

Programming MapReduce with Scalding is a practical guide to setting up a development environment and implementing simple and complex MapReduce transformations in Scalding, using a test-driven development methodology and other best practices.

A practical guide to designing, testing, and implementing complex MapReduce applications in Scala

Overview

Develop MapReduce applications using a functional development language in a lightweight, high-performance, and testable way
Recognize the Scalding capabilities to communicate with external data stores and perform machine learning operations
Full of illustrations and diagrams, practical examples, and tips for deeper understanding of MapReduce application development

This book will first introduce you to how the Cascading framework allows for higher abstraction reasoning over MapReduce applications and then dive into how Scala DSL Scalding enables us to develop elegant and testable applications. It will then teach you how to test Scalding jobs and how to define specifications and behavior-driven development (BDD) with Scalding. This book will also demonstrate how to monitor and maintain cluster stability and efficiently access SQL, NoSQL, and search platforms.

Programming MapReduce with Scalding provides hands-on information starting from proof of concept applications and progressing to production-ready implementations.

What you will learn from this book

Set up an environment to execute jobs in local and Hadoop mode
Preview the complete Scalding API through examples and illustrations
Learn about Scalding capabilities, testing, and pipelining jobs
Understand the concepts of MapReduce patterns and the applications of its ecosystem
Implement logfile analysis and ad-targeting applications using best practices
Apply a test-driven development (TDD) methodology and structure Scalding applications in a modular and testable way
Interact with external NoSQL and SQL data stores from Scalding
Deploy, schedule, monitor, and maintain production systems
Approach

This book is an easy-to-understand, practical guide to designing, testing, and implementing complex MapReduce applications in Scala using the Scalding framework. It is packed with examples featuring log-processing, ad-targeting, and machine learning.