Back to Search

Apache Oozie: The Workflow Scheduler for Hadoop

AUTHOR Srinivasan, Aravind; Islam, Mohammad Kamrul; Islam, Mohammad
PUBLISHER O'Reilly Media (06/23/2015)
PRODUCT TYPE Paperback (Paperback)

Description

Get a solid grounding in Apache Oozie, the workflow scheduler system for managing Hadoop jobs. With this hands-on guide, two experienced Hadoop practitioners walk you through the intricacies of this powerful and flexible platform, with numerous examples and real-world use cases.

Once you set up your Oozie server, you'll dive into techniques for writing and coordinating workflows, and learn how to write complex data pipelines. Advanced topics show you how to handle shared libraries in Oozie, as well as how to implement and manage Oozie's security capabilities.

  • Install and configure an Oozie server, and get an overview of basic concepts
  • Journey through the world of writing and configuring workflows
  • Learn how the Oozie coordinator schedules and executes workflows based on triggers
  • Understand how Oozie manages data dependencies
  • Use Oozie bundles to package several coordinator apps into a data pipeline
  • Learn about security features and shared library management
  • Implement custom extensions and write your own EL functions and actions
  • Debug workflows and manage Oozie's operational details
Show More
Product Format
Product Details
ISBN-13: 9781449369927
ISBN-10: 1449369928
Binding: Paperback or Softback (Trade Paperback (Us))
Content Language: English
More Product Details
Page Count: 269
Carton Quantity: 14
Product Dimensions: 7.00 x 0.57 x 9.19 inches
Weight: 0.96 pound(s)
Feature Codes: Index, Price on Product
Country of Origin: US
Subject Information
BISAC Categories
Computers | Data Science - Data Warehousing
Computers | Data Science - Data Analytics
Computers | Distributed Systems - Client-Server Computing
Dewey Decimal: 004.36
Descriptions, Reviews, Etc.
publisher marketing

Get a solid grounding in Apache Oozie, the workflow scheduler system for managing Hadoop jobs. With this hands-on guide, two experienced Hadoop practitioners walk you through the intricacies of this powerful and flexible platform, with numerous examples and real-world use cases.

Once you set up your Oozie server, you'll dive into techniques for writing and coordinating workflows, and learn how to write complex data pipelines. Advanced topics show you how to handle shared libraries in Oozie, as well as how to implement and manage Oozie's security capabilities.

  • Install and configure an Oozie server, and get an overview of basic concepts
  • Journey through the world of writing and configuring workflows
  • Learn how the Oozie coordinator schedules and executes workflows based on triggers
  • Understand how Oozie manages data dependencies
  • Use Oozie bundles to package several coordinator apps into a data pipeline
  • Learn about security features and shared library management
  • Implement custom extensions and write your own EL functions and actions
  • Debug workflows and manage Oozie's operational details
Show More

Author: Srinivasan, Aravind
Aravind Srinivasan has been involved with Hadoop in general and Oozie in particular since 2008. He is currently a Lead Application Architect at Altiscale, a Hadoop-as-a-service company, where he helps customers with Hadoop application design and architecture. His association with Big Data and Hadoop started during his time at Yahoo, where he spent almost six years working on various data pipelines for advertising systems. He has extensive experience building complicated, low latency data pipelines and also in porting legacy pipelines to Oozie. He drove a lot of Oozie s requirements as a customer in its early days of adoption inside Yahoo and later spent some time as a Product Manager in Yahoo s Hadoop team where he contributed further to Oozie s roadmap. He also spent a year after Yahoo at Think Big Analytics, a Hadoop consulting firm, where he got to consult on some interesting and challenging Big Data integration projects at Facebook. He has a Masters in Computer Science from Arizona State and lives in Silicon Valley.
Show More
List Price $39.99
Your Price  $39.59
Paperback