42,99 €
inkl. MwSt.
Versandkostenfrei*
Versandfertig in über 4 Wochen
  • Broschiertes Buch

Get a solid grounding in Oozie, the workflow scheduler for Hadoop jobs. With this practical guide, two experienced Hadoop practitioners teach you Oozie concepts and caveats through lots of examples. You'll learn how to set up an Oozie server and run jobs, then dive into Oozie workflow techniques: coordinating workflows, bundling applications, and writing to them. Advanced topics show you how to use Oozie to submit MapReduce, Pig, and Hive jobs directly, and how to use Oozie's security capabilities.

Produktbeschreibung
Get a solid grounding in Oozie, the workflow scheduler for Hadoop jobs. With this practical guide, two experienced Hadoop practitioners teach you Oozie concepts and caveats through lots of examples. You'll learn how to set up an Oozie server and run jobs, then dive into Oozie workflow techniques: coordinating workflows, bundling applications, and writing to them. Advanced topics show you how to use Oozie to submit MapReduce, Pig, and Hive jobs directly, and how to use Oozie's security capabilities.
Autorenporträt
Mohammad Kamrul Islam is currently working at Uber in data engineering team as a Staff Software Engineer. Previously, he worked at Linkedin for more than two years as Staff Software Engineer in the Hadoop development team. Before that, he worked at Yahoo for nearly five years as an Oozie architect/technical lead. His fingerprints can befound all over Oozie and is a respected voice in the Oozie community. He has been intimately involved with the Apache Hadoop ecosystem since 2009. Mohammad has a Ph.D. in Computer Science with a specialization in parallel job scheduling from Ohio State University. He received his MSCS degree from Wright State University, Ohio andBSCS from Bangladesh University of Engineering and Technology (BUET). He is a Project Management Committee (PMC) member of both Apache Oozie and Apache TEZ and frequently contributes to Apache YARN/MapReduce and Apache Hive. He was elected as the PMC chair and Vice-President of Oozie as part of the Apache Software Foundation from 2013 through 2015. Aravind Srinivasan has been involved with Hadoop in general and Oozie in particular since 2008. He is currently a Lead Application Architect at Altiscale, a Hadoop-as-a-service company, where he helps customers with Hadoop application design and architecture. His association with Big Data and Hadoop started during his time at Yahoo, where he spent almost six years working on various data pipelines for advertising systems. He has extensive experience building complicated, low latency data pipelines and also in porting legacy pipelines to Oozie. He drove a lot of Oozie's requirements as a customer in its early days of adoption inside Yahoo and later spent some time as a Product Manager in Yahoo's Hadoop team where he contributed further to Oozie's roadmap. He also spent a year after Yahoo at Think Big Analytics, a Hadoop consulting firm, where he got to consult on some interesting and challenging Big Data integration projects at Facebook. He has a Masters in Computer Science from Arizona State and lives in Silicon Valley.