Books > Computing & IT > General theory of computing > Data structures
|
Buy Now
Apache Oozie Essentials (Paperback)
Loot Price: R994
Discovery Miles 9 940
|
|
Apache Oozie Essentials (Paperback)
Expected to ship within 10 - 15 working days
|
Unleash the power of Apache Oozie to create and manage your big
data and machine learning pipelines in one go About This Book *
Teaches you everything you need to know to get started with Apache
Oozie from scratch and manage your data pipelines effortlessly *
Learn to write data ingestion workflows with the help of real-life
examples from the author's own personal experience * Embed Spark
jobs to run your machine learning models on top of Hadoop Who This
Book Is For If you are an expert Hadoop user who wants to use
Apache Oozie to handle workflows efficiently, this book is for you.
This book will be handy to anyone who is familiar with the basics
of Hadoop and wants to automate data and machine learning
pipelines. What You Will Learn * Install and configure Oozie from
source code on your Hadoop cluster * Dive into the world of Oozie
with Java MapReduce jobs * Schedule Hive ETL and data ingestion
jobs * Import data from a database through Sqoop jobs in HDFS *
Create and process data pipelines with Pig, hive scripts as per
business requirements. * Run machine learning Spark jobs on Hadoop
* Create quick Oozie jobs using Hue * Make the most of Oozie's
security capabilities by configuring Oozie's security In Detail As
more and more organizations are discovering the use of big data
analytics, interest in platforms that provide storage, computation,
and analytic capabilities is booming exponentially. This calls for
data management. Hadoop caters to this need. Oozie fulfils this
necessity for a scheduler for a Hadoop job by acting as a cron to
better analyze data. Apache Oozie Essentials starts off with the
basics right from installing and configuring Oozie from source code
on your Hadoop cluster to managing your complex clusters. You will
learn how to create data ingestion and machine learning workflows.
This book is sprinkled with the examples and exercises to help you
take your big data learning to the next level. You will discover
how to write workflows to run your MapReduce, Pig ,Hive, and Sqoop
scripts and schedule them to run at a specific time or for a
specific business requirement using a coordinator. This book has
engaging real-life exercises and examples to get you in the thick
of things. Lastly, you'll get a grip of how to embed Spark jobs,
which can be used to run your machine learning models on Hadoop. By
the end of the book, you will have a good knowledge of Apache
Oozie. You will be capable of using Oozie to handle large Hadoop
workflows and even improve the availability of your Hadoop
environment. Style and approach This book is a hands-on guide that
explains Oozie using real-world examples. Each chapter is blended
beautifully with fundamental concepts sprinkled in-between case
study solution algorithms and topped off with self-learning
exercises.
General
Is the information for this product incomplete, wrong or inappropriate?
Let us know about it.
Does this product have an incorrect or missing image?
Send us a new image.
Is this product missing categories?
Add more categories.
Review This Product
No reviews yet - be the first to create one!
|
You might also like..
|
Email address subscribed successfully.
A activation email has been sent to you.
Please click the link in that email to activate your subscription.