|
|
Books > Computing & IT > Applications of computing > Databases > Data warehousing
Get up to speed with the new features added to Microsoft SQL Server
2019 Analysis Services and create models to support your business
Key Features Explore tips and tricks to design, develop, and
optimize end-to-end data analytics solutions using Microsoft's
technologies Learn tabular modeling and multi-dimensional cube
design development using real-world examples Implement Analysis
Services to help you make productive business decisions Book
DescriptionSQL Server Analysis Services (SSAS) continues to be a
leading enterprise-scale toolset, enabling customers to deliver
data and analytics across large datasets with great performance.
This book will help you understand MS SQL Server 2019's new
features and improvements, especially when it comes to SSAS. First,
you'll cover a quick overview of SQL Server 2019, learn how to
choose the right analytical model to use, and understand their key
differences. You'll then explore how to create a multi-dimensional
model with SSAS and expand on that model with MDX. Next, you'll
create and deploy a tabular model using Microsoft Visual Studio and
Management Studio. You'll learn when and how to use both tabular
and multi-dimensional model types, how to deploy and configure your
servers to support them, and design principles that are relevant to
each model. The book comes packed with tips and tricks to build
measures, optimize your design, and interact with models using
Excel and Power BI. All this will help you visualize data to gain
useful insights and make better decisions. Finally, you'll discover
practices and tools for securing and maintaining your models once
they are deployed. By the end of this MS SQL Server book, you'll be
able to choose the right model and build and deploy it to support
the analytical needs of your business. What you will learn
Determine the best analytical model using SSAS Cover the core
aspects involved in MDX, including writing your first query
Implement calculated tables and calculation groups (new in version
2019) in DAX Create and deploy tabular and multi-dimensional models
on SQL 2019 Connect and create data visualizations using Excel and
Power BI Implement row-level and other data security methods with
tabular and multi-dimensional models Explore essential concepts and
techniques to scale, manage, and optimize your SSAS solutions Who
this book is forThis Microsoft SQL Server book is for BI
professionals and data analysts who are looking for a practical
guide to creating and maintaining tabular and multi-dimensional
models using SQL Server 2019 Analysis Services. A basic working
knowledge of BI solutions such as Power BI and database querying is
required.
Explore the latest Azure ETL techniques both on-premises and in the
cloud using Azure services such as SQL Server Integration Services
(SSIS), Azure Data Factory, and Azure Databricks Key Features
Understand the key components of an ETL solution using Azure
Integration Services Discover the common and not-so-common
challenges faced while creating modern and scalable ETL solutions
Program and extend your packages to develop efficient data
integration and data transformation solutions Book DescriptionETL
is one of the most common and tedious procedures for moving and
processing data from one database to another. With the help of this
book, you will be able to speed up the process by designing
effective ETL solutions using the Azure services available for
handling and transforming any data to suit your requirements. With
this cookbook, you'll become well versed in all the features of SQL
Server Integration Services (SSIS) to perform data migration and
ETL tasks that integrate with Azure. You'll learn how to transform
data in Azure and understand how legacy systems perform ETL
on-premises using SSIS. Later chapters will get you up to speed
with connecting and retrieving data from SQL Server 2019 Big Data
Clusters, and even show you how to extend and customize the SSIS
toolbox using custom-developed tasks and transforms. This ETL book
also contains practical recipes for moving and transforming data
with Azure services, such as Data Factory and Azure Databricks, and
lets you explore various options for migrating SSIS packages to
Azure. Toward the end, you'll find out how to profile data in the
cloud and automate service creation with Business Intelligence
Markup Language (BIML). By the end of this book, you'll have
developed the skills you need to create and automate ETL solutions
on-premises as well as in Azure. What you will learn Explore ETL
and how it is different from ELT Move and transform various data
sources with Azure ETL and ELT services Use SSIS 2019 with Azure
HDInsight clusters Discover how to query SQL Server 2019 Big Data
Clusters hosted in Azure Migrate SSIS solutions to Azure and solve
key challenges associated with it Understand why data profiling is
crucial and how to implement it in Azure Databricks Get to grips
with BIML and learn how it applies to SSIS and Azure Data Factory
solutions Who this book is forThis book is for data warehouse
architects, ETL developers, or anyone who wants to build scalable
ETL applications in Azure. Those looking to extend their existing
on-premise ETL applications to use big data and a variety of Azure
services or others interested in migrating existing on-premise
solutions to the Azure cloud platform will also find the book
useful. Familiarity with SQL Server services is necessary to get
the most out of this book.
The concept of a big data warehouse appeared in order to store
moving data objects and temporal data information. Moving objects
are geometries that change their position and shape continuously
over time. In order to support spatio-temporal data, a data model
and associated query language is needed for supporting moving
objects. Emerging Perspectives in Big Data Warehousing is an
essential research publication that explores current innovative
activities focusing on the integration between data warehousing and
data mining with an emphasis on the applicability to real-world
problems. Featuring a wide range of topics such as index
structures, ontology, and user behavior, this book is ideally
designed for IT consultants, researchers, professionals, computer
scientists, academicians, and managers.
Explore expert techniques such as advanced indexing and high
availability to build scalable, reliable, and fault-tolerant
database applications using PostgreSQL 13 Key Features Master
advanced PostgreSQL 13 concepts with the help of real-world
datasets and examples Leverage PostgreSQL's indexing features to
fine-tune the performance of your queries Extend PostgreSQL's
functionalities to suit your organization's needs with minimal
effort Book DescriptionThanks to its reliability, robustness, and
high performance, PostgreSQL has become one of the most advanced
open source databases on the market. This updated fourth edition
will help you understand PostgreSQL administration and how to build
dynamic database solutions for enterprise apps with the latest
release of PostgreSQL, including designing both physical and
technical aspects of the system architecture with ease. Starting
with an introduction to the new features in PostgreSQL 13, this
book will guide you in building efficient and fault-tolerant
PostgreSQL apps. You'll explore advanced PostgreSQL features, such
as logical replication, database clusters, performance tuning,
advanced indexing, monitoring, and user management, to manage and
maintain your database. You'll then work with the PostgreSQL
optimizer, configure PostgreSQL for high speed, and move from
Oracle to PostgreSQL. The book also covers transactions, locking,
and indexes, and shows you how to improve performance with query
optimization. You'll also focus on how to manage network security
and work with backups and replication while exploring useful
PostgreSQL extensions that optimize the performance of large
databases. By the end of this PostgreSQL book, you'll be able to
get the most out of your database by executing advanced
administrative tasks. What you will learn Get well versed with
advanced SQL functions in PostgreSQL 13 Get to grips with
administrative tasks such as log file management and monitoring
Work with stored procedures and manage backup and recovery Employ
replication and failover techniques to reduce data loss Perform
database migration from Oracle to PostgreSQL with ease Replicate
PostgreSQL database systems to create backups and scale your
database Manage and improve server security to protect your data
Troubleshoot your PostgreSQL instance to find solutions to common
and not-so-common problems Who this book is forThis database
administration book is for PostgreSQL developers and database
administrators and professionals who want to implement advanced
functionalities and master complex administrative tasks with
PostgreSQL 13. Prior experience in PostgreSQL and familiarity with
the basics of database administration will assist with
understanding key concepts covered in the book.
Kick-start your DevOps career by learning how to effectively deploy
Kubernetes on Azure in an easy, comprehensive, and fun way with
hands-on coding tasks Key Features Understand the fundamentals of
Docker and Kubernetes Learn to implement microservices architecture
using the Kubernetes platform Discover how you can scale your
application workloads in Azure Kubernetes Service (AKS) Book
DescriptionFrom managing versioning efficiently to improving
security and portability, technologies such as Kubernetes and
Docker have greatly helped cloud deployments and application
development. Starting with an introduction to Docker, Kubernetes,
and Azure Kubernetes Service (AKS), this book will guide you
through deploying an AKS cluster in different ways. You'll then
explore the Azure portal by deploying a sample guestbook
application on AKS and installing complex Kubernetes apps using
Helm. With the help of real-world examples, you'll also get to
grips with scaling your application and cluster. As you advance,
you'll understand how to overcome common challenges in AKS and
secure your application with HTTPS and Azure AD (Active Directory).
Finally, you'll explore serverless functions such as HTTP triggered
Azure functions and queue triggered functions. By the end of this
Kubernetes book, you'll be well-versed with the fundamentals of
Azure Kubernetes Service and be able to deploy containerized
workloads on Microsoft Azure with minimal management overhead. What
you will learn Plan, configure, and run containerized applications
in production Use Docker to build apps in containers and deploy
them on Kubernetes Improve the configuration and deployment of apps
on the Azure Cloud Store your container images securely with Azure
Container Registry Install complex Kubernetes applications using
Helm Integrate Kubernetes with multiple Azure PaaS services, such
as databases, Event Hubs and Functions. Who this book is forThis
book is for aspiring DevOps professionals, system administrators,
developers, and site reliability engineers looking to understand
test and deployment processes and improve their efficiency. If
you're new to working with containers and orchestration, you'll
find this book useful.
Learn Azure's cloud capabilities with the help of this introductory
guide to employing Azure for your cloud infrastructure needs. Key
Features Get a clear overview of Azure's capabilities and benefits,
and learn how to get started efficiently Develop the ability to opt
for cloud architecture and design that best fits your organization
Leverage Azure opportunities for cost savings and optimization Book
DescriptionMicrosoft Azure is a powerful cloud computing platform
that offers a multitude of services and capabilities for
organizations of any size moving to a cloud strategy. Azure
Strategy and Implementation Guide Third Edition encapsulates the
entire spectrum of measures involved in Azure deployment that
includes understanding Azure fundamentals, choosing a suitable
cloud architecture, building on design principles, becoming
familiar with Azure DevOps, and learning best practices for
optimization and management. The book begins by introducing you to
the Azure cloud platform and demonstrating the substantial scope of
digital transformation and innovation that can be achieved by
leveraging Azure's capabilities. The guide further acquaints you
with practical insights on application modernization, Azure
Infrastructure as a Service (IaaS) deployment, infrastructure
management, key application architectures, best practices of Azure
DevOps, and Azure automation. By the end of this book, you will be
proficient in driving Azure operations right from the planning and
cloud migration stage to cost management and troubleshooting. What
you will learn Deploy and run Azure infrastructure services Carry
out detailed planning for migrating applications to the cloud with
Azure Move underlying code class structure into a serverless model
Use a gateway to isolate your services and applications Define
roles and responsibilities in DevOps Implement release &
deployment coordination and automation Who this book is forAzure
Strategy and Implementation Guide Third Edition is designed to
benefit Azure architects, cloud solution architects, Azure
developers, Azure administrators, and anyone who wants to develop
an expertise in operating and administering the Azure cloud. A
basic familiarity with operating systems and databases will help
you grasp the concepts covered in this book.
A comprehensive guide to understanding key techniques for
architecture and hardware planning, monitoring, replication,
backups, and decoupling Key Features Newly updated edition,
covering the latest PostgreSQL 12 features with hands-on
industry-driven recipes Create a PostgreSQL cluster that stays
online even when disaster strikes Learn how to avoid costly
downtime and data loss that can ruin your business Book
DescriptionDatabases are nothing without the data they store. In
the event of an outage or technical catastrophe, immediate recovery
is essential. This updated edition ensures that you will learn the
important concepts related to node architecture design, as well as
techniques such as using repmgr for failover automation. From
cluster layout and hardware selection to software stacks and
horizontal scalability, this PostgreSQL cookbook will help you
build a PostgreSQL cluster that will survive crashes, resist data
corruption, and grow smoothly with customer demand. You'll start by
understanding how to plan a PostgreSQL database architecture that
is resistant to outages and scalable, as it is the scaffolding on
which everything rests. With the bedrock established, you'll cover
the topics that PostgreSQL database administrators need to know to
manage a highly available cluster. This includes configuration,
troubleshooting, monitoring and alerting, backups through proxies,
failover automation, and other considerations that are essential
for a healthy PostgreSQL cluster. Later, you'll learn to use
multi-master replication to maximize server availability. Later
chapters will guide you through managing major version upgrades
without downtime. By the end of this book, you'll have learned how
to build an efficient and adaptive PostgreSQL 12 database cluster.
What you will learn Understand how to protect data with PostgreSQL
replication tools Focus on hardware planning to ensure that your
database runs efficiently Reduce database resource contention with
connection pooling Monitor and visualize cluster activity with
Nagios and the TIG (Telegraf, InfluxDB, Grafana) stack Construct a
robust software stack that can detect and avert outages Use
multi-master to achieve an enduring PostgreSQL cluster Who this
book is forThis book is for Postgres administrators and developers
who are looking to build and maintain a highly reliable PostgreSQL
cluster. Although knowledge of the new features of PostgreSQL 12 is
not required, a basic understanding of PostgreSQL administration is
expected.
Learn how to migrate your SAP data to Azure simply and
successfully. Key Features Learn why Azure is suitable for
business-critical systems Understand how to migrate your SAP
infrastructure to Azure Use Lift & shift migration, Lift &
migrate, Lift & migrate to HANA, or Lift & transform to
S/4HANA Book DescriptionCloud technologies have now reached a level
where even the most critical business systems can run on them. For
most organizations SAP is the key business system. If SAP is
unavailable for any reason then potentially your business stops.
Because of this, it is understandable that you will be concerned
whether such a critical system can run in the public cloud.
However, the days when you truly ran your IT system on-premises
have long since gone. Most organizations have been getting rid of
their own data centers and increasingly moving to co-location
facilities. In this context the public cloud is nothing more than
an additional virtual data center connected to your existing
network. There are typically two main reasons why you may consider
migrating SAP to Azure: You need to replace the infrastructure that
is currently running SAP, or you want to migrate SAP to a new
database. Depending on your goal SAP offers different migration
paths. You can decide either to migrate the current workload to
Azure as-is, or to combine it with changing the database and
execute both activities as a single step. SAP on Azure
Implementation Guide covers the main migration options to lead you
through migrating your SAP data to Azure simply and successfully.
What you will learn Successfully migrate your SAP infrastructure to
Azure Understand the security benefits of Azure See how Azure can
scale to meet the most demanding of business needs Ensure your SAP
infrastructure maintains high availability Increase business
agility through cloud capabilities Leverage cloud-native
capabilities to enhance SAP Who this book is forSAP on Azure
Implementation Guide is designed to benefit existing SAP architects
looking to migrate their SAP infrastructure to Azure. Whether you
are an architect implementing the migration or an IT decision maker
evaluating the benefits of migration, this book is for you.
Understand data science concepts and methodologies to manage and
deliver top-notch solutions for your organization Key Features
Learn the basics of data science and explore its possibilities and
limitations Manage data science projects and assemble teams
effectively even in the most challenging situations Understand
management principles and approaches for data science projects to
streamline the innovation process Book DescriptionData science and
machine learning can transform any organization and unlock new
opportunities. However, employing the right management strategies
is crucial to guide the solution from prototype to production.
Traditional approaches often fail as they don't entirely meet the
conditions and requirements necessary for current data science
projects. In this book, you'll explore the right approach to data
science project management, along with useful tips and best
practices to guide you along the way. After understanding the
practical applications of data science and artificial intelligence,
you'll see how to incorporate them into your solutions. Next, you
will go through the data science project life cycle, explore the
common pitfalls encountered at each step, and learn how to avoid
them. Any data science project requires a skilled team, and this
book will offer the right advice for hiring and growing a data
science team for your organization. Later, you'll be shown how to
efficiently manage and improve your data science projects through
the use of DevOps and ModelOps. By the end of this book, you will
be well versed with various data science solutions and have gained
practical insights into tackling the different challenges that
you'll encounter on a daily basis. What you will learn Understand
the underlying problems of building a strong data science pipeline
Explore the different tools for building and deploying data science
solutions Hire, grow, and sustain a data science team Manage data
science projects through all stages, from prototype to production
Learn how to use ModelOps to improve your data science pipelines
Get up to speed with the model testing techniques used in both
development and production stages Who this book is forThis book is
for data scientists, analysts, and program managers who want to use
data science for business productivity by incorporating data
science workflows efficiently. Some understanding of basic data
science concepts will be useful to get the most out of this book.
A beginner's guide to storing, managing, and analyzing data with
the updated features of Elastic 7.0 Key Features Gain access to new
features and updates introduced in Elastic Stack 7.0 Grasp the
fundamentals of Elastic Stack including Elasticsearch, Logstash,
and Kibana Explore useful tips for using Elastic Cloud and
deploying Elastic Stack in production environments Book
DescriptionThe Elastic Stack is a powerful combination of tools for
techniques such as distributed search, analytics, logging, and
visualization of data. Elastic Stack 7.0 encompasses new features
and capabilities that will enable you to find unique insights into
analytics using these techniques. This book will give you a
fundamental understanding of what the stack is all about, and help
you use it efficiently to build powerful real-time data processing
applications. The first few sections of the book will help you
understand how to set up the stack by installing tools, and
exploring their basic configurations. You'll then get up to speed
with using Elasticsearch for distributed searching and analytics,
Logstash for logging, and Kibana for data visualization. As you
work through the book, you will discover the technique of creating
custom plugins using Kibana and Beats. This is followed by coverage
of the Elastic X-Pack, a useful extension for effective security
and monitoring. You'll also find helpful tips on how to use Elastic
Cloud and deploy Elastic Stack in production environments. By the
end of this book, you'll be well versed with the fundamental
Elastic Stack functionalities and the role of each component in the
stack to solve different data processing problems. What you will
learn Install and configure an Elasticsearch architecture Solve the
full-text search problem with Elasticsearch Discover powerful
analytics capabilities through aggregations using Elasticsearch
Build a data pipeline to transfer data from a variety of sources
into Elasticsearch for analysis Create interactive dashboards for
effective storytelling with your data using Kibana Learn how to
secure, monitor and use Elastic Stack's alerting and reporting
capabilities Take applications to an on-premise or cloud-based
production environment with Elastic Stack Who this book is forThis
book is for entry-level data professionals, software engineers,
e-commerce developers, and full-stack developers who want to learn
about Elastic Stack and how the real-time processing and search
engine works for business analytics and enterprise search
applications. Previous experience with Elastic Stack is not
required, however knowledge of data warehousing and database
concepts will be helpful.
A practical guide to administer, monitor and replicate your
PostgreSQL 11 database Key Features Study and apply the newly
introduced features in PostgreSQL 11 Tackle any problem in
PostgreSQL 11 administration and management Catch up on expert
techniques for monitoring, fine-tuning, and securing your database
Book DescriptionPostgreSQL is a powerful, open source database
management system with an enviable reputation for high performance
and stability. With many new features in its arsenal, PostgreSQL 11
allows you to scale up your PostgreSQL infrastructure. This book
takes a step-by-step, recipe-based approach to effective PostgreSQL
administration. The book will introduce you to new features such as
logical replication, native table partitioning, additional query
parallelism, and much more to help you to understand and control,
crash recovery and plan backups. You will learn how to tackle a
variety of problems and pain points for any database administrator
such as creating tables, managing views, improving performance, and
securing your database. As you make steady progress, the book will
draw attention to important topics such as monitoring roles,
backup, and recovery of your PostgreSQL 11 database to help you
understand roles and produce a summary of log files, ensuring high
availability, concurrency, and replication. By the end of this
book, you will have the necessary knowledge to manage your
PostgreSQL 11 database efficiently. What you will learn
Troubleshoot open source PostgreSQL version 11 on various platforms
Deploy best practices for planning and designing live databases
Select and implement robust backup and recovery techniques in
PostgreSQL 11 Use pgAdmin or OmniDB to perform database
administrator (DBA) tasks Adopt efficient replication and high
availability techniques in PostgreSQL Improve the performance of
your PostgreSQL solution Who this book is forThis book is designed
for database administrators, data architects, database developers,
or anyone with an interest in planning and running live production
databases using PostgreSQL 11. It is also ideal if you're looking
for hands-on solutions to any problem associated with PostgreSQL 11
administration. Some experience with handling PostgreSQL databases
will be beneficial
Leverage the power of MongoDB 4.x to build and administer
fault-tolerant database applications Key Features Master the new
features and capabilities of MongoDB 4.x Implement advanced data
modeling, querying, and administration techniques in MongoDB
Includes rich case-studies and best practices followed by expert
MongoDB developers Book DescriptionMongoDB is the best platform for
working with non-relational data and is considered to be the
smartest tool for organizing data in line with business needs. The
recently released MongoDB 4.x supports ACID transactions and makes
the technology an asset for enterprises across the IT and fintech
sectors. This book provides expertise in advanced and niche areas
of managing databases (such as modeling and querying databases)
along with various administration techniques in MongoDB, thereby
helping you become a successful MongoDB expert. The book helps you
understand how the newly added capabilities function with the help
of some interesting examples and large datasets. You will dive
deeper into niche areas such as high-performance configurations,
optimizing SQL statements, configuring large-scale sharded
clusters, and many more. You will also master best practices in
overcoming database failover, and master recovery and backup
procedures for database security. By the end of the book, you will
have gained a practical understanding of administering database
applications both on premises and on the cloud; you will also be
able to scale database applications across all servers. What you
will learn Perform advanced querying techniques such as indexing
and expressions Configure, monitor, and maintain a highly scalable
MongoDB environment Master replication and data sharding to
optimize read/write performance Administer MongoDB-based
applications on premises or on the cloud Integrate MongoDB with big
data sources to process huge amounts of data Deploy MongoDB on
Kubernetes containers Use MongoDB in IoT, mobile, and serverless
environments Who this book is forThis book is ideal for MongoDB
developers and database administrators who wish to become
successful MongoDB experts and build scalable and fault-tolerant
applications using MongoDB. It will also be useful for database
professionals who wish to become certified MongoDB professionals.
Some understanding of MongoDB and basic database concepts is
required to get the most out of this book.
Simplify your ETL processes with these hands-on data hygiene tips,
tricks, and best practices. Key Features Focus on the basics of
data wrangling Study various ways to extract the most out of your
data in less time Boost your learning curve with bonus topics like
random data generation and data integrity checks Book
DescriptionFor data to be useful and meaningful, it must be curated
and refined. Data Wrangling with Python teaches you the core ideas
behind these processes and equips you with knowledge of the most
popular tools and techniques in the domain. The book starts with
the absolute basics of Python, focusing mainly on data structures.
It then delves into the fundamental tools of data wrangling like
NumPy and Pandas libraries. You'll explore useful insights into why
you should stay away from traditional ways of data cleaning, as
done in other languages, and take advantage of the specialized
pre-built routines in Python. This combination of Python tips and
tricks will also demonstrate how to use the same Python backend and
extract/transform data from an array of sources including the
Internet, large database vaults, and Excel financial tables. To
help you prepare for more challenging scenarios, you'll cover how
to handle missing or wrong data, and reformat it based on the
requirements from the downstream analytics tool. The book will
further help you grasp concepts through real-world examples and
datasets. By the end of this book, you will be confident in using a
diverse array of sources to extract, clean, transform, and format
your data efficiently. What you will learn Use and manipulate
complex and simple data structures Harness the full potential of
DataFrames and numpy.array at run time Perform web scraping with
BeautifulSoup4 and html5lib Execute advanced string search and
manipulation with RegEX Handle outliers and perform data imputation
with Pandas Use descriptive statistics and plotting techniques
Practice data wrangling and modeling using data generation
techniques Who this book is forData Wrangling with Python is
designed for developers, data analysts, and business analysts who
are keen to pursue a career as a full-fledged data scientist or
analytics expert. Although, this book is for beginners, prior
working knowledge of Python is necessary to easily grasp the
concepts covered here. It will also help to have rudimentary
knowledge of relational database and SQL.
A fast paced guide that will help you learn about Apache Hadoop 3
and its ecosystem Key Features Set up, configure and get started
with Hadoop to get useful insights from large data sets Work with
the different components of Hadoop such as MapReduce, HDFS and YARN
Learn about the new features introduced in Hadoop 3 Book
DescriptionApache Hadoop is a widely used distributed data
platform. It enables large datasets to be efficiently processed
instead of using one large computer to store and process the data.
This book will get you started with the Hadoop ecosystem, and
introduce you to the main technical topics, including MapReduce,
YARN, and HDFS. The book begins with an overview of big data and
Apache Hadoop. Then, you will set up a pseudo Hadoop development
environment and a multi-node enterprise Hadoop cluster. You will
see how the parallel programming paradigm, such as MapReduce, can
solve many complex data processing problems. The book also covers
the important aspects of the big data software development
lifecycle, including quality assurance and control, performance,
administration, and monitoring. You will then learn about the
Hadoop ecosystem, and tools such as Kafka, Sqoop, Flume, Pig, Hive,
and HBase. Finally, you will look at advanced topics, including
real time streaming using Apache Storm, and data analytics using
Apache Spark. By the end of the book, you will be well versed with
different configurations of the Hadoop 3 cluster. What you will
learn Store and analyze data at scale using HDFS, MapReduce and
YARN Install and configure Hadoop 3 in different modes Use Yarn
effectively to run different applications on Hadoop based platform
Understand and monitor how Hadoop cluster is managed Consume
streaming data using Storm, and then analyze it using Spark Explore
Apache Hadoop ecosystem components, such as Flume, Sqoop, HBase,
Hive, and Kafka Who this book is forAspiring Big Data professionals
who want to learn the essentials of Hadoop 3 will find this book to
be useful. Existing Hadoop users who want to get up to speed with
the new features introduced in Hadoop 3 will also benefit from this
book. Having knowledge of Java programming will be an added
advantage.
|
|