|
|
Books > Computing & IT > Applications of computing > Databases > Data warehousing
Do you enjoy completing puzzles? Perhaps one of the most
challenging (yet rewarding) puzzles is delivering a successful data
warehouse suitable for data mining and analytics. The Analytical
Puzzle describes an unbiased, practical, and comprehensive approach
to building a data warehouse which will lead to an increased level
of business intelligence within your organisation. New technologies
continuously impact this approach and therefore this book explains
how to leverage big data, cloud computing, data warehouse
appliances, data mining, predictive analytics, data visualisation
and mobile devices. This book describes an unbiased, practical, and
comprehensive approach to building a data warehouse which will lead
to an increased level of business intelligence within your
organisation. New technologies continuously impact this approach
and therefore this book explains how to leverage big data, cloud
computing, data warehouse appliances, data mining, predictive
analytics, data visualisation and mobile devices.
Understand the fundamentals of Kubernetes deployment on Azure with
a learn-by-doing approach Key Features Get to grips with the
fundamentals of containers and Kubernetes Deploy containerized
applications using the Kubernetes platform Learn how you can scale
your workloads and secure your application running in Azure
Kubernetes Service Book DescriptionContainers and Kubernetes
containers facilitate cloud deployments and application development
by enabling efficient versioning with improved security and
portability. With updated chapters on role-based access control,
pod identity, storing secrets, and network security in AKS, this
third edition begins by introducing you to containers, Kubernetes,
and Azure Kubernetes Service (AKS), and guides you through
deploying an AKS cluster in different ways. You will then delve
into the specifics of Kubernetes by deploying a sample guestbook
application on AKS and installing complex Kubernetes apps using
Helm. With the help of real-world examples, you'll also get to
grips with scaling your applications and clusters. As you advance,
you'll learn how to overcome common challenges in AKS and secure
your applications with HTTPS. You will also learn how to secure
your clusters and applications in a dedicated section on security.
In the final section, you'll learn about advanced integrations,
which give you the ability to create Azure databases and run
serverless functions on AKS as well as the ability to integrate AKS
with a continuous integration and continuous delivery (CI/CD)
pipeline using GitHub Actions. By the end of this Kubernetes book,
you will be proficient in deploying containerized workloads on
Microsoft Azure with minimal management overhead. What you will
learn Plan, configure, and run containerized applications in
production. Use Docker to build applications in containers and
deploy them on Kubernetes. Monitor the AKS cluster and the
application. Monitor your infrastructure and applications in
Kubernetes using Azure Monitor. Secure your cluster and
applications using Azure-native security tools. Connect an app to
the Azure database. Store your container images securely with Azure
Container Registry. Install complex Kubernetes applications using
Helm. Integrate Kubernetes with multiple Azure PaaS services, such
as databases, Azure Security Center, and Functions. Use GitHub
Actions to perform continuous integration and continuous delivery
to your cluster. Who this book is forIf you are an aspiring DevOps
professional, system administrator, developer, or site reliability
engineer interested in learning how to get the most out of
containers and Kubernetes, then this book is for you.
Solve real-world data problems and create data-driven workflows for
easy data movement and processing at scale with Azure Data Factory
Key Features Learn how to load and transform data from various
sources, both on-premises and on cloud Use Azure Data Factory's
visual environment to build and manage hybrid ETL pipelines
Discover how to prepare, transform, process, and enrich data to
generate key insights Book DescriptionAzure Data Factory (ADF) is a
modern data integration tool available on Microsoft Azure. This
Azure Data Factory Cookbook helps you get up and running by showing
you how to create and execute your first job in ADF. You'll learn
how to branch and chain activities, create custom activities, and
schedule pipelines. This book will help you to discover the
benefits of cloud data warehousing, Azure Synapse Analytics, and
Azure Data Lake Gen2 Storage, which are frequently used for big
data analytics. With practical recipes, you'll learn how to
actively engage with analytical tools from Azure Data Services and
leverage your on-premise infrastructure with cloud-native tools to
get relevant business insights. As you advance, you'll be able to
integrate the most commonly used Azure Services into ADF and
understand how Azure services can be useful in designing ETL
pipelines. The book will take you through the common errors that
you may encounter while working with ADF and show you how to use
the Azure portal to monitor pipelines. You'll also understand error
messages and resolve problems in connectors and data flows with the
debugging capabilities of ADF. By the end of this book, you'll be
able to use ADF as the main ETL and orchestration tool for your
data warehouse or data platform projects. What you will learn
Create an orchestration and transformation job in ADF Develop,
execute, and monitor data flows using Azure Synapse Create big data
pipelines using Azure Data Lake and ADF Build a machine learning
app with Apache Spark and ADF Migrate on-premises SSIS jobs to ADF
Integrate ADF with commonly used Azure services such as Azure ML,
Azure Logic Apps, and Azure Functions Run big data compute jobs
within HDInsight and Azure Databricks Copy data from AWS S3 and
Google Cloud Storage to Azure Storage using ADF's built-in
connectors Who this book is forThis book is for ETL developers,
data warehouse and ETL architects, software professionals, and
anyone who wants to learn about the common and not-so-common
challenges faced while developing traditional and hybrid ETL
solutions using Microsoft's Azure Data Factory. You'll also find
this book useful if you are looking for recipes to improve or
enhance your existing ETL pipelines. Basic knowledge of data
warehousing is expected.
Discover how to build a cloud-based data warehouse at
petabyte-scale that is burstable and built to scale for end-to-end
analytical solutions Key Features Discover how to translate
familiar data warehousing concepts into Redshift implementation Use
impressive Redshift features to optimize development,
productionizing, and operations processes Find out how to use
advanced features such as concurrency scaling, Redshift Spectrum,
and federated queries Book DescriptionAmazon Redshift is a fully
managed, petabyte-scale AWS cloud data warehousing service. It
enables you to build new data warehouse workloads on AWS and
migrate on-premises traditional data warehousing platforms to
Redshift. This book on Amazon Redshift starts by focusing on
Redshift architecture, showing you how to perform database
administration tasks on Redshift. You'll then learn how to optimize
your data warehouse to quickly execute complex analytic queries
against very large datasets. Because of the massive amount of data
involved in data warehousing, designing your database for
analytical processing lets you take full advantage of Redshift's
columnar architecture and managed services. As you advance, you'll
discover how to deploy fully automated and highly scalable extract,
transform, and load (ETL) processes, which help minimize the
operational efforts that you have to invest in managing regular ETL
pipelines and ensure the timely and accurate refreshing of your
data warehouse. Finally, you'll gain a clear understanding of
Redshift use cases, data ingestion, data management, security, and
scaling so that you can build a scalable data warehouse platform.
By the end of this Redshift book, you'll be able to implement a
Redshift-based data analytics solution and have understood the best
practice solutions to commonly faced problems. What you will learn
Use Amazon Redshift to build petabyte-scale data warehouses that
are agile at scale Integrate your data warehousing solution with a
data lake using purpose-built features and services on AWS Build
end-to-end analytical solutions from data sourcing to consumption
with the help of useful recipes Leverage Redshift's comprehensive
security capabilities to meet the most demanding business
requirements Focus on architectural insights and rationale when
using analytical recipes Discover best practices for working with
big data to operate a fully managed solution Who this book is
forThis book is for anyone involved in architecting, implementing,
and optimizing an Amazon Redshift data warehouse, such as data
warehouse developers, data analysts, database administrators, data
engineers, and data scientists. Basic knowledge of data
warehousing, database systems, and cloud concepts and familiarity
with Redshift will be beneficial.
Quickly build and deploy massive data pipelines and improve
productivity using Azure Databricks Key Features Get to grips with
the distributed training and deployment of machine learning and
deep learning models Learn how ETLs are integrated with Azure Data
Factory and Delta Lake Explore deep learning and machine learning
models in a distributed computing infrastructure Book
DescriptionMicrosoft Azure Databricks helps you to harness the
power of distributed computing and apply it to create robust data
pipelines, along with training and deploying machine learning and
deep learning models. Databricks' advanced features enable
developers to process, transform, and explore data. Distributed
Data Systems with Azure Databricks will help you to put your
knowledge of Databricks to work to create big data pipelines. The
book provides a hands-on approach to implementing Azure Databricks
and its associated methodologies that will make you productive in
no time. Complete with detailed explanations of essential concepts,
practical examples, and self-assessment questions, you'll begin
with a quick introduction to Databricks core functionalities,
before performing distributed model training and inference using
TensorFlow and Spark MLlib. As you advance, you'll explore MLflow
Model Serving on Azure Databricks and implement distributed
training pipelines using HorovodRunner in Databricks. Finally,
you'll discover how to transform, use, and obtain insights from
massive amounts of data to train predictive models and create
entire fully working data pipelines. By the end of this MS Azure
book, you'll have gained a solid understanding of how to work with
Databricks to create and manage an entire big data pipeline. What
you will learn Create ETLs for big data in Azure Databricks Train,
manage, and deploy machine learning and deep learning models
Integrate Databricks with Azure Data Factory for extract,
transform, load (ETL) pipeline creation Discover how to use Horovod
for distributed deep learning Find out how to use Delta Engine to
query and process data from Delta Lake Understand how to use Data
Factory in combination with Databricks Use Structured Streaming in
a production-like environment Who this book is forThis book is for
software engineers, machine learning engineers, data scientists,
and data engineers who are new to Azure Databricks and want to
build high-quality data pipelines without worrying about
infrastructure. Knowledge of Azure Databricks basics is required to
learn the concepts covered in this book more effectively. A basic
understanding of machine learning concepts and beginner-level
Python programming knowledge is also recommended.
 |
Integrating Data
(Paperback)
Bill Inmon, Patty Haines, David Rapien
|
R839
R718
Discovery Miles 7 180
Save R121 (14%)
|
Ships in 18 - 22 working days
|
|
|
Nearly 80 recipes to help you collect and transform data from
multiple sources into a single data source, making it way easier to
perform analytics on the data Key Features Build data pipelines
from scratch and find solutions to common data engineering problems
Learn how to work with Azure Data Factory, Data Lake, Databricks,
and Synapse Analytics Monitor and maintain your data engineering
pipelines using Log Analytics, Azure Monitor, and Azure Purview
Book DescriptionThe famous quote 'Data is the new oil' seems more
true every day as the key to most organizations' long-term success
lies in extracting insights from raw data. One of the major
challenges organizations face in leveraging value out of data is
building performant data engineering pipelines for data
visualization, ingestion, storage, and processing. This second
edition of the immensely successful book by Ahmad Osama brings to
you several recent enhancements in Azure data engineering and
shares approximately 80 useful recipes covering common scenarios in
building data engineering pipelines in Microsoft Azure. You'll
explore recipes from Azure Synapse Analytics workspaces Gen 2 and
get to grips with Synapse Spark pools, SQL Serverless pools,
Synapse integration pipelines, and Synapse data flows. You'll also
understand Synapse SQL Pool optimization techniques in this second
edition. Besides Synapse enhancements, you'll discover helpful tips
on managing Azure SQL Database and learn about security, high
availability, and performance monitoring. Finally, the book takes
you through overall data engineering pipeline management, focusing
on monitoring using Log Analytics and tracking data lineage using
Azure Purview. By the end of this book, you'll be able to build
superior data engineering pipelines along with having an invaluable
go-to guide. What you will learn Process data using Azure
Databricks and Azure Synapse Analytics Perform data transformation
using Azure Synapse data flows Perform common administrative tasks
in Azure SQL Database Build effective Synapse SQL pools which can
be consumed by Power BI Monitor Synapse SQL and Spark pools using
Log Analytics Track data lineage using Microsoft Purview
integration with pipelines Who this book is forThis book is for
data engineers, data architects, database administrators, and data
professionals who want to get well versed with the Azure data
services for building data pipelines. Basic understanding of cloud
and data engineering concepts will help in getting the most out of
this book.
Learn Azure's cloud capabilities with the help of this introductory
guide to employing Azure for your cloud infrastructure needs. Key
Features Get a clear overview of Azure's capabilities and benefits,
and learn how to get started efficiently Develop the ability to opt
for cloud architecture and design that best fits your organization
Leverage Azure opportunities for cost savings and optimization Book
DescriptionMicrosoft Azure is a powerful cloud computing platform
that offers a multitude of services and capabilities for
organizations of any size moving to a cloud strategy. Azure
Strategy and Implementation Guide Third Edition encapsulates the
entire spectrum of measures involved in Azure deployment that
includes understanding Azure fundamentals, choosing a suitable
cloud architecture, building on design principles, becoming
familiar with Azure DevOps, and learning best practices for
optimization and management. The book begins by introducing you to
the Azure cloud platform and demonstrating the substantial scope of
digital transformation and innovation that can be achieved by
leveraging Azure's capabilities. The guide further acquaints you
with practical insights on application modernization, Azure
Infrastructure as a Service (IaaS) deployment, infrastructure
management, key application architectures, best practices of Azure
DevOps, and Azure automation. By the end of this book, you will be
proficient in driving Azure operations right from the planning and
cloud migration stage to cost management and troubleshooting. What
you will learn Deploy and run Azure infrastructure services Carry
out detailed planning for migrating applications to the cloud with
Azure Move underlying code class structure into a serverless model
Use a gateway to isolate your services and applications Define
roles and responsibilities in DevOps Implement release &
deployment coordination and automation Who this book is forAzure
Strategy and Implementation Guide Third Edition is designed to
benefit Azure architects, cloud solution architects, Azure
developers, Azure administrators, and anyone who wants to develop
an expertise in operating and administering the Azure cloud. A
basic familiarity with operating systems and databases will help
you grasp the concepts covered in this book.
Learn to extract actionable insights from your big data in real
time using a range of Microsoft Azure features Key Features Updated
with the latest features and new additions to Microsoft Azure
Master the fundamentals of cloud analytics using Azure Learn to use
Azure Synapse Analytics (formerly known as Azure SQL Data
Warehouse) to derive real-time customer insights Book
DescriptionCloud Analytics with Microsoft Azure serves as a
comprehensive guide for big data analysis and processing using a
range of Microsoft Azure features. This book covers everything you
need to build your own data warehouse and learn numerous techniques
to gain useful insights by analyzing big data The book begins by
introducing you to the power of data with big data analytics, the
Internet of Things (IoT), machine learning, artificial
intelligence, and DataOps. You will learn about cloud-scale
analytics and the services Microsoft Azure offers to empower
businesses to discover insights. You will also be introduced to the
new features and functionalities added to the modern data
warehouse. Finally, you will look at two real-world business use
cases to demonstrate high-level solutions using Microsoft Azure.
The aim of these use cases will be to illustrate how real-time data
can be analyzed in Azure to derive meaningful insights and make
business decisions. You will learn to build an end-to-end analytics
pipeline on the cloud with machine learning and deep learning
concepts. By the end of this book, you will be proficient in
analyzing large amounts of data with Azure and using it effectively
to benefit your organization. What you will learn Explore the
concepts of modern data warehouses and data pipelines Discover
unique design considerations while applying a cloud analytics
solution Design an end-to-end analytics pipeline on the cloud
Differentiate between structured, semi-structured, and unstructured
data Choose a cloud-based service for your data analytics solutions
Use Azure services to ingest, store, and analyze data of any scale
Who this book is forThis book is designed to benefit software
engineers, Azure developers, cloud consultants, and anyone who is
keen to learn the process of deriving business insights from huge
amounts of data using Azure. Though not necessary, a basic
understanding of data analytics concepts such as data streaming,
data types, the machine learning life cycle, and Docker containers
will help you get the most out of the book.
Explore expert techniques such as advanced indexing and high
availability to build scalable, reliable, and fault-tolerant
database applications using PostgreSQL 13 Key Features Master
advanced PostgreSQL 13 concepts with the help of real-world
datasets and examples Leverage PostgreSQL's indexing features to
fine-tune the performance of your queries Extend PostgreSQL's
functionalities to suit your organization's needs with minimal
effort Book DescriptionThanks to its reliability, robustness, and
high performance, PostgreSQL has become one of the most advanced
open source databases on the market. This updated fourth edition
will help you understand PostgreSQL administration and how to build
dynamic database solutions for enterprise apps with the latest
release of PostgreSQL, including designing both physical and
technical aspects of the system architecture with ease. Starting
with an introduction to the new features in PostgreSQL 13, this
book will guide you in building efficient and fault-tolerant
PostgreSQL apps. You'll explore advanced PostgreSQL features, such
as logical replication, database clusters, performance tuning,
advanced indexing, monitoring, and user management, to manage and
maintain your database. You'll then work with the PostgreSQL
optimizer, configure PostgreSQL for high speed, and move from
Oracle to PostgreSQL. The book also covers transactions, locking,
and indexes, and shows you how to improve performance with query
optimization. You'll also focus on how to manage network security
and work with backups and replication while exploring useful
PostgreSQL extensions that optimize the performance of large
databases. By the end of this PostgreSQL book, you'll be able to
get the most out of your database by executing advanced
administrative tasks. What you will learn Get well versed with
advanced SQL functions in PostgreSQL 13 Get to grips with
administrative tasks such as log file management and monitoring
Work with stored procedures and manage backup and recovery Employ
replication and failover techniques to reduce data loss Perform
database migration from Oracle to PostgreSQL with ease Replicate
PostgreSQL database systems to create backups and scale your
database Manage and improve server security to protect your data
Troubleshoot your PostgreSQL instance to find solutions to common
and not-so-common problems Who this book is forThis database
administration book is for PostgreSQL developers and database
administrators and professionals who want to implement advanced
functionalities and master complex administrative tasks with
PostgreSQL 13. Prior experience in PostgreSQL and familiarity with
the basics of database administration will assist with
understanding key concepts covered in the book.
A beginner's guide to storing, managing, and analyzing data with
the updated features of Elastic 7.0 Key Features Gain access to new
features and updates introduced in Elastic Stack 7.0 Grasp the
fundamentals of Elastic Stack including Elasticsearch, Logstash,
and Kibana Explore useful tips for using Elastic Cloud and
deploying Elastic Stack in production environments Book
DescriptionThe Elastic Stack is a powerful combination of tools for
techniques such as distributed search, analytics, logging, and
visualization of data. Elastic Stack 7.0 encompasses new features
and capabilities that will enable you to find unique insights into
analytics using these techniques. This book will give you a
fundamental understanding of what the stack is all about, and help
you use it efficiently to build powerful real-time data processing
applications. The first few sections of the book will help you
understand how to set up the stack by installing tools, and
exploring their basic configurations. You'll then get up to speed
with using Elasticsearch for distributed searching and analytics,
Logstash for logging, and Kibana for data visualization. As you
work through the book, you will discover the technique of creating
custom plugins using Kibana and Beats. This is followed by coverage
of the Elastic X-Pack, a useful extension for effective security
and monitoring. You'll also find helpful tips on how to use Elastic
Cloud and deploy Elastic Stack in production environments. By the
end of this book, you'll be well versed with the fundamental
Elastic Stack functionalities and the role of each component in the
stack to solve different data processing problems. What you will
learn Install and configure an Elasticsearch architecture Solve the
full-text search problem with Elasticsearch Discover powerful
analytics capabilities through aggregations using Elasticsearch
Build a data pipeline to transfer data from a variety of sources
into Elasticsearch for analysis Create interactive dashboards for
effective storytelling with your data using Kibana Learn how to
secure, monitor and use Elastic Stack's alerting and reporting
capabilities Take applications to an on-premise or cloud-based
production environment with Elastic Stack Who this book is forThis
book is for entry-level data professionals, software engineers,
e-commerce developers, and full-stack developers who want to learn
about Elastic Stack and how the real-time processing and search
engine works for business analytics and enterprise search
applications. Previous experience with Elastic Stack is not
required, however knowledge of data warehousing and database
concepts will be helpful.
Leverage the power of MongoDB 4.x to build and administer
fault-tolerant database applications Key Features Master the new
features and capabilities of MongoDB 4.x Implement advanced data
modeling, querying, and administration techniques in MongoDB
Includes rich case-studies and best practices followed by expert
MongoDB developers Book DescriptionMongoDB is the best platform for
working with non-relational data and is considered to be the
smartest tool for organizing data in line with business needs. The
recently released MongoDB 4.x supports ACID transactions and makes
the technology an asset for enterprises across the IT and fintech
sectors. This book provides expertise in advanced and niche areas
of managing databases (such as modeling and querying databases)
along with various administration techniques in MongoDB, thereby
helping you become a successful MongoDB expert. The book helps you
understand how the newly added capabilities function with the help
of some interesting examples and large datasets. You will dive
deeper into niche areas such as high-performance configurations,
optimizing SQL statements, configuring large-scale sharded
clusters, and many more. You will also master best practices in
overcoming database failover, and master recovery and backup
procedures for database security. By the end of the book, you will
have gained a practical understanding of administering database
applications both on premises and on the cloud; you will also be
able to scale database applications across all servers. What you
will learn Perform advanced querying techniques such as indexing
and expressions Configure, monitor, and maintain a highly scalable
MongoDB environment Master replication and data sharding to
optimize read/write performance Administer MongoDB-based
applications on premises or on the cloud Integrate MongoDB with big
data sources to process huge amounts of data Deploy MongoDB on
Kubernetes containers Use MongoDB in IoT, mobile, and serverless
environments Who this book is forThis book is ideal for MongoDB
developers and database administrators who wish to become
successful MongoDB experts and build scalable and fault-tolerant
applications using MongoDB. It will also be useful for database
professionals who wish to become certified MongoDB professionals.
Some understanding of MongoDB and basic database concepts is
required to get the most out of this book.
|
|