|
|
Showing 1 - 7 of
7 matches in All Departments
The design patterns in this book capture best practices and
solutions to recurring problems in machine learning. The authors,
three Google engineers, catalog proven methods to help data
scientists tackle common problems throughout the ML process. These
design patterns codify the experience of hundreds of experts into
straightforward, approachable advice. In this book, you will find
detailed explanations of 30 patterns for data and problem
representation, operationalization, repeatability, reproducibility,
flexibility, explainability, and fairness. Each pattern includes a
description of the problem, a variety of potential solutions, and
recommendations for choosing the best technique for your situation.
You'll learn how to: Identify and mitigate common challenges when
training, evaluating, and deploying ML models Represent data for
different ML model types, including embeddings, feature crosses,
and more Choose the right model type for specific problems Build a
robust training loop that uses checkpoints, distribution strategy,
and hyperparameter tuning Deploy scalable ML systems that you can
retrain and update to reflect new data Interpret model predictions
for stakeholders and ensure models are treating users fairly
This book presents innovative work in Climate Informatics, a new
field that reflects the application of data mining methods to
climate science, and shows where this new and fast growing field is
headed. Given its interdisciplinary nature, Climate Informatics
offers insights, tools and methods that are increasingly needed in
order to understand the climate system, an aspect which in turn has
become crucial because of the threat of climate change. There has
been a veritable explosion in the amount of data produced by
satellites, environmental sensors and climate models that monitor,
measure and forecast the earth system. In order to meaningfully
pursue knowledge discovery on the basis of such voluminous and
diverse datasets, it is necessary to apply machine learning
methods, and Climate Informatics lies at the intersection of
machine learning and climate science. This book grew out of the
fourth workshop on Climate Informatics held in Boulder, Colorado in
Sep. 2014.
All cloud architects need to know how to build data platforms—the
key to enabling businesses with data and delivering enterprise-wide
intelligence in a fast and efficient way. This handbook is ideal
for learning how to design, build, and modernize cloud native data
and machine learning platforms using AWS, Azure, Google Cloud, or
multicloud tools like Fivetran, dbt, Snowflake, and Databricks.
Authors Marco Tranquillin, Valliappa Lakshmanan, and Firat Tekiner
cover the entire data lifecycle in a cloud environment, from
ingestion to activation, using real-world enterprise architectures.
You'll learn how to transform and modernize familiar solutions,
like data warehouses and data lakes, and you'll be able to leverage
recent AI/ML patterns to get accurate and quicker insights to drive
competitive advantage. This book shows you how to: Design a modern
cloud native or hybrid data analytics and machine learning platform
Accelerate data-led innovation by consolidating enterprise data in
a data platform Democratize access to enterprise data and allow
business teams to extract insights and build AI/ML capabilities
Enable your business to make decisions in real time using streaming
pipelines Move from a descriptive analytics approach to a more
predictive and prescriptive one by building an MLOps platform Make
your organization more effective in working with data analytics and
machine learning in a cloud environment
As you move data to the cloud, you need to consider a comprehensive
approach to data governance, along with well-defined and
agreed-upon policies to ensure your organization meets compliance
requirements. Data governance incorporates the ways people,
processes, and technology work together to ensure data is
trustworthy and can be used effectively. This practical guide shows
you how to effectively implement and scale data governance
throughout your organization. Chief information, data, and security
officers and their teams will learn strategy and tooling to support
democratizing data and unlocking its value while enforcing
security, privacy, and other governance standards. Through good
data governance, you can inspire customer trust, enable your
organization to identify business efficiencies, generate more
competitive offerings, and improve customer experience. This book
shows you how. You'll learn: Data governance strategies addressing
people, processes, and tools Benefits and challenges of a
cloud-based data governance approach How data governance is
conducted from ingest to preparation and use How to handle the
ongoing improvement of data quality Challenges and techniques in
governing streaming data Data protection for authentication,
security, backup, and monitoring How to build a data culture in
your organization
Learn how easy it is to apply sophisticated statistical and machine
learning methods to real-world problems when you build using Google
Cloud Platform (GCP). This hands-on guide shows data engineers and
data scientists how to implement an end-to-end data pipeline with
cloud native tools on GCP. Throughout this updated second edition,
you'll work through a sample business decision by employing a
variety of data science approaches. Follow along by building a data
pipeline in your own project on GCP, and discover how to solve data
science problems in a transformative and more collaborative way.
You'll learn how to: Employ best practices in building highly
scalable data and ML pipelines on Google Cloud Automate and
schedule data ingest using Cloud Run Create and populate a
dashboard in Data Studio Build a real-time analytics pipeline using
Pub/Sub, Dataflow, and BigQuery Conduct interactive data
exploration with BigQuery Create a Bayesian model with Spark on
Cloud Dataproc Forecast time series and do anomaly detection with
BigQuery ML Aggregate within time windows with Dataflow Train
explainable machine learning models with Vertex AI Operationalize
ML with Vertex AI Pipelines
This practical book shows you how to employ machine learning models
to extract information from images. ML engineers and data
scientists will learn how to solve a variety of image problems
including classification, object detection, autoencoders, image
generation, counting, and captioning with proven ML techniques.
This book provides a great introduction to end-to-end deep
learning: dataset creation, data preprocessing, model design, model
training, evaluation, deployment, and interpretability. Google
engineers Valliappa Lakshmanan, Martin Goerner, and Ryan Gillard
show you how to develop accurate and explainable computer vision ML
models and put them into large-scale production using robust ML
architecture in a flexible and maintainable way. You'll learn how
to design, train, evaluate, and predict with models written in
TensorFlow or Keras. You'll learn how to: Design ML architecture
for computer vision tasks Select a model (such as ResNet,
SqueezeNet, or EfficientNet) appropriate to your task Create an
end-to-end ML pipeline to train, evaluate, deploy, and explain your
model Preprocess images for data augmentation and to support
learnability Incorporate explainability and responsible AI best
practices Deploy image models as web services or on edge devices
Monitor and manage ML models
Work with petabyte-scale datasets while building a collaborative,
agile workplace in the process. This practical book is the
canonical reference to Google BigQuery, the query engine that lets
you conduct interactive analysis of large datasets. BigQuery
enables enterprises to efficiently store, query, ingest, and learn
from their data in a convenient framework. With this book, you’ll
examine how to analyze data at scale to derive insights from large
datasets efficiently. Valliappa Lakshmanan, tech lead for Google
Cloud Platform, and Jordan Tigani, engineering director for the
BigQuery team, provide best practices for modern data warehousing
within an autoscaled, serverless public cloud. Whether you want to
explore parts of BigQuery you’re not familiar with or prefer to
focus on specific tasks, this reference is indispensable.
|
|