0
Your cart

Your cart is empty

Browse All Departments
Price
  • R100 - R250 (4)
  • R250 - R500 (63)
  • R500+ (1,222)
  • -
Status
Format
Author / Contributor
Publisher

Books > Computing & IT > Applications of computing > Databases > Data capture & analysis

Cleaning Data for Effective Data Science - Doing the other 80% of the work with Python, R, and command-line tools (Paperback):... Cleaning Data for Effective Data Science - Doing the other 80% of the work with Python, R, and command-line tools (Paperback)
David Mertz
R1,222 Discovery Miles 12 220 Ships in 10 - 15 working days

Think about your data intelligently and ask the right questions Key Features Master data cleaning techniques necessary to perform real-world data science and machine learning tasks Spot common problems with dirty data and develop flexible solutions from first principles Test and refine your newly acquired skills through detailed exercises at the end of each chapter Book DescriptionData cleaning is the all-important first step to successful data science, data analysis, and machine learning. If you work with any kind of data, this book is your go-to resource, arming you with the insights and heuristics experienced data scientists had to learn the hard way. In a light-hearted and engaging exploration of different tools, techniques, and datasets real and fictitious, Python veteran David Mertz teaches you the ins and outs of data preparation and the essential questions you should be asking of every piece of data you work with. Using a mixture of Python, R, and common command-line tools, Cleaning Data for Effective Data Science follows the data cleaning pipeline from start to end, focusing on helping you understand the principles underlying each step of the process. You'll look at data ingestion of a vast range of tabular, hierarchical, and other data formats, impute missing values, detect unreliable data and statistical anomalies, and generate synthetic features. The long-form exercises at the end of each chapter let you get hands-on with the skills you've acquired along the way, also providing a valuable resource for academic courses. What you will learn Ingest and work with common data formats like JSON, CSV, SQL and NoSQL databases, PDF, and binary serialized data structures Understand how and why we use tools such as pandas, SciPy, scikit-learn, Tidyverse, and Bash Apply useful rules and heuristics for assessing data quality and detecting bias, like Benford's law and the 68-95-99.7 rule Identify and handle unreliable data and outliers, examining z-score and other statistical properties Impute sensible values into missing data and use sampling to fix imbalances Use dimensionality reduction, quantization, one-hot encoding, and other feature engineering techniques to draw out patterns in your data Work carefully with time series data, performing de-trending and interpolation Who this book is forThis book is designed to benefit software developers, data scientists, aspiring data scientists, teachers, and students who work with data. If you want to improve your rigor in data hygiene or are looking for a refresher, this book is for you. Basic familiarity with statistics, general concepts in machine learning, knowledge of a programming language (Python or R), and some exposure to data science are helpful.

Hands-On Graph Analytics with Neo4j - Perform graph processing and visualization techniques using connected data across your... Hands-On Graph Analytics with Neo4j - Perform graph processing and visualization techniques using connected data across your enterprise (Paperback)
Estelle Scifo
R1,190 Discovery Miles 11 900 Ships in 10 - 15 working days

Discover how to use Neo4j to identify relationships within complex and large graph datasets using graph modeling, graph algorithms, and machine learning Key Features Get up and running with graph analytics with the help of real-world examples Explore various use cases such as fraud detection, graph-based search, and recommendation systems Get to grips with the Graph Data Science library with the help of examples, and use Neo4j in the cloud for effective application scaling Book DescriptionNeo4j is a graph database that includes plugins to run complex graph algorithms. The book starts with an introduction to the basics of graph analytics, the Cypher query language, and graph architecture components, and helps you to understand why enterprises have started to adopt graph analytics within their organizations. You'll find out how to implement Neo4j algorithms and techniques and explore various graph analytics methods to reveal complex relationships in your data. You'll be able to implement graph analytics catering to different domains such as fraud detection, graph-based search, recommendation systems, social networking, and data management. You'll also learn how to store data in graph databases and extract valuable insights from it. As you become well-versed with the techniques, you'll discover graph machine learning in order to address simple to complex challenges using Neo4j. You will also understand how to use graph data in a machine learning model in order to make predictions based on your data. Finally, you'll get to grips with structuring a web application for production using Neo4j. By the end of this book, you'll not only be able to harness the power of graphs to handle a broad range of problem areas, but you'll also have learned how to use Neo4j efficiently to identify complex relationships in your data. What you will learn Become well-versed with Neo4j graph database building blocks, nodes, and relationships Discover how to create, update, and delete nodes and relationships using Cypher querying Use graphs to improve web search and recommendations Understand graph algorithms such as pathfinding, spatial search, centrality, and community detection Find out different steps to integrate graphs in a normal machine learning pipeline Formulate a link prediction problem in the context of machine learning Implement graph embedding algorithms such as DeepWalk, and use them in Neo4j graphs Who this book is forThis book is for data analysts, business analysts, graph analysts, and database developers looking to store and process graph data to reveal key data insights. This book will also appeal to data scientists who want to build intelligent graph applications catering to different domains. Some experience with Neo4j is required.

D3.js in Action, 2E (Paperback, 2nd ed.): Elijah Meeks D3.js in Action, 2E (Paperback, 2nd ed.)
Elijah Meeks
R1,135 Discovery Miles 11 350 Ships in 9 - 17 working days

D3 allows developers to create scalable graphs for any modern browser. They start with a structure, dataset, or algorithm and programmatically generate static, interactive, or animated images that responsively scale to any screen. D3.js in Action, Second Edition is completely revised and updated for D3 v4 and ES6. It's a practical tutorial for creating interactive graphics and data-driven applications using D3. Readers will start with in-depth explanations of D3's out-of-the-box layouts, along with dozens of realworld use cases that align with different types of visualizations. By the end, readers will be ready to integrate D3.js into their web development process and add data visualization to transform any site or internal application. Key Features: * Completely revised and updated * Practical tutorial * In-depth explanations Readers need basic HTML, CSS, and JavaScript skills. No experience with D3 or SVG is required. About the Technology: D3.js is a JavaScript library that allows data to be represented graphically.

Telling Your Data Story - Data Storytelling for Data Management (Paperback): Scott Taylor Telling Your Data Story - Data Storytelling for Data Management (Paperback)
Scott Taylor
R712 R629 Discovery Miles 6 290 Save R83 (12%) Ships in 10 - 15 working days
Zero to Data Viz as a Tableau Desktop Specialist (Paperback): John J Zugelder Zero to Data Viz as a Tableau Desktop Specialist (Paperback)
John J Zugelder
R993 R898 Discovery Miles 8 980 Save R95 (10%) Ships in 10 - 15 working days
Hands-On Data Analysis with Pandas - A Python data science handbook for data collection, wrangling, analysis, and... Hands-On Data Analysis with Pandas - A Python data science handbook for data collection, wrangling, analysis, and visualization, 2nd Edition (Paperback, 2nd Revised edition)
Stefanie Molin; Foreword by Ken Jee
R1,780 Discovery Miles 17 800 Ships in 10 - 15 working days

Get to grips with pandas by working with real datasets and master data discovery, data manipulation, data preparation, and handling data for analytical tasks Key Features Perform efficient data analysis and manipulation tasks using pandas 1.x Apply pandas to different real-world domains with the help of step-by-step examples Make the most of pandas as an effective data exploration tool Book DescriptionExtracting valuable business insights is no longer a 'nice-to-have', but an essential skill for anyone who handles data in their enterprise. Hands-On Data Analysis with Pandas is here to help beginners and those who are migrating their skills into data science get up to speed in no time. This book will show you how to analyze your data, get started with machine learning, and work effectively with the Python libraries often used for data science, such as pandas, NumPy, matplotlib, seaborn, and scikit-learn. Using real-world datasets, you will learn how to use the pandas library to perform data wrangling to reshape, clean, and aggregate your data. Then, you will learn how to conduct exploratory data analysis by calculating summary statistics and visualizing the data to find patterns. In the concluding chapters, you will explore some applications of anomaly detection, regression, clustering, and classification using scikit-learn to make predictions based on past data. This updated edition will equip you with the skills you need to use pandas 1.x to efficiently perform various data manipulation tasks, reliably reproduce analyses, and visualize your data for effective decision making - valuable knowledge that can be applied across multiple domains. What you will learn Understand how data analysts and scientists gather and analyze data Perform data analysis and data wrangling using Python Combine, group, and aggregate data from multiple sources Create data visualizations with pandas, matplotlib, and seaborn Apply machine learning algorithms to identify patterns and make predictions Use Python data science libraries to analyze real-world datasets Solve common data representation and analysis problems using pandas Build Python scripts, modules, and packages for reusable analysis code Who this book is forThis book is for data science beginners, data analysts, and Python developers who want to explore each stage of data analysis and scientific computing using a wide range of datasets. Data scientists looking to implement pandas in their machine learning workflow will also find plenty of valuable know-how as they progress. You'll find it easier to follow along with this book if you have a working knowledge of the Python programming language, but a Python crash-course tutorial is provided in the code bundle for anyone who needs a refresher.

Linear Regression Analysis 2e (Hardcover, 2nd Edition): G.A.F. Seber Linear Regression Analysis 2e (Hardcover, 2nd Edition)
G.A.F. Seber
R4,483 Discovery Miles 44 830 Ships in 9 - 17 working days

An extensive treatment of a key method in the statistician’s toolbox

For more than two decades, the First Edition of Linear Regression Analysis has been an authoritative resource for one of the most common methods of handling statistical data. There have been many advances in the field over the last twenty years, including the development of more efficient and accurate regression computer programs, new ways of fitting regressions, and new methods of model selection and prediction. Linear Regression Analysis, Second Edition, revises and expands this standard text, providing extensive coverage of state-of-the-art theory and applications of linear regression analysis.

Requiring no specialized knowledge beyond a good grasp of matrix algebra and some acquaintance with straight-line regression and simple analysis of variance models, this new edition features:

  • Up-to-date accounts of computational methods and algorithms currently in use without getting entrenched in minor computing details
  • A careful and detailed survey of the research literature, making this a highly useful reference
  • Expanded coverage of diagnostics, and more discussion of methods of model fitting, model selection and prediction
  • More than 200 problems throughout the book plus outline solutions

Concise, mathematically clear, and comprehensive, Linear Regression Analysis, Second Edition, serves as both a reliable reference for the practitioner and a valuable textbook for the student.

Data Engineering with Python - Work with massive datasets to design data models and automate data pipelines using Python... Data Engineering with Python - Work with massive datasets to design data models and automate data pipelines using Python (Paperback)
Paul Crickard
R1,347 Discovery Miles 13 470 Ships in 10 - 15 working days

Build, monitor, and manage real-time data pipelines to create data engineering infrastructure efficiently using open-source Apache projects Key Features Become well-versed in data architectures, data preparation, and data optimization skills with the help of practical examples Design data models and learn how to extract, transform, and load (ETL) data using Python Schedule, automate, and monitor complex data pipelines in production Book DescriptionData engineering provides the foundation for data science and analytics, and forms an important part of all businesses. This book will help you to explore various tools and methods that are used for understanding the data engineering process using Python. The book will show you how to tackle challenges commonly faced in different aspects of data engineering. You'll start with an introduction to the basics of data engineering, along with the technologies and frameworks required to build data pipelines to work with large datasets. You'll learn how to transform and clean data and perform analytics to get the most out of your data. As you advance, you'll discover how to work with big data of varying complexity and production databases, and build data pipelines. Using real-world examples, you'll build architectures on which you'll learn how to deploy data pipelines. By the end of this Python book, you'll have gained a clear understanding of data modeling techniques, and will be able to confidently build data engineering pipelines for tracking data, running quality checks, and making necessary changes in production. What you will learn Understand how data engineering supports data science workflows Discover how to extract data from files and databases and then clean, transform, and enrich it Configure processors for handling different file formats as well as both relational and NoSQL databases Find out how to implement a data pipeline and dashboard to visualize results Use staging and validation to check data before landing in the warehouse Build real-time pipelines with staging areas that perform validation and handle failures Get to grips with deploying pipelines in the production environment Who this book is forThis book is for data analysts, ETL developers, and anyone looking to get started with or transition to the field of data engineering or refresh their knowledge of data engineering using Python. This book will also be useful for students planning to build a career in data engineering or IT professionals preparing for a transition. No previous knowledge of data engineering is required.

Managing Data Science - Effective strategies to manage data science projects and build a sustainable team (Paperback): Kirill... Managing Data Science - Effective strategies to manage data science projects and build a sustainable team (Paperback)
Kirill Dubovikov
R910 Discovery Miles 9 100 Ships in 10 - 15 working days

Understand data science concepts and methodologies to manage and deliver top-notch solutions for your organization Key Features Learn the basics of data science and explore its possibilities and limitations Manage data science projects and assemble teams effectively even in the most challenging situations Understand management principles and approaches for data science projects to streamline the innovation process Book DescriptionData science and machine learning can transform any organization and unlock new opportunities. However, employing the right management strategies is crucial to guide the solution from prototype to production. Traditional approaches often fail as they don't entirely meet the conditions and requirements necessary for current data science projects. In this book, you'll explore the right approach to data science project management, along with useful tips and best practices to guide you along the way. After understanding the practical applications of data science and artificial intelligence, you'll see how to incorporate them into your solutions. Next, you will go through the data science project life cycle, explore the common pitfalls encountered at each step, and learn how to avoid them. Any data science project requires a skilled team, and this book will offer the right advice for hiring and growing a data science team for your organization. Later, you'll be shown how to efficiently manage and improve your data science projects through the use of DevOps and ModelOps. By the end of this book, you will be well versed with various data science solutions and have gained practical insights into tackling the different challenges that you'll encounter on a daily basis. What you will learn Understand the underlying problems of building a strong data science pipeline Explore the different tools for building and deploying data science solutions Hire, grow, and sustain a data science team Manage data science projects through all stages, from prototype to production Learn how to use ModelOps to improve your data science pipelines Get up to speed with the model testing techniques used in both development and production stages Who this book is forThis book is for data scientists, analysts, and program managers who want to use data science for business productivity by incorporating data science workflows efficiently. Some understanding of basic data science concepts will be useful to get the most out of this book.

Python Data Cleaning Cookbook - Modern techniques and Python tools to detect and remove dirty data and extract key insights... Python Data Cleaning Cookbook - Modern techniques and Python tools to detect and remove dirty data and extract key insights (Paperback)
Michael Walker
R1,309 Discovery Miles 13 090 Ships in 10 - 15 working days

Discover how to describe your data in detail, identify data issues, and find out how to solve them using commonly used techniques and tips and tricks Key Features Get well-versed with various data cleaning techniques to reveal key insights Manipulate data of different complexities to shape them into the right form as per your business needs Clean, monitor, and validate large data volumes to diagnose problems before moving on to data analysis Book DescriptionGetting clean data to reveal insights is essential, as directly jumping into data analysis without proper data cleaning may lead to incorrect results. This book shows you tools and techniques that you can apply to clean and handle data with Python. You'll begin by getting familiar with the shape of data by using practices that can be deployed routinely with most data sources. Then, the book teaches you how to manipulate data to get it into a useful form. You'll also learn how to filter and summarize data to gain insights and better understand what makes sense and what does not, along with discovering how to operate on data to address the issues you've identified. Moving on, you'll perform key tasks, such as handling missing values, validating errors, removing duplicate data, monitoring high volumes of data, and handling outliers and invalid dates. Next, you'll cover recipes on using supervised learning and Naive Bayes analysis to identify unexpected values and classification errors, and generate visualizations for exploratory data analysis (EDA) to visualize unexpected values. Finally, you'll build functions and classes that you can reuse without modification when you have new data. By the end of this Python book, you'll be equipped with all the key skills that you need to clean data and diagnose problems within it. What you will learn Find out how to read and analyze data from a variety of sources Produce summaries of the attributes of data frames, columns, and rows Filter data and select columns of interest that satisfy given criteria Address messy data issues, including working with dates and missing values Improve your productivity in Python pandas by using method chaining Use visualizations to gain additional insights and identify potential data issues Enhance your ability to learn what is going on in your data Build user-defined functions and classes to automate data cleaning Who this book is forThis book is for anyone looking for ways to handle messy, duplicate, and poor data using different Python tools and techniques. The book takes a recipe-based approach to help you to learn how to clean and manage data. Working knowledge of Python programming is all you need to get the most out of the book.

Hands-On SQL Server 2019 Analysis Services - Design and query tabular and multi-dimensional models using Microsoft's SQL... Hands-On SQL Server 2019 Analysis Services - Design and query tabular and multi-dimensional models using Microsoft's SQL Server Analysis Services (Paperback)
Steven Hughes; Foreword by Adam Jorgensen
R1,470 Discovery Miles 14 700 Ships in 10 - 15 working days

Get up to speed with the new features added to Microsoft SQL Server 2019 Analysis Services and create models to support your business Key Features Explore tips and tricks to design, develop, and optimize end-to-end data analytics solutions using Microsoft's technologies Learn tabular modeling and multi-dimensional cube design development using real-world examples Implement Analysis Services to help you make productive business decisions Book DescriptionSQL Server Analysis Services (SSAS) continues to be a leading enterprise-scale toolset, enabling customers to deliver data and analytics across large datasets with great performance. This book will help you understand MS SQL Server 2019's new features and improvements, especially when it comes to SSAS. First, you'll cover a quick overview of SQL Server 2019, learn how to choose the right analytical model to use, and understand their key differences. You'll then explore how to create a multi-dimensional model with SSAS and expand on that model with MDX. Next, you'll create and deploy a tabular model using Microsoft Visual Studio and Management Studio. You'll learn when and how to use both tabular and multi-dimensional model types, how to deploy and configure your servers to support them, and design principles that are relevant to each model. The book comes packed with tips and tricks to build measures, optimize your design, and interact with models using Excel and Power BI. All this will help you visualize data to gain useful insights and make better decisions. Finally, you'll discover practices and tools for securing and maintaining your models once they are deployed. By the end of this MS SQL Server book, you'll be able to choose the right model and build and deploy it to support the analytical needs of your business. What you will learn Determine the best analytical model using SSAS Cover the core aspects involved in MDX, including writing your first query Implement calculated tables and calculation groups (new in version 2019) in DAX Create and deploy tabular and multi-dimensional models on SQL 2019 Connect and create data visualizations using Excel and Power BI Implement row-level and other data security methods with tabular and multi-dimensional models Explore essential concepts and techniques to scale, manage, and optimize your SSAS solutions Who this book is forThis Microsoft SQL Server book is for BI professionals and data analysts who are looking for a practical guide to creating and maintaining tabular and multi-dimensional models using SQL Server 2019 Analysis Services. A basic working knowledge of BI solutions such as Power BI and database querying is required.

Python Data Analytics - The Beginner's Real World Crash Course (Paperback): Travis Booth Python Data Analytics - The Beginner's Real World Crash Course (Paperback)
Travis Booth
R545 Discovery Miles 5 450 Ships in 10 - 15 working days
Learn Power BI - Step by Step Guide to Building Your Own Reports (Paperback): Derek Wilson Learn Power BI - Step by Step Guide to Building Your Own Reports (Paperback)
Derek Wilson
R279 Discovery Miles 2 790 Ships in 10 - 15 working days
C# and Oledb - Working with the Dataview (Paperback): Richard Thomas Edwards C# and Oledb - Working with the Dataview (Paperback)
Richard Thomas Edwards
R400 Discovery Miles 4 000 Ships in 10 - 15 working days
C# And SQL CLient - Working with the Dataview (Paperback): Richard Thomas Edwards C# And SQL CLient - Working with the Dataview (Paperback)
Richard Thomas Edwards
R400 Discovery Miles 4 000 Ships in 10 - 15 working days
Hands-On SAS for Data Analysis - A practical guide to performing effective queries, data visualization, and reporting... Hands-On SAS for Data Analysis - A practical guide to performing effective queries, data visualization, and reporting techniques (Paperback)
Harish Gulati
R1,321 Discovery Miles 13 210 Ships in 10 - 15 working days

Leverage the full potential of SAS to get unique, actionable insights from your data Key Features Build enterprise-class data solutions using SAS and become well-versed in SAS programming Work with different data structures, and run SQL queries to manipulate your data Explore essential concepts and techniques with practical examples to confidently pass the SAS certification exam Book DescriptionSAS is one of the leading enterprise tools in the world today when it comes to data management and analysis. It enables the fast and easy processing of data and helps you gain valuable business insights for effective decision-making. This book will serve as a comprehensive guide that will prepare you for the SAS certification exam. After a quick overview of the SAS architecture and components, the book will take you through the different approaches to importing and reading data from different sources using SAS. You will then cover SAS Base and 4GL, understanding data management and analysis, along with exploring SAS functions for data manipulation and transformation. Next, you'll discover SQL procedures and get up to speed on creating and validating queries. In the concluding chapters, you'll learn all about data visualization, right from creating bar charts and sample geographic maps through to assigning patterns and formats. In addition to this, the book will focus on macro programming and its advanced aspects. By the end of this book, you will be well versed in SAS programming and have the skills you need to easily handle and manage your data-related problems in SAS. What you will learn Explore a variety of SAS modules and packages for efficient data analysis Use SAS 4GL functions to manipulate, merge, sort, and transform data Gain useful insights into advanced PROC SQL options in SAS to interact with data Get to grips with SAS Macro and define your own macros to share data Discover the different graphical libraries to shape and visualize data with Apply the SAS Output Delivery System to prepare detailed reports Who this book is forBudding or experienced data professionals who want to get started with SAS will benefit from this book. Those looking to prepare for the SAS certification exam will also find this book to be a useful resource. Some understanding of basic data management concepts will help you get the most out of this book.

Machine Learning Using TensorFlow Cookbook - Create powerful machine learning algorithms with TensorFlow (Paperback): Alexia... Machine Learning Using TensorFlow Cookbook - Create powerful machine learning algorithms with TensorFlow (Paperback)
Alexia Audevart, Konrad Banachewicz, Luca Massaron
R972 Discovery Miles 9 720 Ships in 10 - 15 working days

Comprehensive recipes to give you valuable insights on Transformers, Reinforcement Learning, and more Key Features Deep Learning solutions from Kaggle Masters and Google Developer Experts Get to grips with the fundamentals including variables, matrices, and data sources Learn advanced techniques to make your algorithms faster and more accurate Book DescriptionThe independent recipes in Machine Learning Using TensorFlow Cookbook will teach you how to perform complex data computations and gain valuable insights into your data. Dive into recipes on training models, model evaluation, sentiment analysis, regression analysis, artificial neural networks, and deep learning - each using Google's machine learning library, TensorFlow. This cookbook covers the fundamentals of the TensorFlow library, including variables, matrices, and various data sources. You'll discover real-world implementations of Keras and TensorFlow and learn how to use estimators to train linear models and boosted trees, both for classification and regression. Explore the practical applications of a variety of deep learning architectures, such as recurrent neural networks and Transformers, and see how they can be used to solve computer vision and natural language processing (NLP) problems. With the help of this book, you will be proficient in using TensorFlow, understand deep learning from the basics, and be able to implement machine learning algorithms in real-world scenarios. What you will learn Take TensorFlow into production Implement and fine-tune Transformer models for various NLP tasks Apply reinforcement learning algorithms using the TF-Agents framework Understand linear regression techniques and use Estimators to train linear models Execute neural networks and improve predictions on tabular data Master convolutional neural networks and recurrent neural networks through practical recipes Who this book is forIf you are a data scientist or a machine learning engineer, and you want to skip detailed theoretical explanations in favor of building production-ready machine learning models using TensorFlow, this book is for you. Basic familiarity with Python, linear algebra, statistics, and machine learning is necessary to make the most out of this book.

Mastering PostgreSQL 13 - Build, administer, and maintain database applications efficiently with PostgreSQL 13, 4th Edition... Mastering PostgreSQL 13 - Build, administer, and maintain database applications efficiently with PostgreSQL 13, 4th Edition (Paperback, 4th Revised edition)
Hans-Jurgen Schoenig
R1,215 Discovery Miles 12 150 Ships in 10 - 15 working days

Explore expert techniques such as advanced indexing and high availability to build scalable, reliable, and fault-tolerant database applications using PostgreSQL 13 Key Features Master advanced PostgreSQL 13 concepts with the help of real-world datasets and examples Leverage PostgreSQL's indexing features to fine-tune the performance of your queries Extend PostgreSQL's functionalities to suit your organization's needs with minimal effort Book DescriptionThanks to its reliability, robustness, and high performance, PostgreSQL has become one of the most advanced open source databases on the market. This updated fourth edition will help you understand PostgreSQL administration and how to build dynamic database solutions for enterprise apps with the latest release of PostgreSQL, including designing both physical and technical aspects of the system architecture with ease. Starting with an introduction to the new features in PostgreSQL 13, this book will guide you in building efficient and fault-tolerant PostgreSQL apps. You'll explore advanced PostgreSQL features, such as logical replication, database clusters, performance tuning, advanced indexing, monitoring, and user management, to manage and maintain your database. You'll then work with the PostgreSQL optimizer, configure PostgreSQL for high speed, and move from Oracle to PostgreSQL. The book also covers transactions, locking, and indexes, and shows you how to improve performance with query optimization. You'll also focus on how to manage network security and work with backups and replication while exploring useful PostgreSQL extensions that optimize the performance of large databases. By the end of this PostgreSQL book, you'll be able to get the most out of your database by executing advanced administrative tasks. What you will learn Get well versed with advanced SQL functions in PostgreSQL 13 Get to grips with administrative tasks such as log file management and monitoring Work with stored procedures and manage backup and recovery Employ replication and failover techniques to reduce data loss Perform database migration from Oracle to PostgreSQL with ease Replicate PostgreSQL database systems to create backups and scale your database Manage and improve server security to protect your data Troubleshoot your PostgreSQL instance to find solutions to common and not-so-common problems Who this book is forThis database administration book is for PostgreSQL developers and database administrators and professionals who want to implement advanced functionalities and master complex administrative tasks with PostgreSQL 13. Prior experience in PostgreSQL and familiarity with the basics of database administration will assist with understanding key concepts covered in the book.

Business Skills for Data Scientists - Practical Guidance in Six Key Topics (Paperback): David Stephenson Business Skills for Data Scientists - Practical Guidance in Six Key Topics (Paperback)
David Stephenson; Foreword by John Elder
R844 Discovery Miles 8 440 Ships in 10 - 15 working days
Advanced Natural Language Processing with TensorFlow 2 - Build effective real-world NLP applications using NER, RNNs, seq2seq... Advanced Natural Language Processing with TensorFlow 2 - Build effective real-world NLP applications using NER, RNNs, seq2seq models, Transformers, and more (Paperback)
Ashish Bansal
R1,180 Discovery Miles 11 800 Ships in 10 - 15 working days

One-stop solution for NLP practitioners, ML developers, and data scientists to build effective NLP systems that can perform real-world complicated tasks Key Features Apply deep learning algorithms and techniques such as BiLSTMS, CRFs, BPE and more using TensorFlow 2 Explore applications like text generation, summarization, weakly supervised labelling and more Read cutting edge material with seminal papers provided in the GitHub repository with full working code Book DescriptionRecently, there have been tremendous advances in NLP, and we are now moving from research labs into practical applications. This book comes with a perfect blend of both the theoretical and practical aspects of trending and complex NLP techniques. The book is focused on innovative applications in the field of NLP, language generation, and dialogue systems. It helps you apply the concepts of pre-processing text using techniques such as tokenization, parts of speech tagging, and lemmatization using popular libraries such as Stanford NLP and SpaCy. You will build Named Entity Recognition (NER) from scratch using Conditional Random Fields and Viterbi Decoding on top of RNNs. The book covers key emerging areas such as generating text for use in sentence completion and text summarization, bridging images and text by generating captions for images, and managing dialogue aspects of chatbots. You will learn how to apply transfer learning and fine-tuning using TensorFlow 2. Further, it covers practical techniques that can simplify the labelling of textual data. The book also has a working code that is adaptable to your use cases for each tech piece. By the end of the book, you will have an advanced knowledge of the tools, techniques and deep learning architecture used to solve complex NLP problems. What you will learn Grasp important pre-steps in building NLP applications like POS tagging Use transfer and weakly supervised learning using libraries like Snorkel Do sentiment analysis using BERT Apply encoder-decoder NN architectures and beam search for summarizing texts Use Transformer models with attention to bring images and text together Build apps that generate captions and answer questions about images using custom Transformers Use advanced TensorFlow techniques like learning rate annealing, custom layers, and custom loss functions to build the latest DeepNLP models Who this book is forThis is not an introductory book and assumes the reader is familiar with basics of NLP and has fundamental Python skills, as well as basic knowledge of machine learning and undergraduate-level calculus and linear algebra. The readers who can benefit the most from this book include intermediate ML developers who are familiar with the basics of supervised learning and deep learning techniques and professionals who already use TensorFlow/Python for purposes such as data science, ML, research, analysis, etc.

Essential Statistics for Non-STEM Data Analysts - Get to grips with the statistics and math knowledge needed to enter the world... Essential Statistics for Non-STEM Data Analysts - Get to grips with the statistics and math knowledge needed to enter the world of data science with Python (Paperback)
Rongpeng Li
R1,187 Discovery Miles 11 870 Ships in 10 - 15 working days

Reinforce your understanding of data science and data analysis from a statistical perspective to extract meaningful insights from your data using Python programming Key Features Work your way through the entire data analysis pipeline with statistics concerns in mind to make reasonable decisions Understand how various data science algorithms function Build a solid foundation in statistics for data science and machine learning using Python-based examples Book DescriptionStatistics remain the backbone of modern analysis tasks, helping you to interpret the results produced by data science pipelines. This book is a detailed guide covering the math and various statistical methods required for undertaking data science tasks. The book starts by showing you how to preprocess data and inspect distributions and correlations from a statistical perspective. You'll then get to grips with the fundamentals of statistical analysis and apply its concepts to real-world datasets. As you advance, you'll find out how statistical concepts emerge from different stages of data science pipelines, understand the summary of datasets in the language of statistics, and use it to build a solid foundation for robust data products such as explanatory models and predictive models. Once you've uncovered the working mechanism of data science algorithms, you'll cover essential concepts for efficient data collection, cleaning, mining, visualization, and analysis. Finally, you'll implement statistical methods in key machine learning tasks such as classification, regression, tree-based methods, and ensemble learning. By the end of this Essential Statistics for Non-STEM Data Analysts book, you'll have learned how to build and present a self-contained, statistics-backed data product to meet your business goals. What you will learn Find out how to grab and load data into an analysis environment Perform descriptive analysis to extract meaningful summaries from data Discover probability, parameter estimation, hypothesis tests, and experiment design best practices Get to grips with resampling and bootstrapping in Python Delve into statistical tests with variance analysis, time series analysis, and A/B test examples Understand the statistics behind popular machine learning algorithms Answer questions on statistics for data scientist interviews Who this book is forThis book is an entry-level guide for data science enthusiasts, data analysts, and anyone starting out in the field of data science and looking to learn the essential statistical concepts with the help of simple explanations and examples. If you're a developer or student with a non-mathematical background, you'll find this book useful. Working knowledge of the Python programming language is required.

Getting Started with Google BERT - Build and train state-of-the-art natural language processing models using BERT (Paperback):... Getting Started with Google BERT - Build and train state-of-the-art natural language processing models using BERT (Paperback)
Sudharsan Ravichandiran
R1,169 Discovery Miles 11 690 Ships in 10 - 15 working days

Kickstart your NLP journey by exploring BERT and its variants such as ALBERT, RoBERTa, DistilBERT, VideoBERT, and more with Hugging Face's transformers library Key Features Explore the encoder and decoder of the transformer model Become well-versed with BERT along with ALBERT, RoBERTa, and DistilBERT Discover how to pre-train and fine-tune BERT models for several NLP tasks Book DescriptionBERT (bidirectional encoder representations from transformer) has revolutionized the world of natural language processing (NLP) with promising results. This book is an introductory guide that will help you get to grips with Google's BERT architecture. With a detailed explanation of the transformer architecture, this book will help you understand how the transformer's encoder and decoder work. You'll explore the BERT architecture by learning how the BERT model is pre-trained and how to use pre-trained BERT for downstream tasks by fine-tuning it for NLP tasks such as sentiment analysis and text summarization with the Hugging Face transformers library. As you advance, you'll learn about different variants of BERT such as ALBERT, RoBERTa, and ELECTRA, and look at SpanBERT, which is used for NLP tasks like question answering. You'll also cover simpler and faster BERT variants based on knowledge distillation such as DistilBERT and TinyBERT. The book takes you through MBERT, XLM, and XLM-R in detail and then introduces you to sentence-BERT, which is used for obtaining sentence representation. Finally, you'll discover domain-specific BERT models such as BioBERT and ClinicalBERT, and discover an interesting variant called VideoBERT. By the end of this BERT book, you'll be well-versed with using BERT and its variants for performing practical NLP tasks. What you will learn Understand the transformer model from the ground up Find out how BERT works and pre-train it using masked language model (MLM) and next sentence prediction (NSP) tasks Get hands-on with BERT by learning to generate contextual word and sentence embeddings Fine-tune BERT for downstream tasks Get to grips with ALBERT, RoBERTa, ELECTRA, and SpanBERT models Get the hang of the BERT models based on knowledge distillation Understand cross-lingual models such as XLM and XLM-R Explore Sentence-BERT, VideoBERT, and BART Who this book is forThis book is for NLP professionals and data scientists looking to simplify NLP tasks to enable efficient language understanding using BERT. A basic understanding of NLP concepts and deep learning is required to get the best out of this book.

Python Data Analysis - Perform data collection, data processing, wrangling, visualization, and model building using Python... Python Data Analysis - Perform data collection, data processing, wrangling, visualization, and model building using Python (Paperback, 3rd Revised edition)
Avinash Navlani, Armando Fandango, Ivan Idris
R1,109 Discovery Miles 11 090 Ships in 10 - 15 working days

Understand data analysis pipelines using machine learning algorithms and techniques with this practical guide Key Features Prepare and clean your data to use it for exploratory analysis, data manipulation, and data wrangling Discover supervised, unsupervised, probabilistic, and Bayesian machine learning methods Get to grips with graph processing and sentiment analysis Book DescriptionData analysis enables you to generate value from small and big data by discovering new patterns and trends, and Python is one of the most popular tools for analyzing a wide variety of data. With this book, you'll get up and running using Python for data analysis by exploring the different phases and methodologies used in data analysis and learning how to use modern libraries from the Python ecosystem to create efficient data pipelines. Starting with the essential statistical and data analysis fundamentals using Python, you'll perform complex data analysis and modeling, data manipulation, data cleaning, and data visualization using easy-to-follow examples. You'll then understand how to conduct time series analysis and signal processing using ARMA models. As you advance, you'll get to grips with smart processing and data analytics using machine learning algorithms such as regression, classification, Principal Component Analysis (PCA), and clustering. In the concluding chapters, you'll work on real-world examples to analyze textual and image data using natural language processing (NLP) and image analytics techniques, respectively. Finally, the book will demonstrate parallel computing using Dask. By the end of this data analysis book, you'll be equipped with the skills you need to prepare data for analysis and create meaningful data visualizations for forecasting values from data. What you will learn Explore data science and its various process models Perform data manipulation using NumPy and pandas for aggregating, cleaning, and handling missing values Create interactive visualizations using Matplotlib, Seaborn, and Bokeh Retrieve, process, and store data in a wide range of formats Understand data preprocessing and feature engineering using pandas and scikit-learn Perform time series analysis and signal processing using sunspot cycle data Analyze textual data and image data to perform advanced analysis Get up to speed with parallel computing using Dask Who this book is forThis book is for data analysts, business analysts, statisticians, and data scientists looking to learn how to use Python for data analysis. Students and academic faculties will also find this book useful for learning and teaching Python data analysis using a hands-on approach. A basic understanding of math and working knowledge of the Python programming language will help you get started with this book.

Hands-On Deep Learning with R - A practical guide to designing, building, and improving neural network models using R... Hands-On Deep Learning with R - A practical guide to designing, building, and improving neural network models using R (Paperback)
Michael Pawlus, Rodger Devine
R1,160 Discovery Miles 11 600 Ships in 10 - 15 working days

Explore and implement deep learning to solve various real-world problems using modern R libraries such as TensorFlow, MXNet, H2O, and Deepnet Key Features Understand deep learning algorithms and architectures using R and determine which algorithm is best suited for a specific problem Improve models using parameter tuning, feature engineering, and ensembling Apply advanced neural network models such as deep autoencoders and generative adversarial networks (GANs) across different domains Book DescriptionDeep learning enables efficient and accurate learning from a massive amount of data. This book will help you overcome a number of challenges using various deep learning algorithms and architectures with R programming. This book starts with a brief overview of machine learning and deep learning and how to build your first neural network. You'll understand the architecture of various deep learning algorithms and their applicable fields, learn how to build deep learning models, optimize hyperparameters, and evaluate model performance. Various deep learning applications in image processing, natural language processing (NLP), recommendation systems, and predictive analytics will also be covered. Later chapters will show you how to tackle recognition problems such as image recognition and signal detection, programmatically summarize documents, conduct topic modeling, and forecast stock market prices. Toward the end of the book, you will learn the common applications of GANs and how to build a face generation model using them. Finally, you'll get to grips with using reinforcement learning and deep reinforcement learning to solve various real-world problems. By the end of this deep learning book, you will be able to build and deploy your own deep learning applications using appropriate frameworks and algorithms. What you will learn Design a feedforward neural network to see how the activation function computes an output Create an image recognition model using convolutional neural networks (CNNs) Prepare data, decide hidden layers and neurons and train your model with the backpropagation algorithm Apply text cleaning techniques to remove uninformative text using NLP Build, train, and evaluate a GAN model for face generation Understand the concept and implementation of reinforcement learning in R Who this book is forThis book is for data scientists, machine learning engineers, and deep learning developers who are familiar with machine learning and are looking to enhance their knowledge of deep learning using practical examples. Anyone interested in increasing the efficiency of their machine learning applications and exploring various options in R will also find this book useful. Basic knowledge of machine learning techniques and working knowledge of the R programming language is expected.

Automated Machine Learning - Hyperparameter optimization, neural architecture search, and algorithm selection with cloud... Automated Machine Learning - Hyperparameter optimization, neural architecture search, and algorithm selection with cloud platforms (Paperback)
Adnan Masood; Foreword by Ahmed Sherif
R1,190 Discovery Miles 11 900 Ships in 10 - 15 working days

Get to grips with automated machine learning and adopt a hands-on approach to AutoML implementation and associated methodologies Key Features Get up to speed with AutoML using OSS, Azure, AWS, GCP, or any platform of your choice Eliminate mundane tasks in data engineering and reduce human errors in machine learning models Find out how you can make machine learning accessible for all users to promote decentralized processes Book DescriptionEvery machine learning engineer deals with systems that have hyperparameters, and the most basic task in automated machine learning (AutoML) is to automatically set these hyperparameters to optimize performance. The latest deep neural networks have a wide range of hyperparameters for their architecture, regularization, and optimization, which can be customized effectively to save time and effort. This book reviews the underlying techniques of automated feature engineering, model and hyperparameter tuning, gradient-based approaches, and much more. You'll discover different ways of implementing these techniques in open source tools and then learn to use enterprise tools for implementing AutoML in three major cloud service providers: Microsoft Azure, Amazon Web Services (AWS), and Google Cloud Platform. As you progress, you'll explore the features of cloud AutoML platforms by building machine learning models using AutoML. The book will also show you how to develop accurate models by automating time-consuming and repetitive tasks in the machine learning development lifecycle. By the end of this machine learning book, you'll be able to build and deploy AutoML models that are not only accurate, but also increase productivity, allow interoperability, and minimize feature engineering tasks. What you will learn Explore AutoML fundamentals, underlying methods, and techniques Assess AutoML aspects such as algorithm selection, auto featurization, and hyperparameter tuning in an applied scenario Find out the difference between cloud and operations support systems (OSS) Implement AutoML in enterprise cloud to deploy ML models and pipelines Build explainable AutoML pipelines with transparency Understand automated feature engineering and time series forecasting Automate data science modeling tasks to implement ML solutions easily and focus on more complex problems Who this book is forCitizen data scientists, machine learning developers, artificial intelligence enthusiasts, or anyone looking to automatically build machine learning models using the features offered by open source tools, Microsoft Azure Machine Learning, AWS, and Google Cloud Platform will find this book useful. Beginner-level knowledge of building ML models is required to get the best out of this book. Prior experience in using Enterprise cloud is beneficial.

Free Delivery
Pinterest Twitter Facebook Google+
You may like...
Supersymmetry
Vladimir K Dobrev Hardcover R3,861 Discovery Miles 38 610
DR. Simply A4 Hardcover Sketch Book…
R371 R347 Discovery Miles 3 470
Variational Object-Oriented Programming…
Mira Mezini Hardcover R4,507 Discovery Miles 45 070
Measurement of Image Velocity
David J. Fleet Hardcover R2,999 Discovery Miles 29 990
Probabilistic Nodes Combination (PNC…
Dariusz Jacek Jakobczak Hardcover R4,776 Discovery Miles 47 760
Group Theory, Algebra, and Number Theory…
Horst G. Zimmer Hardcover R5,117 Discovery Miles 51 170
Etchr A5 Perfect Sketchbook - Not…
R1,203 R1,076 Discovery Miles 10 760
Symbol Correspondences for Spin Systems
Pedro de M. Rios, Eldar Straume Hardcover R3,220 R1,969 Discovery Miles 19 690
Treeline A6 Lined Memo Hardcover Books…
R132 Discovery Miles 1 320
The Evolution of Consciousness…
Paula Droege Hardcover R3,201 Discovery Miles 32 010

 

Partners