0
Your cart

Your cart is empty

Browse All Departments
Price
  • R100 - R250 (8)
  • R250 - R500 (70)
  • R500+ (1,214)
  • -
Status
Format
Author / Contributor
Publisher

Books > Computing & IT > Applications of computing > Databases > Data capture & analysis

Data Analysis in Criminal Justice and Criminology - History, Concept, and Application (Paperback): Philip D. McCormack, Angela... Data Analysis in Criminal Justice and Criminology - History, Concept, and Application (Paperback)
Philip D. McCormack, Angela Callahan
R3,670 Discovery Miles 36 700 Ships in 10 - 15 working days

Data Analysis in Criminal Justice and Criminology: History, Concept, and Application breaks down various data analysis techniques to help students build their conceptual understanding of key methods and processes. The information in the text encourages discussion and consideration of how and why data analysis plays an important role in the fields of criminal justice and criminology. The book is divided into three units. Unit 1 discusses how data analysis is used in criminal justice and criminology, various methods of data collection, the importance of identifying the purpose of analysis and key data elements prior to analyzing information, and graphical representation of data. Unit 2 introduces students to samples, distributions, and the central limit theorem as it relates to data analysis. This section provides students with the essential knowledge and skills needed to understand statistical concepts and calculations. The final unit explains how to move beyond statistical description to statistical inference and how sample statistics can be used to estimate population parameters. Highly accessible in nature, Data Analysis in Criminal Justice and Criminology is ideal for undergraduate and graduate courses in criminal justice, criminology, and sociology especially those with emphasis on data analysis.

Hands-On Data Analysis with NumPy and pandas - Implement Python packages from data manipulation to processing (Paperback):... Hands-On Data Analysis with NumPy and pandas - Implement Python packages from data manipulation to processing (Paperback)
Curtis Miller
R785 Discovery Miles 7 850 Ships in 10 - 15 working days

Get to grips with the most popular Python packages that make data analysis possible Key Features Explore the tools you need to become a data analyst Discover practical examples to help you grasp data processing concepts Walk through hierarchical indexing and grouping for data analysis Book DescriptionPython, a multi-paradigm programming language, has become the language of choice for data scientists for visualization, data analysis, and machine learning. Hands-On Data Analysis with NumPy and Pandas starts by guiding you in setting up the right environment for data analysis with Python, along with helping you install the correct Python distribution. In addition to this, you will work with the Jupyter notebook and set up a database. Once you have covered Jupyter, you will dig deep into Python's NumPy package, a powerful extension with advanced mathematical functions. You will then move on to creating NumPy arrays and employing different array methods and functions. You will explore Python's pandas extension which will help you get to grips with data mining and learn to subset your data. Last but not the least you will grasp how to manage your datasets by sorting and ranking them. By the end of this book, you will have learned to index and group your data for sophisticated data analysis and manipulation. What you will learn Understand how to install and manage Anaconda Read, sort, and map data using NumPy and pandas Find out how to create and slice data arrays using NumPy Discover how to subset your DataFrames using pandas Handle missing data in a pandas DataFrame Explore hierarchical indexing and plotting with pandas Who this book is forHands-On Data Analysis with NumPy and Pandas is for you if you are a Python developer and want to take your first steps into the world of data analysis. No previous experience of data analysis is required to enjoy this book.

MicroStrategy Quick Start Guide - Data analytics and visualizations for Business Intelligence (Paperback): Fernando Carlos... MicroStrategy Quick Start Guide - Data analytics and visualizations for Business Intelligence (Paperback)
Fernando Carlos Rivero Esqueda
R1,081 Discovery Miles 10 810 Ships in 10 - 15 working days

Build reporting applications and dashboards using the different MicroStrategy objects Key Features Learn the fundamentals of MicroStrategy Use MicroStrategy to get actionable insights from your business data Create visualizations and build intuitive dashboards and reports Book DescriptionMicroStrategy is an enterprise business intelligence application. It turns data into reports for making and executing key organization decisions. This book shows you how to implement Business Intelligence (BI) with MicroStrategy. It takes you from setting up and configuring MicroStrategy to security and administration. The book starts by detailing the different components of the MicroStrategy platform, and the key concepts of Metadata and Project Source. You will then install and configure MicroStrategy and lay down the foundations for building MicroStrategy BI solutions. By learning about objects and different object types, you will develop a strong understanding of the MicroStrategy Schema and Public Objects. With these MicroStrategy objects, you will enhance and scale your BI and Analytics solutions. Finally, you will learn about the administration, security, and monitoring of your BI solution. What you will learn Set up the MicroStrategy Intelligence Server and client tools Create a MicroStrategy metadata repository and your first Project Explore the main MicroStrategy object types and their dependencies Create, manipulate, and share Reports Create and share Dashboards Manage Users and Groups Who this book is forThis book is for Business Intelligence professionals or data analysts who want to get started with Microstrategy. Some basic understanding of BI and data analysis will be required to get the most from this book.

Applied Data Science with Python and Jupyter - Use powerful industry-standard tools to unlock new, actionable insights from... Applied Data Science with Python and Jupyter - Use powerful industry-standard tools to unlock new, actionable insights from your data (Paperback)
Alex Galea
R833 Discovery Miles 8 330 Ships in 10 - 15 working days

Become the master player of data exploration by creating reproducible data processing pipelines, visualizations, and prediction models for your applications. Key Features Get up and running with the Jupyter ecosystem and some example datasets Learn about key machine learning concepts such as SVM, KNN classifiers, and Random Forests Discover how you can use web scraping to gather and parse your own bespoke datasets Book DescriptionGetting started with data science doesn't have to be an uphill battle. Applied Data Science with Python and Jupyter is a step-by-step guide ideal for beginners who know a little Python and are looking for a quick, fast-paced introduction to these concepts. In this book, you'll learn every aspect of the standard data workflow process, including collecting, cleaning, investigating, visualizing, and modeling data. You'll start with the basics of Jupyter, which will be the backbone of the book. After familiarizing ourselves with its standard features, you'll look at an example of it in practice with our first analysis. In the next lesson, you dive right into predictive analytics, where multiple classification algorithms are implemented. Finally, the book ends by looking at data collection techniques. You'll see how web data can be acquired with scraping techniques and via APIs, and then briefly explore interactive visualizations. What you will learn Get up and running with the Jupyter ecosystem Identify potential areas of investigation and perform exploratory data analysis Plan a machine learning classification strategy and train classification models Use validation curves and dimensionality reduction to tune and enhance your models Scrape tabular data from web pages and transform it into Pandas DataFrames Create interactive, web-friendly visualizations to clearly communicate your findings Who this book is forApplied Data Science with Python and Jupyter is ideal for professionals with a variety of job descriptions across a large range of industries, given the rising popularity and accessibility of data science. You'll need some prior experience with Python, with any prior work with libraries such as Pandas, Matplotlib, and Pandas providing you a useful head start.

Hands-On Dashboard Development with Shiny - A practical guide to building effective web applications and dashboards... Hands-On Dashboard Development with Shiny - A practical guide to building effective web applications and dashboards (Paperback)
Chris Beeley
R622 Discovery Miles 6 220 Ships in 10 - 15 working days

Progressively explore UI development with Shiny via practical examples Key Features Write a Shiny interface in pure HTML Explore powerful layout functions to make attractive dashboards and other intuitive interfaces Get to grips with Bootstrap and leverage it in your Shiny applications Book DescriptionAlthough vanilla Shiny applications look attractive with some layout flexibility, you may still want to have more control over how the interface is laid out to produce a dashboard. Hands-On Dashboard Development with Shiny helps you incorporate this in your applications. The book starts by guiding you in producing an application based on the diamonds dataset included in the ggplot2 package. You'll create a single application, but the interface will be reskinned and rebuilt throughout using different methods to illustrate their uses and functions using HTML, CSS, and JavaScript. You will also learn to develop an application that creates documents and reports using R Markdown. Furthermore, the book demonstrates the use of HTML templates and the Bootstrap framework. Moving along, you will learn how to produce dashboards using the Shiny command and dashboard package. Finally, you will learn how to lay out applications using a wide range of built-in functions. By the end of the book, you will have an understanding of the principles that underpin layout in Shiny applications, including sections of HTML added to a vanilla Shiny application, HTML interfaces written from scratch, dashboards, navigation bars, and interfaces. What you will learn Add HTML to a Shiny application and write its interfaces from scratch in HTML Use built-in Shiny functions to produce attractive and flexible layouts Produce dashboards, adding icons and notifications Explore Bootstrap themes to lay out your applications Get insights into UI development with hands-on examples Use R Markdown to create and download reports Who this book is forIf you have some experience writing Shiny applications and want to use HTML, CSS, and Bootstrap to make custom interfaces, then this book is for you.

Data Science with SQL Server Quick Start Guide - Integrate SQL Server with data science (Paperback): Dejan Sarka Data Science with SQL Server Quick Start Guide - Integrate SQL Server with data science (Paperback)
Dejan Sarka
R889 Discovery Miles 8 890 Ships in 10 - 15 working days

Get unique insights from your data by combining the power of SQL Server, R and Python Key Features Use the features of SQL Server 2017 to implement the data science project life cycle Leverage the power of R and Python to design and develop efficient data models find unique insights from your data with powerful techniques for data preprocessing and analysis Book DescriptionSQL Server only started to fully support data science with its two most recent editions. If you are a professional from both worlds, SQL Server and data science, and interested in using SQL Server and Machine Learning (ML) Services for your projects, then this is the ideal book for you. This book is the ideal introduction to data science with Microsoft SQL Server and In-Database ML Services. It covers all stages of a data science project, from businessand data understanding,through data overview, data preparation, modeling and using algorithms, model evaluation, and deployment. You will learn to use the engines and languages that come with SQL Server, including ML Services with R and Python languages and Transact-SQL. You will also learn how to choose which algorithm to use for which task, and learn the working of each algorithm. What you will learn Use the popular programming languages,T-SQL, R, and Python, for data science Understand your data with queries and introductory statistics Create and enhance the datasets for ML Visualize and analyze data using basic and advanced graphs Explore ML using unsupervised and supervised models Deploy models in SQL Server and perform predictions Who this book is forSQL Server professionals who want to start with data science, and data scientists who would like to start using SQL Server in their projects will find this book to be useful. Prior exposure to SQL Server will be helpful.

Inside VB.Net - IDE Driven Code Using ADO and MSPersist (Paperback): Richard Thomas Edwards Inside VB.Net - IDE Driven Code Using ADO and MSPersist (Paperback)
Richard Thomas Edwards
R359 Discovery Miles 3 590 Ships in 10 - 15 working days
Inside VB.NET - IDE driven code using ADO and MSDAOSP (Paperback): Richard Thomas Edwards Inside VB.NET - IDE driven code using ADO and MSDAOSP (Paperback)
Richard Thomas Edwards
R360 Discovery Miles 3 600 Ships in 10 - 15 working days
Power BI - Business Intelligence Clinic - Create and Learn (Paperback): Roger F Silva Power BI - Business Intelligence Clinic - Create and Learn (Paperback)
Roger F Silva
R619 Discovery Miles 6 190 Ships in 10 - 15 working days
Hands-On Ensemble Learning with R - A beginner's guide to combining the power of machine learning algorithms using... Hands-On Ensemble Learning with R - A beginner's guide to combining the power of machine learning algorithms using ensemble techniques (Paperback)
Prabhanjan Narayanachar Tattar
R1,144 Discovery Miles 11 440 Ships in 10 - 15 working days

Explore powerful R packages to create predictive models using ensemble methods Key Features Implement machine learning algorithms to build ensemble-efficient models Explore powerful R packages to create predictive models using ensemble methods Learn to build ensemble models on large datasets using a practical approach Book DescriptionEnsemble techniques are used for combining two or more similar or dissimilar machine learning algorithms to create a stronger model. Such a model delivers superior prediction power and can give your datasets a boost in accuracy. Hands-On Ensemble Learning with R begins with the important statistical resampling methods. You will then walk through the central trilogy of ensemble techniques - bagging, random forest, and boosting - then you'll learn how they can be used to provide greater accuracy on large datasets using popular R packages. You will learn how to combine model predictions using different machine learning algorithms to build ensemble models. In addition to this, you will explore how to improve the performance of your ensemble models. By the end of this book, you will have learned how machine learning algorithms can be combined to reduce common problems and build simple efficient ensemble models with the help of real-world examples. What you will learn Carry out an essential review of re-sampling methods, bootstrap, and jackknife Explore the key ensemble methods: bagging, random forests, and boosting Use multiple algorithms to make strong predictive models Enjoy a comprehensive treatment of boosting methods Supplement methods with statistical tests, such as ROC Walk through data structures in classification, regression, survival, and time series data Use the supplied R code to implement ensemble methods Learn stacking method to combine heterogeneous machine learning models Who this book is forThis book is for you if you are a data scientist or machine learning developer who wants to implement machine learning techniques by building ensemble models with the power of R. You will learn how to combine different machine learning algorithms to perform efficient data processing. Basic knowledge of machine learning techniques and programming knowledge of R would be an added advantage.

Mastering Kibana 6.x - Visualize your Elastic Stack data with histograms, maps, charts, and graphs (Paperback): Anurag... Mastering Kibana 6.x - Visualize your Elastic Stack data with histograms, maps, charts, and graphs (Paperback)
Anurag Srivastava
R1,038 Discovery Miles 10 380 Ships in 10 - 15 working days

Get to grips with Kibana and its advanced functions to create interactive visualizations and dashboards Key Features Explore visualizations and perform histograms, stats, and map analytics Unleash X-Pack and Timelion, and learn alerting, monitoring, and reporting features Manage dashboards with Beats and create machine learning jobs for faster analytics Book DescriptionKibana is one of the popular tools among data enthusiasts for slicing and dicing large datasets and uncovering Business Intelligence (BI) with the help of its rich and powerful visualizations. To begin with, Mastering Kibana 6.x quickly introduces you to the features of Kibana 6.x, before teaching you how to create smart dashboards in no time. You will explore metric analytics and graph exploration, followed by understanding how to quickly customize Kibana dashboards. In addition to this, you will learn advanced analytics such as maps, hits, and list analytics. All this will help you enhance your skills in running and comparing multiple queries and filters, influencing your data visualization skills at scale. With Kibana's Timelion feature, you can analyze time series data with histograms and stats analytics. By the end of this book, you will have created a speedy machine learning job using X-Pack capabilities. What you will learn Create unique dashboards with various intuitive data visualizations Visualize Timelion expressions with added histograms and stats analytics Integrate X-Pack with your Elastic Stack in simple steps Extract data from Elasticsearch for advanced analysis and anomaly detection using dashboards Build dashboards from web applications for application logs Create monitoring and alerting dashboards using Beats Who this book is forMastering Kibana 6.x is for you if you are a big data engineer, DevOps engineer, or data scientist aspiring to go beyond data visualization at scale and gain maximum insights from their large datasets. Basic knowledge of Elasticstack will be an added advantage, although not mandatory.

Hands-On Recommendation Systems with Python - Start building powerful and personalized, recommendation engines with Python... Hands-On Recommendation Systems with Python - Start building powerful and personalized, recommendation engines with Python (Paperback)
Rounak Banik
R803 Discovery Miles 8 030 Ships in 10 - 15 working days

With Hands-On Recommendation Systems with Python, learn the tools and techniques required in building various kinds of powerful recommendation systems (collaborative, knowledge and content based) and deploying them to the web Key Features Build industry-standard recommender systems Only familiarity with Python is required No need to wade through complicated machine learning theory to use this book Book DescriptionRecommendation systems are at the heart of almost every internet business today; from Facebook to Netflix to Amazon. Providing good recommendations, whether it's friends, movies, or groceries, goes a long way in defining user experience and enticing your customers to use your platform. This book shows you how to do just that. You will learn about the different kinds of recommenders used in the industry and see how to build them from scratch using Python. No need to wade through tons of machine learning theory-you'll get started with building and learning about recommenders as quickly as possible.. In this book, you will build an IMDB Top 250 clone, a content-based engine that works on movie metadata. You'll use collaborative filters to make use of customer behavior data, and a Hybrid Recommender that incorporates content based and collaborative filtering techniques With this book, all you need to get started with building recommendation systems is a familiarity with Python, and by the time you're fnished, you will have a great grasp of how recommenders work and be in a strong position to apply the techniques that you will learn to your own problem domains. What you will learn Get to grips with the different kinds of recommender systems Master data-wrangling techniques using the pandas library Building an IMDB Top 250 Clone Build a content based engine to recommend movies based on movie metadata Employ data-mining techniques used in building recommenders Build industry-standard collaborative filters using powerful algorithms Building Hybrid Recommenders that incorporate content based and collaborative fltering Who this book is forIf you are a Python developer and want to develop applications for social networking, news personalization or smart advertising, this is the book for you. Basic knowledge of machine learning techniques will be helpful, but not mandatory.

fastText Quick Start Guide - Get started with Facebook's library for text representation and classification (Paperback):... fastText Quick Start Guide - Get started with Facebook's library for text representation and classification (Paperback)
Joydeep Bhattacharjee
R803 Discovery Miles 8 030 Ships in 10 - 15 working days

Perform efficient fast text representation and classification with Facebook's fastText library Key Features Introduction to Facebook's fastText library for NLP Perform efficient word representations, sentence classification, vector representation Build better, more scalable solutions for text representation and classification Book DescriptionFacebook's fastText library handles text representation and classification, used for Natural Language Processing (NLP). Most organizations have to deal with enormous amounts of text data on a daily basis, and gaining efficient data insights requires powerful NLP tools such as fastText. This book is your ideal introduction to fastText. You will learn how to create fastText models from the command line, without the need for complicated code. You will explore the algorithms that fastText is built on and how to use them for word representation and text classification. Next, you will use fastText in conjunction with other popular libraries and frameworks such as Keras, TensorFlow, and PyTorch. Finally, you will deploy fastText models to mobile devices. By the end of this book, you will have all the required knowledge to use fastText in your own applications at work or in projects. What you will learn Create models using the default command line options in fastText Understand the algorithms used in fastText to create word vectors Combine command line text transformation capabilities and the fastText library to implement a training, validation, and prediction pipeline Explore word representation and sentence classification using fastText Use Gensim and spaCy to load the vectors, transform, lemmatize, and perform other NLP tasks efficiently Develop a fastText NLP classifier using popular frameworks, such as Keras, Tensorflow, and PyTorch Who this book is forThis book is for data analysts, data scientists, and machine learning developers who want to perform efficient word representation and sentence classification using Facebook's fastText library. Basic knowledge of Python programming is required.

Art + Data - A Collection of Tableau Dashboards (hard cover) (Hardcover): Decisive Data Art + Data - A Collection of Tableau Dashboards (hard cover) (Hardcover)
Decisive Data
R866 R721 Discovery Miles 7 210 Save R145 (17%) Ships in 10 - 15 working days
Big Data Analytics with SAS (Paperback): David Pope Big Data Analytics with SAS (Paperback)
David Pope
R1,211 Discovery Miles 12 110 Ships in 10 - 15 working days

Leverage the capabilities of SAS to process and analyze Big Data About This Book * Combine SAS with platforms such as Hadoop, SAP HANA, and Cloud Foundry-based platforms for effecient Big Data analytics * Learn how to use the web browser-based SAS Studio and iPython Jupyter Notebook interfaces with SAS * Practical, real-world examples on predictive modeling, forecasting, optimizing and reporting your Big Data analysis with SAS Who This Book Is For SAS professionals and data analysts who wish to perform analytics on Big Data using SAS to gain actionable insights will find this book to be very useful. If you are a data science professional looking to perform large-scale analytics with SAS, this book will also help you. A basic understanding of SAS will be helpful, but is not mandatory. What You Will Learn * Configure a free version of SAS in order do hands-on exercises dealing with data management, analysis, and reporting. * Understand the basic concepts of the SAS language which consists of the data step (for data preparation) and procedures (or PROCs) for analysis. * Make use of the web browser based SAS Studio and iPython Jupyter Notebook interfaces for coding in the SAS, DS2, and FedSQL programming languages. * Understand how the DS2 programming language plays an important role in Big Data preparation and analysis using SAS * Integrate and work efficiently with Big Data platforms like Hadoop, SAP HANA, and cloud foundry based systems. In Detail SAS has been recognized by Money Magazine and Payscale as one of the top business skills to learn in order to advance one's career. Through innovative data management, analytics, and business intelligence software and services, SAS helps customers solve their business problems by allowing them to make better decisions faster. This book introduces the reader to the SAS and how they can use SAS to perform efficient analysis on any size data, including Big Data. The reader will learn how to prepare data for analysis, perform predictive, forecasting, and optimization analysis and then deploy or report on the results of these analyses. While performing the coding examples within this book the reader will learn how to use the web browser based SAS Studio and iPython Jupyter Notebook interfaces for working with SAS. Finally, the reader will learn how SAS's architecture is engineered and designed to scale up and/or out and be combined with the open source offerings such as Hadoop, Python, and R. By the end of this book, you will be able to clearly understand how you can efficiently analyze Big Data using SAS. Style and approach The book starts off by introducing the reader to SAS and the SAS programming language which provides data management, analytical, and reporting capabilities. Most chapters include hands on examples which highlights how SAS provides The Power to Know (c). The reader will learn that if they are looking to perform large-scale data analysis that SAS provides an open platform engineered and designed to scale both up and out which allows the power of SAS to combine with open source offerings such as Hadoop, Python, and R.

Practical Data Wrangling (Paperback): Allan Visochek Practical Data Wrangling (Paperback)
Allan Visochek
R841 Discovery Miles 8 410 Ships in 10 - 15 working days

Turn your noisy data into relevant, insight-ready information by leveraging the data wrangling techniques in Python and R About This Book * This easy-to-follow guide takes you through every step of the data wrangling process in the best possible way * Work with different types of datasets, and reshape the layout of your data to make it easier for analysis * Get simple examples and real-life data wrangling solutions for data pre-processing Who This Book Is For If you are a data scientist, data analyst, or a statistician who wants to learn how to wrangle your data for analysis in the best possible manner, this book is for you. As this book covers both R and Python, some understanding of them will be beneficial. What You Will Learn * Read a csv file into python and R, and print out some statistics on the data * Gain knowledge of the data formats and programming structures involved in retrieving API data * Make effective use of regular expressions in the data wrangling process * Explore the tools and packages available to prepare numerical data for analysis * Find out how to have better control over manipulating the structure of the data * Create a dexterity to programmatically read, audit, correct, and shape data * Write and complete programs to take in, format, and output data sets In Detail Around 80% of time in data analysis is spent on cleaning and preparing data for analysis. This is, however, an important task, and is a prerequisite to the rest of the data analysis workflow, including visualization, analysis and reporting. Python and R are considered a popular choice of tool for data analysis, and have packages that can be best used to manipulate different kinds of data, as per your requirements. This book will show you the different data wrangling techniques, and how you can leverage the power of Python and R packages to implement them. You'll start by understanding the data wrangling process and get a solid foundation to work with different types of data. You'll work with different data structures and acquire and parse data from various locations. You'll also see how to reshape the layout of data and manipulate, summarize, and join data sets. Finally, we conclude with a quick primer on accessing and processing data from databases, conducting data exploration, and storing and retrieving data quickly using databases. The book includes practical examples on each of these points using simple and real-world data sets to give you an easier understanding. By the end of the book, you'll have a thorough understanding of all the data wrangling concepts and how to implement them in the best possible way. Style and approach This is a practical book on data wrangling designed to give you an insight into the practical application of data wrangling. It takes you through complex concepts and tasks in an accessible way, featuring information on a wide range of data wrangling techniques with Python and R.

Become a Python Data Analyst - Perform exploratory data analysis and gain insight into scientific computing using Python... Become a Python Data Analyst - Perform exploratory data analysis and gain insight into scientific computing using Python (Paperback)
Alvaro Fuentes
R664 Discovery Miles 6 640 Ships in 10 - 15 working days

Enhance your data analysis and predictive modeling skills using popular Python tools Key Features Cover all fundamental libraries for operation and manipulation of Python for data analysis Implement real-world datasets to perform predictive analytics with Python Access modern data analysis techniques and detailed code with scikit-learn and SciPy Book DescriptionPython is one of the most common and popular languages preferred by leading data analysts and statisticians for working with massive datasets and complex data visualizations. Become a Python Data Analyst introduces Python's most essential tools and libraries necessary to work with the data analysis process, right from preparing data to performing simple statistical analyses and creating meaningful data visualizations. In this book, we will cover Python libraries such as NumPy, pandas, matplotlib, seaborn, SciPy, and scikit-learn, and apply them in practical data analysis and statistics examples. As you make your way through the chapters, you will learn to efficiently use the Jupyter Notebook to operate and manipulate data using NumPy and the pandas library. In the concluding chapters, you will gain experience in building simple predictive models and carrying out statistical computation and analysis using rich Python tools and proven data analysis techniques. By the end of this book, you will have hands-on experience performing data analysis with Python. What you will learn Explore important Python libraries and learn to install Anaconda distribution Understand the basics of NumPy Produce informative and useful visualizations for analyzing data Perform common statistical calculations Build predictive models and understand the principles of predictive analytics Who this book is forBecome a Python Data Analyst is for entry-level data analysts, data engineers, and BI professionals who want to make complete use of Python tools for performing efficient data analysis. Prior knowledge of Python programming is necessary to understand the concepts covered in this book

PySpark Cookbook - Over 60 recipes for implementing big data processing and analytics using Apache Spark and Python... PySpark Cookbook - Over 60 recipes for implementing big data processing and analytics using Apache Spark and Python (Paperback)
Denny Lee, Tomasz Drabas
R1,125 Discovery Miles 11 250 Ships in 10 - 15 working days

Combine the power of Apache Spark and Python to build effective big data applications Key Features Perform effective data processing, machine learning, and analytics using PySpark Overcome challenges in developing and deploying Spark solutions using Python Explore recipes for efficiently combining Python and Apache Spark to process data Book DescriptionApache Spark is an open source framework for efficient cluster computing with a strong interface for data parallelism and fault tolerance. The PySpark Cookbook presents effective and time-saving recipes for leveraging the power of Python and putting it to use in the Spark ecosystem. You'll start by learning the Apache Spark architecture and how to set up a Python environment for Spark. You'll then get familiar with the modules available in PySpark and start using them effortlessly. In addition to this, you'll discover how to abstract data with RDDs and DataFrames, and understand the streaming capabilities of PySpark. You'll then move on to using ML and MLlib in order to solve any problems related to the machine learning capabilities of PySpark and use GraphFrames to solve graph-processing problems. Finally, you will explore how to deploy your applications to the cloud using the spark-submit command. By the end of this book, you will be able to use the Python API for Apache Spark to solve any problems associated with building data-intensive applications. What you will learn Configure a local instance of PySpark in a virtual environment Install and configure Jupyter in local and multi-node environments Create DataFrames from JSON and a dictionary using pyspark.sql Explore regression and clustering models available in the ML module Use DataFrames to transform data used for modeling Connect to PubNub and perform aggregations on streams Who this book is forThe PySpark Cookbook is for you if you are a Python developer looking for hands-on recipes for using the Apache Spark 2.x ecosystem in the best possible way. A thorough understanding of Python (and some familiarity with Spark) will help you get the best out of the book.

Apache Hive Essentials - Essential techniques to help you process, and get unique insights from, big data, 2nd Edition... Apache Hive Essentials - Essential techniques to help you process, and get unique insights from, big data, 2nd Edition (Paperback, 2nd Revised edition)
Dayong Du
R803 Discovery Miles 8 030 Ships in 10 - 15 working days

This book takes you on a fantastic journey to discover the attributes of big data using Apache Hive. Key Features Grasp the skills needed to write efficient Hive queries to analyze the Big Data Discover how Hive can coexist and work with other tools within the Hadoop ecosystem Uses practical, example-oriented scenarios to cover all the newly released features of Apache Hive 2.3.3 Book DescriptionIn this book, we prepare you for your journey into big data by frstly introducing you to backgrounds in the big data domain, alongwith the process of setting up and getting familiar with your Hive working environment. Next, the book guides you through discovering and transforming the values of big data with the help of examples. It also hones your skills in using the Hive language in an effcient manner. Toward the end, the book focuses on advanced topics, such as performance, security, and extensions in Hive, which will guide you on exciting adventures on this worthwhile big data journey. By the end of the book, you will be familiar with Hive and able to work effeciently to find solutions to big data problems What you will learn Create and set up the Hive environment Discover how to use Hive's definition language to describe data Discover interesting data by joining and filtering datasets in Hive Transform data by using Hive sorting, ordering, and functions Aggregate and sample data in different ways Boost Hive query performance and enhance data security in Hive Customize Hive to your needs by using user-defined functions and integrate it with other tools Who this book is forIf you are a data analyst, developer, or simply someone who wants to quickly get started with Hive to explore and analyze Big Data in Hadoop, this is the book for you. Since Hive is an SQL-like language, some previous experience with SQL will be useful to get the most out of this book.

Mastering Numerical Computing with NumPy - Master scientific computing and perform complex operations with ease (Paperback):... Mastering Numerical Computing with NumPy - Master scientific computing and perform complex operations with ease (Paperback)
Umit Mert Cakmak, Mert Cuhadaroglu
R931 Discovery Miles 9 310 Ships in 10 - 15 working days

Enhance the power of NumPy and start boosting your scientific computing capabilities Key Features Grasp all aspects of numerical computing and understand NumPy Explore examples to learn exploratory data analysis (EDA), regression, and clustering Access NumPy libraries and use performance benchmarking to select the right tool Book DescriptionNumPy is one of the most important scientific computing libraries available for Python. Mastering Numerical Computing with NumPy teaches you how to achieve expert level competency to perform complex operations, with in-depth coverage of advanced concepts. Beginning with NumPy's arrays and functions, you will familiarize yourself with linear algebra concepts to perform vector and matrix math operations. You will thoroughly understand and practice data processing, exploratory data analysis (EDA), and predictive modeling. You will then move on to working on practical examples which will teach you how to use NumPy statistics in order to explore US housing data and develop a predictive model using simple and multiple linear regression techniques. Once you have got to grips with the basics, you will explore unsupervised learning and clustering algorithms, followed by understanding how to write better NumPy code while keeping advanced considerations in mind. The book also demonstrates the use of different high-performance numerical computing libraries and their relationship with NumPy. You will study how to benchmark the performance of different configurations and choose the best for your system. By the end of this book, you will have become an expert in handling and performing complex data manipulations. What you will learn Perform vector and matrix operations using NumPy Perform exploratory data analysis (EDA) on US housing data Develop a predictive model using simple and multiple linear regression Understand unsupervised learning and clustering algorithms with practical use cases Write better NumPy code and implement the algorithms from scratch Perform benchmark tests to choose the best configuration for your system Who this book is forMastering Numerical Computing with NumPy is for you if you are a Python programmer, data analyst, data engineer, or a data science enthusiast, who wants to master the intricacies of NumPy and build solutions for your numeric and scientific computational problems. You are expected to have familiarity with mathematics to get the most out of this book.

Hands-On Data Visualization with Bokeh - Interactive web plotting for Python using Bokeh (Paperback): Kevin Jolly Hands-On Data Visualization with Bokeh - Interactive web plotting for Python using Bokeh (Paperback)
Kevin Jolly
R834 Discovery Miles 8 340 Ships in 10 - 15 working days

Learn how to create interactive and visually aesthetic plots using the Bokeh package in Python Key Features A step by step approach to creating interactive plots with Bokeh Go from installation all the way to deploying your very own Bokeh application Work with a real time datasets to practice and create your very own plots and applications Book DescriptionAdding a layer of interactivity to your plots and converting these plots into applications hold immense value in the field of data science. The standard approach to adding interactivity would be to use paid software such as Tableau, but the Bokeh package in Python offers users a way to create both interactive and visually aesthetic plots for free. This book gets you up to speed with Bokeh - a popular Python library for interactive data visualization. The book starts out by helping you understand how Bokeh works internally and how you can set up and install the package in your local machine. You then use a real world data set which uses stock data from Kaggle to create interactive and visually stunning plots. You will also learn how to leverage Bokeh using some advanced concepts such as plotting with spatial and geo data. Finally you will use all the concepts that you have learned in the previous chapters to create your very own Bokeh application from scratch. By the end of the book you will be able to create your very own Bokeh application. You will have gone through a step by step process that starts with understanding what Bokeh actually is and ends with building your very own Bokeh application filled with interactive and visually aesthetic plots. What you will learn Installing Bokeh and understanding its key concepts Creating plots using glyphs, the fundamental building blocks of Bokeh Creating plots using different data structures like NumPy and Pandas Using layouts and widgets to visually enhance your plots and add a layer of interactivity Building and hosting applications on the Bokeh server Creating advanced plots using spatial data Who this book is forThis book is well suited for data scientists and data analysts who want to perform interactive data visualization on their web browsers using Bokeh. Some exposure to Python programming will be helpful, but prior experience with Bokeh is not required.

Beginning Data Science with Python and Jupyter - Use powerful industry-standard tools within Jupyter and the Python ecosystem... Beginning Data Science with Python and Jupyter - Use powerful industry-standard tools within Jupyter and the Python ecosystem to unlock new, actionable insights from your data (Paperback)
Alex Galea
R484 Discovery Miles 4 840 Ships in 10 - 15 working days

Getting started with data science doesn't have to be an uphill battle. This step-by-step guide is ideal for beginners who know a little Python and are looking for a quick, fast-paced introduction. Key Features Get up and running with the Jupyter ecosystem and some example datasets Learn about key machine learning concepts like SVM, KNN classifiers and Random Forests Discover how you can use web scraping to gather and parse your own bespoke datasets Book DescriptionGet to grips with the skills you need for entry-level data science in this hands-on Python and Jupyter course. You'll learn about some of the most commonly used libraries that are part of the Anaconda distribution, and then explore machine learning models with real datasets to give you the skills and exposure you need for the real world. We'll finish up by showing you how easy it can be to scrape and gather your own data from the open web, so that you can apply your new skills in an actionable context. What you will learn Get up and running with the Jupyter ecosystem and some example datasets Learn about key machine learning concepts like SVM, KNN classifiers, and Random Forests Plan a machine learning classification strategy and train classification, models Use validation curves and dimensionality reduction to tune and enhance your models Discover how you can use web scraping to gather and parse your own bespoke datasets Scrape tabular data from web pages and transform them into Pandas DataFrames Create interactive, web-friendly visualizations to clearly communicate your findings Who this book is forThis book is ideal for professionals with a variety of job descriptions across large range of industries, given the rising popularity and accessibility of data science. You'll need some prior experience with Python, with any prior work with libraries like Pandas, Matplotlib and Pandas providing you a useful head start.

Powershell And Odbc - Working with the Dataview (Paperback): Richard Thomas Edwards Powershell And Odbc - Working with the Dataview (Paperback)
Richard Thomas Edwards
R359 Discovery Miles 3 590 Ships in 10 - 15 working days
VB.Net And OLEDB - Working with the Datatable (Paperback): Richard Thomas Edwards VB.Net And OLEDB - Working with the Datatable (Paperback)
Richard Thomas Edwards
R370 Discovery Miles 3 700 Ships in 10 - 15 working days
VB.Net And SQL Client - Working with the Datatable (Paperback): Richard Thomas Edwards VB.Net And SQL Client - Working with the Datatable (Paperback)
Richard Thomas Edwards
R370 Discovery Miles 3 700 Ships in 10 - 15 working days
Free Delivery
Pinterest Twitter Facebook Google+
You may like...
Convergence of Big Data Technologies and…
Govind P. Gupta Hardcover R7,069 Discovery Miles 70 690
Big Data - Concepts, Methodologies…
Information Reso Management Association Hardcover R18,647 Discovery Miles 186 470
Organizational Planning and Analysis…
Rupert Morrison Hardcover R3,718 Discovery Miles 37 180
Cross-Cultural Analysis of Image-Based…
Lisa Keller, Robert Keller, … Hardcover R3,459 Discovery Miles 34 590
Challenges and Applications of Data…
V. Sathiyamoorthi, Atilla Elci Hardcover R7,116 Discovery Miles 71 160
Queer Data - Using Gender, Sex and…
Kevin Guyan Hardcover R2,391 Discovery Miles 23 910
Big Data Analytics for Sustainable…
Anandakumar Haldorai, Arulmurugan Ramu Hardcover R7,075 Discovery Miles 70 750
Deep Learning For Beginners - 2…
Steven Cooper Hardcover R791 R676 Discovery Miles 6 760
Advanced Classification Techniques for…
Chinmay Chakraborty Hardcover R7,475 Discovery Miles 74 750
Roman's Data Science How to monetize…
Roman Zykov Hardcover R966 R798 Discovery Miles 7 980

 

Partners