0
Your cart

Your cart is empty

Browse All Departments
Price
  • R100 - R250 (6)
  • R250 - R500 (70)
  • R500+ (1,192)
  • -
Status
Format
Author / Contributor
Publisher

Books > Computing & IT > Applications of computing > Databases > Data capture & analysis

Big Data Analytics with SAS (Paperback): David Pope Big Data Analytics with SAS (Paperback)
David Pope
R1,164 Discovery Miles 11 640 Ships in 18 - 22 working days

Leverage the capabilities of SAS to process and analyze Big Data About This Book * Combine SAS with platforms such as Hadoop, SAP HANA, and Cloud Foundry-based platforms for effecient Big Data analytics * Learn how to use the web browser-based SAS Studio and iPython Jupyter Notebook interfaces with SAS * Practical, real-world examples on predictive modeling, forecasting, optimizing and reporting your Big Data analysis with SAS Who This Book Is For SAS professionals and data analysts who wish to perform analytics on Big Data using SAS to gain actionable insights will find this book to be very useful. If you are a data science professional looking to perform large-scale analytics with SAS, this book will also help you. A basic understanding of SAS will be helpful, but is not mandatory. What You Will Learn * Configure a free version of SAS in order do hands-on exercises dealing with data management, analysis, and reporting. * Understand the basic concepts of the SAS language which consists of the data step (for data preparation) and procedures (or PROCs) for analysis. * Make use of the web browser based SAS Studio and iPython Jupyter Notebook interfaces for coding in the SAS, DS2, and FedSQL programming languages. * Understand how the DS2 programming language plays an important role in Big Data preparation and analysis using SAS * Integrate and work efficiently with Big Data platforms like Hadoop, SAP HANA, and cloud foundry based systems. In Detail SAS has been recognized by Money Magazine and Payscale as one of the top business skills to learn in order to advance one's career. Through innovative data management, analytics, and business intelligence software and services, SAS helps customers solve their business problems by allowing them to make better decisions faster. This book introduces the reader to the SAS and how they can use SAS to perform efficient analysis on any size data, including Big Data. The reader will learn how to prepare data for analysis, perform predictive, forecasting, and optimization analysis and then deploy or report on the results of these analyses. While performing the coding examples within this book the reader will learn how to use the web browser based SAS Studio and iPython Jupyter Notebook interfaces for working with SAS. Finally, the reader will learn how SAS's architecture is engineered and designed to scale up and/or out and be combined with the open source offerings such as Hadoop, Python, and R. By the end of this book, you will be able to clearly understand how you can efficiently analyze Big Data using SAS. Style and approach The book starts off by introducing the reader to SAS and the SAS programming language which provides data management, analytical, and reporting capabilities. Most chapters include hands on examples which highlights how SAS provides The Power to Know (c). The reader will learn that if they are looking to perform large-scale data analysis that SAS provides an open platform engineered and designed to scale both up and out which allows the power of SAS to combine with open source offerings such as Hadoop, Python, and R.

Mastering Kibana 6.x - Visualize your Elastic Stack data with histograms, maps, charts, and graphs (Paperback): Anurag... Mastering Kibana 6.x - Visualize your Elastic Stack data with histograms, maps, charts, and graphs (Paperback)
Anurag Srivastava
R1,002 Discovery Miles 10 020 Ships in 18 - 22 working days

Get to grips with Kibana and its advanced functions to create interactive visualizations and dashboards Key Features Explore visualizations and perform histograms, stats, and map analytics Unleash X-Pack and Timelion, and learn alerting, monitoring, and reporting features Manage dashboards with Beats and create machine learning jobs for faster analytics Book DescriptionKibana is one of the popular tools among data enthusiasts for slicing and dicing large datasets and uncovering Business Intelligence (BI) with the help of its rich and powerful visualizations. To begin with, Mastering Kibana 6.x quickly introduces you to the features of Kibana 6.x, before teaching you how to create smart dashboards in no time. You will explore metric analytics and graph exploration, followed by understanding how to quickly customize Kibana dashboards. In addition to this, you will learn advanced analytics such as maps, hits, and list analytics. All this will help you enhance your skills in running and comparing multiple queries and filters, influencing your data visualization skills at scale. With Kibana's Timelion feature, you can analyze time series data with histograms and stats analytics. By the end of this book, you will have created a speedy machine learning job using X-Pack capabilities. What you will learn Create unique dashboards with various intuitive data visualizations Visualize Timelion expressions with added histograms and stats analytics Integrate X-Pack with your Elastic Stack in simple steps Extract data from Elasticsearch for advanced analysis and anomaly detection using dashboards Build dashboards from web applications for application logs Create monitoring and alerting dashboards using Beats Who this book is forMastering Kibana 6.x is for you if you are a big data engineer, DevOps engineer, or data scientist aspiring to go beyond data visualization at scale and gain maximum insights from their large datasets. Basic knowledge of Elasticstack will be an added advantage, although not mandatory.

Apache Spark Deep Learning Cookbook - Over 80 recipes that streamline deep learning in a distributed environment with Apache... Apache Spark Deep Learning Cookbook - Over 80 recipes that streamline deep learning in a distributed environment with Apache Spark (Paperback)
Ahmed Sherif, Amrith Ravindra
R1,520 Discovery Miles 15 200 Ships in 18 - 22 working days

A solution-based guide to put your deep learning models into production with the power of Apache Spark Key Features Discover practical recipes for distributed deep learning with Apache Spark Learn to use libraries such as Keras and TensorFlow Solve problems in order to train your deep learning models on Apache Spark Book DescriptionWith deep learning gaining rapid mainstream adoption in modern-day industries, organizations are looking for ways to unite popular big data tools with highly efficient deep learning libraries. As a result, this will help deep learning models train with higher efficiency and speed. With the help of the Apache Spark Deep Learning Cookbook, you'll work through specific recipes to generate outcomes for deep learning algorithms, without getting bogged down in theory. From setting up Apache Spark for deep learning to implementing types of neural net, this book tackles both common and not so common problems to perform deep learning on a distributed environment. In addition to this, you'll get access to deep learning code within Spark that can be reused to answer similar problems or tweaked to answer slightly different problems. You will also learn how to stream and cluster your data with Spark. Once you have got to grips with the basics, you'll explore how to implement and deploy deep learning models, such as Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN) in Spark, using popular libraries such as TensorFlow and Keras. By the end of the book, you'll have the expertise to train and deploy efficient deep learning models on Apache Spark. What you will learn Set up a fully functional Spark environment Understand practical machine learning and deep learning concepts Apply built-in machine learning libraries within Spark Explore libraries that are compatible with TensorFlow and Keras Explore NLP models such as Word2vec and TF-IDF on Spark Organize dataframes for deep learning evaluation Apply testing and training modeling to ensure accuracy Access readily available code that may be reusable Who this book is forIf you're looking for a practical and highly useful resource for implementing efficiently distributed deep learning models with Apache Spark, then the Apache Spark Deep Learning Cookbook is for you. Knowledge of the core machine learning concepts and a basic understanding of the Apache Spark framework is required to get the best out of this book. Additionally, some programming knowledge in Python is a plus.

Social-Media-Analyse - mehr als nur eine Wordcloud - HMD Best Paper Award 2016 (German, Paperback, 1. Aufl. 2017): Matthias... Social-Media-Analyse - mehr als nur eine Wordcloud - HMD Best Paper Award 2016 (German, Paperback, 1. Aufl. 2017)
Matthias Boeck, Felix Koebler, Eva Anderl, Linda Le
R321 Discovery Miles 3 210 Ships in 10 - 15 working days

Die Autoren legen beispielhafte Analysemethoden von Social-Media-Daten dar: deskriptive und Data-Mining-Methoden. Mit deren Hilfe werden kundenorientierte Geschaftsmassnahmen eingeleitet und ein stetiges Abwagen zwischen vollautomatisierten und manuellen, kostenintensiven Reports gesteuert. Das Werk liefert eine UEbersicht zu aktuell diskutierten Themen wie begleitende Emotionen, Vernetzung der interagierenden User oder Verbindung von Themen. Als Gewinn fur ein Unternehmen mussen die Analysen durch eine strategische Prozedur geleitet werden, um Erkenntnisse in konkrete Handlungsempfehlungen zu uberfuhren. Neben den Potenzialen durch die Anwendung komplexerer Analysemethoden gibt es auch konzeptionelle, technische und ethische Herausforderungen, wie die Autoren veranschaulichen.

Mastering Apache Solr 7.x - An expert guide to advancing, optimizing, and scaling your enterprise search (Paperback): Sandeep... Mastering Apache Solr 7.x - An expert guide to advancing, optimizing, and scaling your enterprise search (Paperback)
Sandeep Nair, Chintan Mehta, Dharmesh Vasoya
R1,074 Discovery Miles 10 740 Ships in 18 - 22 working days

Accelerate your enterprise search engine and bring relevancy in your search analytics Key Features A practical guide in building expertise with Indexing, Faceting, Clustering and Pagination Master the management and administration of Enterprise Search Applications and services seamlessly Handle multiple data inputs such as JSON, xml, pdf, doc, xls,ppt, csv and much more. Book DescriptionApache Solr is the only standalone enterprise search server with a REST-like application interface. providing highly scalable, distributed search and index replication for many of the world's largest internet sites. To begin with, you would be introduced to how you perform full text search, multiple filter search, perform dynamic clustering and so on helping you to brush up the basics of Apache Solr. You will also explore the new features and advanced options released in Apache Solr 7.x which will get you numerous performance aspects and making data investigation simpler, easier and powerful. You will learn to build complex queries, extensive filters and how are they compiled in your system to bring relevance in your search tools. You will learn to carry out Solr scoring, elements affecting the document score and how you can optimize or tune the score for the application at hand. You will learn to extract features of documents, writing complex queries in re-ranking the documents. You will also learn advanced options helping you to know what content is indexed and how the extracted content is indexed. Throughout the book, you would go through complex problems with solutions along with varied approaches to tackle your business needs. By the end of this book, you will gain advanced proficiency to build out-of-box smart search solutions for your enterprise demands. What you will learn Design schema using schema API to access data in the database Advance querying and fine-tuning techniques for better performance Get to grips with indexing using Client API Set up a fault tolerant and highly available server with newer distributed capabilities, SolrCloud Explore Apache Tika to upload data with Solr Cell Understand different data operations that can be done while indexing Master advanced querying through Velocity Search UI, faceting and Query Re-ranking, pagination and spatial search Learn to use JavaScript, Python, SolrJ and Ruby for interacting with Solr Who this book is forThe book would rightly appeal to developers, software engineers, data engineers and database architects who are building or seeking to build enterprise-wide effective search engines for business intelligence. Prior experience of Apache Solr or Java programming is must to take the best of this book.

Mastering Numerical Computing with NumPy - Master scientific computing and perform complex operations with ease (Paperback):... Mastering Numerical Computing with NumPy - Master scientific computing and perform complex operations with ease (Paperback)
Umit Mert Cakmak, Mert Cuhadaroglu
R901 Discovery Miles 9 010 Ships in 18 - 22 working days

Enhance the power of NumPy and start boosting your scientific computing capabilities Key Features Grasp all aspects of numerical computing and understand NumPy Explore examples to learn exploratory data analysis (EDA), regression, and clustering Access NumPy libraries and use performance benchmarking to select the right tool Book DescriptionNumPy is one of the most important scientific computing libraries available for Python. Mastering Numerical Computing with NumPy teaches you how to achieve expert level competency to perform complex operations, with in-depth coverage of advanced concepts. Beginning with NumPy's arrays and functions, you will familiarize yourself with linear algebra concepts to perform vector and matrix math operations. You will thoroughly understand and practice data processing, exploratory data analysis (EDA), and predictive modeling. You will then move on to working on practical examples which will teach you how to use NumPy statistics in order to explore US housing data and develop a predictive model using simple and multiple linear regression techniques. Once you have got to grips with the basics, you will explore unsupervised learning and clustering algorithms, followed by understanding how to write better NumPy code while keeping advanced considerations in mind. The book also demonstrates the use of different high-performance numerical computing libraries and their relationship with NumPy. You will study how to benchmark the performance of different configurations and choose the best for your system. By the end of this book, you will have become an expert in handling and performing complex data manipulations. What you will learn Perform vector and matrix operations using NumPy Perform exploratory data analysis (EDA) on US housing data Develop a predictive model using simple and multiple linear regression Understand unsupervised learning and clustering algorithms with practical use cases Write better NumPy code and implement the algorithms from scratch Perform benchmark tests to choose the best configuration for your system Who this book is forMastering Numerical Computing with NumPy is for you if you are a Python programmer, data analyst, data engineer, or a data science enthusiast, who wants to master the intricacies of NumPy and build solutions for your numeric and scientific computational problems. You are expected to have familiarity with mathematics to get the most out of this book.

Hacking University - Computer Hacking and Learn Linux 2 Manuscript Bundle: Essential Beginners Guide on How to Become an... Hacking University - Computer Hacking and Learn Linux 2 Manuscript Bundle: Essential Beginners Guide on How to Become an Amateur Hacker and A Complete Step by Step Guide To Learn And Conquer the Linux Operating System (Paperback)
Isaac D Cody
R505 Discovery Miles 5 050 Ships in 18 - 22 working days
Mastering Microsoft Power BI - Expert techniques for effective data analytics and business intelligence (Paperback): Brett... Mastering Microsoft Power BI - Expert techniques for effective data analytics and business intelligence (Paperback)
Brett Powell
R1,511 Discovery Miles 15 110 Ships in 18 - 22 working days

Design, create and manage robust Power BI solutions to gain meaningful business insights Key Features Master all the dashboarding and reporting features of Microsoft Power BI Combine data from multiple sources, create stunning visualizations and publish your reports across multiple platforms A comprehensive guide with real-world use cases and examples demonstrating how you can get the best out of Microsoft Power BI Book DescriptionThis book is intended for business intelligence professionals responsible for the design and development of Power BI content as well as managers, architects and administrators who oversee Power BI projects and deployments. The chapters flow from the planning of a Power BI project through the development and distribution of content to the administration of Power BI for an organization. BI developers will learn how to create sustainable and impactful Power BI datasets, reports, and dashboards. This includes connecting to data sources, shaping and enhancing source data, and developing an analytical data model. Additionally, top report and dashboard design practices are described using features such as Bookmarks and the Power KPI visual. BI managers will learn how Power BI's tools work together such as with the On-premises data gateway and how content can be staged and securely distributed via Apps. Additionally, both the Power BI Report Server and Power BI Premium are reviewed. By the end of this book, you will be confident in creating effective charts, tables, reports or dashboards for any kind of data using the tools and techniques in Microsoft Power BI. What you will learn Build efficient data retrieval and transformation processes with the Power Query M Language Design scalable, user-friendly DirectQuery and Import Data Models Develop visually rich, immersive, and interactive reports and dashboards Maintain version control and stage deployments across development, test, and production environments Manage and monitor the Power BI Service and the On-premises data gateway Develop a fully on-premise solution with the Power BI Report Server Scale up a Power BI solution via Power BI Premium capacity and migration to Azure Analysis Services or SQL Server Analysis Services Who this book is forBusiness Intelligence professionals and existing Power BI users looking to master Power BI for all their data visualization and dashboarding needs will find this book to be useful. While understanding of the basic BI concepts is required, some exposure to Microsoft Power BI will be helpful.

Practical Big Data Analytics - Hands-on techniques to implement enterprise analytics and machine learning using Hadoop, Spark,... Practical Big Data Analytics - Hands-on techniques to implement enterprise analytics and machine learning using Hadoop, Spark, NoSQL and R (Paperback)
Nataraj Dasgupta
R1,206 Discovery Miles 12 060 Ships in 18 - 22 working days

Get command of your organizational Big Data using the power of data science and analytics Key Features A perfect companion to boost your Big Data storing, processing, analyzing skills to help you take informed business decisions Work with the best tools such as Apache Hadoop, R, Python, and Spark for NoSQL platforms to perform massive online analyses Get expert tips on statistical inference, machine learning, mathematical modeling, and data visualization for Big Data Book DescriptionBig Data analytics relates to the strategies used by organizations to collect, organize and analyze large amounts of data to uncover valuable business insights that otherwise cannot be analyzed through traditional systems. Crafting an enterprise-scale cost-efficient Big Data and machine learning solution to uncover insights and value from your organization's data is a challenge. Today, with hundreds of new Big Data systems, machine learning packages and BI Tools, selecting the right combination of technologies is an even greater challenge. This book will help you do that. With the help of this guide, you will be able to bridge the gap between the theoretical world of technology with the practical ground reality of building corporate Big Data and data science platforms. You will get hands-on exposure to Hadoop and Spark, build machine learning dashboards using R and R Shiny, create web-based apps using NoSQL databases such as MongoDB and even learn how to write R code for neural networks. By the end of the book, you will have a very clear and concrete understanding of what Big Data analytics means, how it drives revenues for organizations, and how you can develop your own Big Data analytics solution using different tools and methods articulated in this book. What you will learn - Get a 360-degree view into the world of Big Data, data science and machine learning - Broad range of technical and business Big Data analytics topics that caters to the interests of the technical experts as well as corporate IT executives - Get hands-on experience with industry-standard Big Data and machine learning tools such as Hadoop, Spark, MongoDB, KDB+ and R - Create production-grade machine learning BI Dashboards using R and R Shiny with step-by-step instructions - Learn how to combine open-source Big Data, machine learning and BI Tools to create low-cost business analytics applications - Understand corporate strategies for successful Big Data and data science projects - Go beyond general-purpose analytics to develop cutting-edge Big Data applications using emerging technologies Who this book is forThe book is intended for existing and aspiring Big Data professionals who wish to become the go-to person in their organization when it comes to Big Data architecture, analytics, and governance. While no prior knowledge of Big Data or related technologies is assumed, it will be helpful to have some programming experience.

Demographic Methods and Concepts (Paperback, New): Donald T. Rowland Demographic Methods and Concepts (Paperback, New)
Donald T. Rowland
R792 Discovery Miles 7 920 Ships in 4 - 6 working days

Demographic Methods and Concepts makes accessible the most commonly needed techniques for working with population statistics, irrespective of the reader's mathematical background. For the first time in such a text, concepts and practical strategies needed in the interpretation of demographic indices and data are included. Spreadsheet training exercises enable students to acquire the computer skills needed for demographic work. The accompanying free CD-ROM contains innovative, fully integrated learning modules as well as applications facilitating demographic studies.

Tableau Questions & Answers - Guide to Tableau concepts and FAQs (Paperback): Chandraish Sinha Tableau Questions & Answers - Guide to Tableau concepts and FAQs (Paperback)
Chandraish Sinha
R298 Discovery Miles 2 980 Ships in 18 - 22 working days
Seven NoSQL Databases in a Week - Get up and running with the fundamentals and functionalities of seven of the most popular... Seven NoSQL Databases in a Week - Get up and running with the fundamentals and functionalities of seven of the most popular NoSQL databases (Paperback)
Xun (Brian) Wu, Sudarshan Kadambi, Devram Kandhare, Aaron Ploetz
R974 Discovery Miles 9 740 Ships in 18 - 22 working days

A beginner's guide to get you up and running with Cassandra, DynamoDB, HBase, InfluxDB, MongoDB, Neo4j, and Redis Key Features Covers the basics of 7 NoSQL databases and how they are used in the enterprises Quick introduction to MongoDB, DynamoDB, Redis, Cassandra, Neo4j, InfluxDB, and HBase Includes effective techniques for database querying and management Book DescriptionThis is the golden age of open source NoSQL databases. With enterprises having to work with large amounts of unstructured data and moving away from expensive monolithic architecture, the adoption of NoSQL databases is rapidly increasing. Being familiar with the popular NoSQL databases and knowing how to use them is a must for budding DBAs and developers. This book introduces you to the different types of NoSQL databases and gets you started with seven of the most popular NoSQL databases used by enterprises today. We start off with a brief overview of what NoSQL databases are, followed by an explanation of why and when to use them. The book then covers the seven most popular databases in each of these categories: MongoDB, Amazon DynamoDB, Redis, HBase, Cassandra, InfluxDB, and Neo4j. The book doesn't go into too much detail about each database but teaches you enough to get started with them. By the end of this book, you will have a thorough understanding of the different NoSQL databases and their functionalities, empowering you to select and use the right database according to your needs. What you will learn Understand how MongoDB provides high-performance, high-availability, and automatic scaling Interact with your Neo4j instances via database queries, Python scripts, and Java application code Get familiar with common querying and programming methods to interact with Redis Study the different types of problems Cassandra can solve Work with HBase components to support common operations such as creating tables and reading/writing data Discover data models and work with CRUD operations using DynamoDB Discover what makes InfluxDB a great choice for working with time-series data Who this book is forIf you are a budding DBA or a developer who wants to get started with the fundamentals of NoSQL databases, this book is for you. Relational DBAs who want to get insights into the various offerings of popular NoSQL databases will also find this book to be very useful.

Learning Kibana 5.0 (Paperback): Bahaaldine Azarmi Learning Kibana 5.0 (Paperback)
Bahaaldine Azarmi
R1,055 Discovery Miles 10 550 Ships in 18 - 22 working days

Exploit the visualization capabilities of Kibana and build powerful interactive dashboards About This Book * Introduction to data-driven architecture and the Elastic stack * Build effective dashboards for data visualization and explore datasets with Elastic Graph * A comprehensive guide to learning scalable data visualization techniques in Kibana Who This Book Is For If you are a developer, data visualization engineer, or data scientist who wants to get the best of data visualization at scale then this book is perfect for you. A basic understanding of Elasticsearch and Logstash is required to make the best use of this book. What You Will Learn * How to create visualizations in Kibana * Ingest log data, structure an Elasticsearch cluster, and create visualization assets in Kibana * Embed Kibana visualization on web pages * Scaffold, develop, and deploy new Kibana & Timelion customizations * Build a metrics dashboard in Timelion based on time series data * Use the Graph plugin visualization feature and leverage a graph query * Create, implement, package, and deploy a new custom plugin * Use Prelert to solve anomaly detection challenges In Detail Kibana is an open source data visualization platform that allows you to interact with your data through stunning, powerful graphics. Its simple, browser-based interface enables you to quickly create and share dynamic dashboards that display changes to Elasticsearch queries in real time. In this book, you'll learn how to use the Elastic stack on top of a data architecture to visualize data in real time. All data architectures have different requirements and expectations when it comes to visualizing the data, whether it's logging analytics, metrics, business analytics, graph analytics, or scaling them as per your business requirements. This book will help you master Elastic visualization tools and adapt them to the requirements of your project. You will start by learning how to use the basic visualization features of Kibana 5. Then you will be shown how to implement a pure metric analytics architecture and visualize it using Timelion, a very recent and trendy feature of the Elastic stack. You will learn how to correlate data using the brand-new Graph visualization and build relationships between documents. Finally, you will be familiarized with the setup of a Kibana development environment so that you can build a custom Kibana plugin. By the end of this book you will have all the information needed to take your Elastic stack skills to a new level of data visualization. Style and approach This book takes a comprehensive, step-by-step approach to working with the visualization aspects of the Elastic stack. Every concept is presented in a very easy-to-follow manner that shows you both the logic and method of implementation. Real world cases are referenced to highlight how each of the key concepts can be put to practical use.

Python Social Media Analytics (Paperback): Siddhartha Chatterjee, Michal Krystyanczuk Python Social Media Analytics (Paperback)
Siddhartha Chatterjee, Michal Krystyanczuk
R1,291 Discovery Miles 12 910 Ships in 18 - 22 working days

Leverage the power of Python to collect, process, and mine deep insights from social media data About This Book * Acquire data from various social media platforms such as Facebook, Twitter, YouTube, GitHub, and more * Analyze and extract actionable insights from your social data using various Python tools * A highly practical guide to conducting efficient social media analytics at scale Who This Book Is For If you are a programmer or a data analyst familiar with the Python programming language and want to perform analyses of your social data to acquire valuable business insights, this book is for you. The book does not assume any prior knowledge of any data analysis tool or process. What You Will Learn * Understand the basics of social media mining * Use PyMongo to clean, store, and access data in MongoDB * Understand user reactions and emotion detection on Facebook * Perform Twitter sentiment analysis and entity recognition using Python * Analyze video and campaign performance on YouTube * Mine popular trends on GitHub and predict the next big technology * Extract conversational topics on public internet forums * Analyze user interests on Pinterest * Perform large-scale social media analytics on the cloud In Detail Social Media platforms such as Facebook, Twitter, Forums, Pinterest, and YouTube have become part of everyday life in a big way. However, these complex and noisy data streams pose a potent challenge to everyone when it comes to harnessing them properly and benefiting from them. This book will introduce you to the concept of social media analytics, and how you can leverage its capabilities to empower your business. Right from acquiring data from various social networking sources such as Twitter, Facebook, YouTube, Pinterest, and social forums, you will see how to clean data and make it ready for analytical operations using various Python APIs. This book explains how to structure the clean data obtained and store in MongoDB using PyMongo. You will also perform web scraping and visualize data using Scrappy and Beautifulsoup. Finally, you will be introduced to different techniques to perform analytics at scale for your social data on the cloud, using Python and Spark. By the end of this book, you will be able to utilize the power of Python to gain valuable insights from social media data and use them to enhance your business processes. Style and approach This book follows a step-by-step approach to teach readers the concepts of social media analytics using the Python programming language. To explain various data analysis processes, real-world datasets are used wherever required.

Tableau 10 Bootcamp (Paperback): Joshua N Milligan, Donabel Santos Tableau 10 Bootcamp (Paperback)
Joshua N Milligan, Donabel Santos
R915 Discovery Miles 9 150 Ships in 18 - 22 working days

Sharpen your data visualization skills with Tableau 10 Bootcamp. About This Book * Make informed decisions using powerful visualizations in Tableau * Learn effective data storytelling to transform how your business uses ideas * Use this extensive bootcamp that makes you an efficient Tableau user in a short span of time Who This Book Is For This book caters to business, data, and analytics professionals who want to build rich interactive visualizations using Tableau Desktop. Familiarity with previous versions of Tableau will be helpful, but not necessary. What You Will Learn * Complete practical Tableau tasks with each chapter * Build different types of charts in Tableau with ease * Extend data using calculated fields and parameters * Prepare and refine data for analysis * Create engaging and interactive dashboards * Present data effectively using story points In Detail Tableau is a leading visual analytics software that can uncover insights for better and smarter decision-making. Tableau has an uncanny ability to beautify your data, compared to other BI tools, which makes it an ideal choice for performing fast and easy visual analysis. A military camp style fast-paced learning book that builds your understanding of Tableau 10 in no time. This day based learning guide contains the best elements from two of our published books, Learning Tableau 10 - Second Edition and Tableau 10 Business Intelligence Cookbook, and delivers practical, learning modules in manageable chunks. Each chunk is delivered in a "day", and each "day" is a productive day. Each day builds your competency in Tableau. You will increase your competence in integrating analytics and forecasting to enhance data analysis during the course of this Bootcamp. Each chapter presents core concepts and key takeaways about a topic in Tableau and provides a series of hands-on exercises. In addition to these exercises, at the end of the chapter, you will find self-check quizzes and extra drills to challenge you, to take what you learned to the next level. To summarize, this book will equip you with step-by-step instructions through rigorous tasks, practical callouts, and various real-world examples and assignments to reinforce your understanding of Tableau 10. Style and approach A fast paced book filled with highly-effective real-world examples to help you build new things and help you in solving problems in newer and unseen ways.

R: Recipes for Analysis, Visualization and Machine Learning (Paperback): Viswa Viswanathan, Shanthi Viswanathan, Atmajitsinh... R: Recipes for Analysis, Visualization and Machine Learning (Paperback)
Viswa Viswanathan, Shanthi Viswanathan, Atmajitsinh Gohil, Yu-Wei, Chiu (David Chiu)
R2,189 Discovery Miles 21 890 Ships in 18 - 22 working days

Get savvy with R language and actualize projects aimed at analysis, visualization and machine learning About This Book * Proficiently analyze data and apply machine learning techniques * Generate visualizations, develop interactive visualizations and applications to understand various data exploratory functions in R * Construct a predictive model by using a variety of machine learning packages Who This Book Is For This Learning Path is ideal for those who have been exposed to R, but have not used it extensively yet. It covers the basics of using R and is written for new and intermediate R users interested in learning. This Learning Path also provides in-depth insights into professional techniques for analysis, visualization, and machine learning with R - it will help you increase your R expertise, regardless of your level of experience. What You Will Learn * Get data into your R environment and prepare it for analysis * Perform exploratory data analyses and generate meaningful visualizations of the data * Generate various plots in R using the basic R plotting techniques * Create presentations and learn the basics of creating apps in R for your audience * Create and inspect the transaction dataset, performing association analysis with the Apriori algorithm * Visualize associations in various graph formats and find frequent itemset using the ECLAT algorithm * Build, tune, and evaluate predictive models with different machine learning packages * Incorporate R and Hadoop to solve machine learning problems on big data In Detail The R language is a powerful, open source, functional programming language. At its core, R is a statistical programming language that provides impressive tools to analyze data and create high-level graphics. This Learning Path is chock-full of recipes. Literally! It aims to excite you with awesome projects focused on analysis, visualization, and machine learning. We'll start off with data analysis - this will show you ways to use R to generate professional analysis reports. We'll then move on to visualizing our data - this provides you with all the guidance needed to get comfortable with data visualization with R. Finally, we'll move into the world of machine learning - this introduces you to data classification, regression, clustering, association rule mining, and dimension reduction. This Learning Path combines some of the best that Packt has to offer in one complete, curated package. It includes content from the following Packt products: * R Data Analysis Cookbook by Viswa Viswanathan and Shanthi Viswanathan * R Data Visualization Cookbook by Atmajitsinh Gohil * Machine Learning with R Cookbook by Yu-Wei, Chiu (David Chiu) Style and approach This course creates a smooth learning path that will teach you how to analyze data and create stunning visualizations. The step-by-step instructions provided for each recipe in this comprehensive Learning Path will show you how to create machine learning projects with R.

Apache Spark for Data Science Cookbook (Paperback): Padma Priya Chitturi Apache Spark for Data Science Cookbook (Paperback)
Padma Priya Chitturi
R1,200 Discovery Miles 12 000 Ships in 18 - 22 working days

Over insightful 90 recipes to get lightning-fast analytics with Apache Spark About This Book * Use Apache Spark for data processing with these hands-on recipes * Implement end-to-end, large-scale data analysis better than ever before * Work with powerful libraries such as MLLib, SciPy, NumPy, and Pandas to gain insights from your data Who This Book Is For This book is for novice and intermediate level data science professionals and data analysts who want to solve data science problems with a distributed computing framework. Basic experience with data science implementation tasks is expected. Data science professionals looking to skill up and gain an edge in the field will find this book helpful. What You Will Learn * Explore the topics of data mining, text mining, Natural Language Processing, information retrieval, and machine learning. * Solve real-world analytical problems with large data sets. * Address data science challenges with analytical tools on a distributed system like Spark (apt for iterative algorithms), which offers in-memory processing and more flexibility for data analysis at scale. * Get hands-on experience with algorithms like Classification, regression, and recommendation on real datasets using Spark MLLib package. * Learn about numerical and scientific computing using NumPy and SciPy on Spark. * Use Predictive Model Markup Language (PMML) in Spark for statistical data mining models. In Detail Spark has emerged as the most promising big data analytics engine for data science professionals. The true power and value of Apache Spark lies in its ability to execute data science tasks with speed and accuracy. Spark's selling point is that it combines ETL, batch analytics, real-time stream analysis, machine learning, graph processing, and visualizations. It lets you tackle the complexities that come with raw unstructured data sets with ease. This guide will get you comfortable and confident performing data science tasks with Spark. You will learn about implementations including distributed deep learning, numerical computing, and scalable machine learning. You will be shown effective solutions to problematic concepts in data science using Spark's data science libraries such as MLLib, Pandas, NumPy, SciPy, and more. These simple and efficient recipes will show you how to implement algorithms and optimize your work. Style and approach This book contains a comprehensive range of recipes designed to help you learn the fundamentals and tackle the difficulties of data science. This book outlines practical steps to produce powerful insights into Big Data through a recipe-based approach.

Apache Spark 2.x for Java Developers (Paperback): Sourav Gulati, Sumit Kumar Apache Spark 2.x for Java Developers (Paperback)
Sourav Gulati, Sumit Kumar
R1,308 Discovery Miles 13 080 Ships in 18 - 22 working days

Unleash the data processing and analytics capability of Apache Spark with the language of choice: Java About This Book * Perform big data processing with Spark-without having to learn Scala! * Use the Spark Java API to implement efficient enterprise-grade applications for data processing and analytics * Go beyond mainstream data processing by adding querying capability, Machine Learning, and graph processing using Spark Who This Book Is For If you are a Java developer interested in learning to use the popular Apache Spark framework, this book is the resource you need to get started. Apache Spark developers who are looking to build enterprise-grade applications in Java will also find this book very useful. What You Will Learn * Process data using different file formats such as XML, JSON, CSV, and plain and delimited text, using the Spark core Library. * Perform analytics on data from various data sources such as Kafka, and Flume using Spark Streaming Library * Learn SQL schema creation and the analysis of structured data using various SQL functions including Windowing functions in the Spark SQL Library * Explore Spark Mlib APIs while implementing Machine Learning techniques to solve real-world problems * Get to know Spark GraphX so you understand various graph-based analytics that can be performed with Spark In Detail Apache Spark is the buzzword in the big data industry right now, especially with the increasing need for real-time streaming and data processing. While Spark is built on Scala, the Spark Java API exposes all the Spark features available in the Scala version for Java developers. This book will show you how you can implement various functionalities of the Apache Spark framework in Java, without stepping out of your comfort zone. The book starts with an introduction to the Apache Spark 2.x ecosystem, followed by explaining how to install and configure Spark, and refreshes the Java concepts that will be useful to you when consuming Apache Spark's APIs. You will explore RDD and its associated common Action and Transformation Java APIs, set up a production-like clustered environment, and work with Spark SQL. Moving on, you will perform near-real-time processing with Spark streaming, Machine Learning analytics with Spark MLlib, and graph processing with GraphX, all using various Java packages. By the end of the book, you will have a solid foundation in implementing components in the Spark framework in Java to build fast, real-time applications. Style and approach This practical guide teaches readers the fundamentals of the Apache Spark framework and how to implement components using the Java language. It is a unique blend of theory and practical examples, and is written in a way that will gradually build your knowledge of Apache Spark.

Implementing Tableau Server - A Guide to implementing Tableau Server (Paperback): Chandraish Sinha Implementing Tableau Server - A Guide to implementing Tableau Server (Paperback)
Chandraish Sinha
R416 Discovery Miles 4 160 Ships in 18 - 22 working days
Hacking University - Computer Hacking and Mobile Hacking 2 Manuscript Bundle: Essential Beginners Guide on How to Become an... Hacking University - Computer Hacking and Mobile Hacking 2 Manuscript Bundle: Essential Beginners Guide on How to Become an Amateur Hacker and Hacking Mobile Devices, Tablets, Game Consoles, and Apps. (Hacking, How to Hack, Hacking for Beginners, Computer, (Paperback)
Isaac D Cody
R501 Discovery Miles 5 010 Ships in 18 - 22 working days
pgRouting - A Practical Guide (Paperback): Regina O. Obe, Leo S. Hsu pgRouting - A Practical Guide (Paperback)
Regina O. Obe, Leo S. Hsu; Edited by Gary E. Sherman
R1,276 Discovery Miles 12 760 Ships in 18 - 22 working days
Big Data Beyond the Hype: A Guide to Conversations for Today's Data Center (Paperback, Ed): Paul Zikopoulos, Dirk Deroos,... Big Data Beyond the Hype: A Guide to Conversations for Today's Data Center (Paperback, Ed)
Paul Zikopoulos, Dirk Deroos, Christopher Bienko, Marc Andrews, Rick Buglio
R539 R508 Discovery Miles 5 080 Save R31 (6%) Ships in 18 - 22 working days

Gain insight into how to govern and consume IBM's unique in-motion and at-rest Big Data analytic capabilitiesA. R. Ammons once said, "A word too much repeated falls out of being", and although the term Big Data sometimes seems to be "too much repeated", it's not about to fall "out of being". That said, it is subject to a lot of hype. The term Big Data is a bit of a misnomer. Truth be told, we're not even big fans of the term--despite the fact that it is so prominently displayed on the cover of this book--because it implies that other data is somehow small (it might be) or that this particular type of data is large in size (it can be, but doesn't have to be). This is Big Data in a nutshell: It is the ability to retain, process, and understand data like never before. It can mean more data than what you are using today; but it can also mean different kinds of data, a venture into the unstructured world where most of today's data resides. The Big Data opportunity. It's a shift, rift, lift, or cliff for your business--this book is going to help you experience the shift and lift, while those that don't work to get beyond the hype end up in a rift or cliff. In this book you will learn how cognitive computing systems, like IBM Watson, fit into the Big Data world. You'll learn how Big Data needs a "ground-to-cloud" architecture, what a Data Refinery looks like, and theimportance of a next generation data platform. Gain an understanding of the concepts of data-in-motion, data-at-rest (technologies like Hadoop play here, as well as others), the role that NoSQL and polyglot play in a leading edge analytics architecture, and more. Get details about the Big Data platform manifesto and why it is a must for any Big Data project. Capturing, storing, refining, transforming, governing, securing, and analyzing data, traditionally or as a service, are important topics alsocovered in this book.

Essential Bioinformatics (Hardcover): Ashwani Kumar Essential Bioinformatics (Hardcover)
Ashwani Kumar
R4,060 Discovery Miles 40 600 Ships in 9 - 17 working days

A flood of data means that many of the challenges in biology are now challenges in computing. Bioinformatics, the application of computational techniques to analyse the information associated with biomolecules on a large-scale, has now firmly established itself as a discipline in molecular biology, and encompasses a wide range of subject areas from structural biology, genomics to gene expression studies. In this text we provide an introduction and overview of the current state of the field. We discuss the main principles that underpin bioinformatics analyses, look at the types of biological information and databases that are commonly used, and finally examine some of the studies that are being conducted, particularly with reference to transcription regulatory systems. The aims of bioinformatics are threefold. First, at its simplest bioinformatics organises data in a way that allows researchers to access existing information and to submit new entries as they are produced, e.g. the Protein Data Bank for 3D macromolecular structures . While data-curation is an essential task, the information stored in these databases is essentially useless until analysed. Thus the purpose of bioinformatics extends much further. The second aim is to develop tools and resources that aid in the analysis of data. For example, having sequenced a particular protein, it is of interest to compare it with previously characterised sequences. This needs more than just a simple text-based search and programs such as FASTA and PSI-BLAST must consider what comprises a biologically significant match. Development of such resources dictates expertise in computational theory as well as a thorough understanding of biology. The third aim is to use these tools to analyse the data and interpret the results in a biologically meaningful manner. Traditionally, biological studies examined individual systems in detail, and frequently compared them with a few that are related. In bioinformatics, we can now conduct global analyses of all the available data with the aim of uncovering common principles that apply across many systems and highlight novel feature.

Hacking University - Learn Python Computer Programming from Scratch & Precisely Learn How The Linux Operating Command Line... Hacking University - Learn Python Computer Programming from Scratch & Precisely Learn How The Linux Operating Command Line Works 2 Manuscript Bundle: The Ultimate Beginners Guide in Mastering Python and a Complete Step by Step Guide in Learning Linux (Paperback)
Isaac D Cody
R523 Discovery Miles 5 230 Ships in 18 - 22 working days
Mastering Python Data Analysis (Paperback): Magnus Vilhelm Persson, Luiz Felipe Martins Mastering Python Data Analysis (Paperback)
Magnus Vilhelm Persson, Luiz Felipe Martins
R1,283 Discovery Miles 12 830 Ships in 18 - 22 working days

Become an expert at using Python for advanced statistical analysis of data using real-world examples About This Book * Clean, format, and explore data using graphical and numerical summaries * Leverage the IPython environment to efficiently analyze data with Python * Packed with easy-to-follow examples to develop advanced computational skills for the analysis of complex data Who This Book Is For If you are a competent Python developer who wants to take your data analysis skills to the next level by solving complex problems, then this advanced guide is for you. Familiarity with the basics of applying Python libraries to data sets is assumed. What You Will Learn * Read, sort, and map various data into Python and Pandas * Recognise patterns so you can understand and explore data * Use statistical models to discover patterns in data * Review classical statistical inference using Python, Pandas, and SciPy * Detect similarities and differences in data with clustering * Clean your data to make it useful * Work in Jupyter Notebook to produce publication ready figures to be included in reports In Detail Python, a multi-paradigm programming language, has become the language of choice for data scientists for data analysis, visualization, and machine learning. Ever imagined how to become an expert at effectively approaching data analysis problems, solving them, and extracting all of the available information from your data? Well, look no further, this is the book you want! Through this comprehensive guide, you will explore data and present results and conclusions from statistical analysis in a meaningful way. You'll be able to quickly and accurately perform the hands-on sorting, reduction, and subsequent analysis, and fully appreciate how data analysis methods can support business decision-making. You'll start off by learning about the tools available for data analysis in Python and will then explore the statistical models that are used to identify patterns in data. Gradually, you'll move on to review statistical inference using Python, Pandas, and SciPy. After that, we'll focus on performing regression using computational tools and you'll get to understand the problem of identifying clusters in data in an algorithmic way. Finally, we delve into advanced techniques to quantify cause and effect using Bayesian methods and you'll discover how to use Python's tools for supervised machine learning. Style and approach This book takes a step-by-step approach to reading, processing, and analyzing data in Python using various methods and tools. Rich in examples, each topic connects to real-world examples and retrieves data directly online where possible. With this book, you are given the knowledge and tools to explore any data on your own, encouraging a curiosity befitting all data scientists.

Free Delivery
Pinterest Twitter Facebook Google+
You may like...
Elementary Statistics - A Guide to Data…
Nancy L. Glenn Griesinger, Daniel Vrinceanu, … Paperback R4,447 R3,784 Discovery Miles 37 840
Big Data Analytics and Its Impact on…
Imad Ibrahim, Rafael Brown, … Paperback R2,148 Discovery Miles 21 480
Handbook of Big Data Analytics, Volume 2…
Vadlamani Ravi, Aswani Kumar Cherukuri Hardcover R3,434 R3,099 Discovery Miles 30 990
Handbook of Big Data Analytics, Volume 1…
Vadlamani Ravi, Aswani Kumar Cherukuri Hardcover R3,428 R3,093 Discovery Miles 30 930
Demystifying Graph Data Science - Graph…
Pethuru Raj, Abhishek Kumar, … Hardcover R3,333 R3,010 Discovery Miles 30 100
Data Mining Patterns - New Methods and…
Pascal Poncelet, Florent Masseglia, … Hardcover R4,570 Discovery Miles 45 700
Cognitive and Soft Computing Techniques…
Akash Kumar Bhoi, Victor Hugo Costa de Albuquerque, … Paperback R2,583 Discovery Miles 25 830
Intelligent Data Analysis - Developing…
Hsiao-Fan Wang Hardcover R4,592 Discovery Miles 45 920
Convergence of Big Data Technologies and…
Govind P. Gupta Hardcover R6,690 Discovery Miles 66 900
Machine Learning for Biometrics…
Partha Pratim Sarangi, Madhumita Panda, … Paperback R2,570 Discovery Miles 25 700

 

Partners