0
Your cart

Your cart is empty

Browse All Departments
  • All Departments
Price
  • R500 - R1,000 (1)
  • R1,000 - R2,500 (2)
  • -
Status
Brand

Showing 1 - 3 of 3 matches in All Departments

Applied Text Analysis with Python - Enabling Language-Aware Data Products with Machine Learning (Paperback): Benjamin Bengfort,... Applied Text Analysis with Python - Enabling Language-Aware Data Products with Machine Learning (Paperback)
Benjamin Bengfort, Rebecca Bilbro, Tony Ojeda
R1,638 R1,127 Discovery Miles 11 270 Save R511 (31%) Ships in 12 - 17 working days

From news and speeches to informal chatter on social media, natural language is one of the richest and most underutilized sources of data. Not only does it come in a constant stream, always changing and adapting in context; it also contains information that is not conveyed by traditional data sources. The key to unlocking natural language is through the creative application of text analytics. This practical book presents a data scientist’s approach to building language-aware products with applied machine learning. You’ll learn robust, repeatable, and scalable techniques for text analysis with Python, including contextual and linguistic feature engineering, vectorization, classification, topic modeling, entity resolution, graph analysis, and visual steering. By the end of the book, you’ll be equipped with practical methods to solve any number of complex real-world problems. Preprocess and vectorize text into high-dimensional feature representations Perform document classification and topic modeling Steer the model selection process with visual diagnostics Extract key phrases, named entities, and graph structures to reason about data in text Build a dialog framework to enable chatbots and language-driven interaction Use Spark to scale processing power and neural networks to scale model complexity

Data Analytics with Hadoop (Paperback): Benjamin Bengfort, Jenny Kim Data Analytics with Hadoop (Paperback)
Benjamin Bengfort, Jenny Kim
R858 R634 Discovery Miles 6 340 Save R224 (26%) Ships in 12 - 17 working days

Ready to use statistical and machine-learning techniques across large data sets? This practical guide shows you why the Hadoop ecosystem is perfect for the job. Instead of deployment, operations, or software development usually associated with distributed computing, you'll focus on particular analyses you can build, the data warehousing techniques that Hadoop provides, and higher order data workflows this framework can produce. Data scientists and analysts will learn how to perform a wide range of techniques, from writing MapReduce and Spark applications with Python to using advanced modeling and data management with Spark MLlib, Hive, and HBase. You'll also learn about the analytical processes and data systems available to build and empower data products that can handle-and actually require-huge amounts of data. Understand core concepts behind Hadoop and cluster computing Use design patterns and parallel analytical algorithms to create distributed data analysis jobs Learn about data management, mining, and warehousing in a distributed context using Apache Hive and HBase Use Sqoop and Apache Flume to ingest data from relational databases Program complex Hadoop and Spark applications with Apache Pig and Spark DataFrames Perform machine learning techniques such as classification, clustering, and collaborative filtering with Spark's MLlib

Practical Data Science Cookbook - (Paperback, 2nd Revised edition): Prabhanjan Tattar, Tony Ojeda, Sean Patrick Murphy,... Practical Data Science Cookbook - (Paperback, 2nd Revised edition)
Prabhanjan Tattar, Tony Ojeda, Sean Patrick Murphy, Benjamin Bengfort, Abhijit Dasgupta
R1,339 Discovery Miles 13 390 Ships in 10 - 15 working days

Over 85 recipes to help you complete real-world data science projects in R and Python About This Book * Tackle every step in the data science pipeline and use it to acquire, clean, analyze, and visualize your data * Get beyond the theory and implement real-world projects in data science using R and Python * Easy-to-follow recipes will help you understand and implement the numerical computing concepts Who This Book Is For If you are an aspiring data scientist who wants to learn data science and numerical programming concepts through hands-on, real-world project examples, this is the book for you. Whether you are brand new to data science or you are a seasoned expert, you will benefit from learning about the structure of real-world data science projects and the programming examples in R and Python. What You Will Learn * Learn and understand the installation procedure and environment required for R and Python on various platforms * Prepare data for analysis by implement various data science concepts such as acquisition, cleaning and munging through R and Python * Build a predictive model and an exploratory model * Analyze the results of your model and create reports on the acquired data * Build various tree-based methods and Build random forest In Detail As increasing amounts of data are generated each year, the need to analyze and create value out of it is more important than ever. Companies that know what to do with their data and how to do it well will have a competitive advantage over companies that don't. Because of this, there will be an increasing demand for people that possess both the analytical and technical abilities to extract valuable insights from data and create valuable solutions that put those insights to use. Starting with the basics, this book covers how to set up your numerical programming environment, introduces you to the data science pipeline, and guides you through several data projects in a step-by-step format. By sequentially working through the steps in each chapter, you will quickly familiarize yourself with the process and learn how to apply it to a variety of situations with examples using the two most popular programming languages for data analysis-R and Python. Style and approach This step-by-step guide to data science is full of hands-on examples of real-world data science tasks. Each recipe focuses on a particular task involved in the data science pipeline, ranging from readying the dataset to analytics and visualization

Free Delivery
Pinterest Twitter Facebook Google+
You may like...
Tower Sign - Beware Of The Dog…
R60 R46 Discovery Miles 460
Peptine Pro Canine/Feline Hydrolysed…
R369 R299 Discovery Miles 2 990
Pure Pleasure Non-Fitted Electric…
 (16)
R289 Discovery Miles 2 890
Loot
Nadine Gordimer Paperback  (2)
R398 R330 Discovery Miles 3 300
Loot
Nadine Gordimer Paperback  (2)
R398 R330 Discovery Miles 3 300
Loot
Nadine Gordimer Paperback  (2)
R398 R330 Discovery Miles 3 300
Multi-Functional Bamboo Standing Laptop…
 (1)
R995 R500 Discovery Miles 5 000
White Glo Coffee & Tea Drinkers' Formula…
R60 Discovery Miles 600
Pink Non-Stretch Fabric Plaster Roll
R10 Discovery Miles 100
Lucky Metal Cut Throat Razer Carrier
R30 Discovery Miles 300

 

Partners