0
Your cart

Your cart is empty

Browse All Departments
Price
  • R100 - R250 (7)
  • R250 - R500 (60)
  • R500+ (1,251)
  • -
Status
Format
Author / Contributor
Publisher

Books > Computing & IT > Applications of computing > Databases > Data capture & analysis

Data Engineering with Apache Spark, Delta Lake, and Lakehouse - Create scalable pipelines that ingest, curate, and aggregate... Data Engineering with Apache Spark, Delta Lake, and Lakehouse - Create scalable pipelines that ingest, curate, and aggregate complex data in a timely and secure way (Paperback)
Manoj Kukreja, Danil Zburivsky
R1,353 Discovery Miles 13 530 Ships in 10 - 15 working days

Understand the complexities of modern-day data engineering platforms and explore strategies to deal with them with the help of use case scenarios led by an industry expert in big data Key Features Become well-versed with the core concepts of Apache Spark and Delta Lake for building data platforms Learn how to ingest, process, and analyze data that can be later used for training machine learning models Understand how to operationalize data models in production using curated data Book DescriptionIn the world of ever-changing data and schemas, it is important to build data pipelines that can auto-adjust to changes. This book will help you build scalable data platforms that managers, data scientists, and data analysts can rely on. Starting with an introduction to data engineering, along with its key concepts and architectures, this book will show you how to use Microsoft Azure Cloud services effectively for data engineering. You'll cover data lake design patterns and the different stages through which the data needs to flow in a typical data lake. Once you've explored the main features of Delta Lake to build data lakes with fast performance and governance in mind, you'll advance to implementing the lambda architecture using Delta Lake. Packed with practical examples and code snippets, this book takes you through real-world examples based on production scenarios faced by the author in his 10 years of experience working with big data. Finally, you'll cover data lake deployment strategies that play an important role in provisioning the cloud resources and deploying the data pipelines in a repeatable and continuous way. By the end of this data engineering book, you'll know how to effectively deal with ever-changing data and create scalable data pipelines to streamline data science, ML, and artificial intelligence (AI) tasks. What you will learn Discover the challenges you may face in the data engineering world Add ACID transactions to Apache Spark using Delta Lake Understand effective design strategies to build enterprise-grade data lakes Explore architectural and design patterns for building efficient data ingestion pipelines Orchestrate a data pipeline for preprocessing data using Apache Spark and Delta Lake APIs Automate deployment and monitoring of data pipelines in production Get to grips with securing, monitoring, and managing data pipelines models efficiently Who this book is forThis book is for aspiring data engineers and data analysts who are new to the world of data engineering and are looking for a practical guide to building scalable data platforms. If you already work with PySpark and want to use Delta Lake for data engineering, you'll find this book useful. Basic knowledge of Python, Spark, and SQL is expected.

Basic - Eine Einfuhrung in 10 Lektionen Mit Zahlreichen Programmbeispielen, 95 UEbungsaufgaben Und Deren Vollstandigen... Basic - Eine Einfuhrung in 10 Lektionen Mit Zahlreichen Programmbeispielen, 95 UEbungsaufgaben Und Deren Vollstandigen Loesungen (German, Paperback, 2nd 2., Korr. Aufl. ed.)
J Kwiatkowski, Barndt
R1,647 Discovery Miles 16 470 Ships in 10 - 15 working days

Lernziele - Aufbau und Funktionsweise einer Rechenanlage Hardware: Prozessor, Speicher, Ein- und Ausgabegerate - Software: Anwendersoftware, Systemsoftware Programmiersprachen: parametrische, problemorientierte, maschinenarienie und Maschinen-Sprachen Betriebssystem: Kommandosprache (Job Contra! Language) 1. 1 Hardware Eine moderne Rechenanlage ist ein kompliziertes System von vielen miteinander zusammenarbeitenden Einheiten. Sie koennen grob untergliedert werden in die Zentraleinheit (CPU = Central Processor Unit) und die Peripherie. Mit Periphe- rie bezeichnet man im wesentlichen die Ein- und Ausgabe- sowie die Speicher- gerate. Die Gesamtheit aller Einheiten einer Rechenanlage nennt man Konfigu- ration; die Standardkonfiguration besteht aus der Zentraleinheit, einem Eingabe-, einem Ausgabe- und einem Speichergerat. Die logische Verschaltung der einzelnen Einheiten einer Konfiguration ist auf mehrere Arten moeglich und wird als Rechnerarchitektur bezeichnet. UEber eine Schnittstelle sind zwei Einhei- ten miteinander verbunden und koennen Programmbefehle und Daten austau- schen. Die Untergliederung einer Rechenanlage in einzelne Einheiten gilt fur Gross- rechenanlagen und Kleincomputer gleichermassen. Die Abb. 1/1-1/3 zeigen Computersysteme unterschiedlicher Groesse. Bezuglich ihrer Groesse kann man die Rechenanlagen in drei Klassen einteilen: Grossrechner, Minicomputer und Mikrocomputer. Diese Aufteilung hat sich heute immer mehr durchgesetzt, die Grenzen sind allerdings fliessend, und innerhalb einer Gruppe lassen sich weitere Differenzierungen vornehmen. Man darf dabei auch nicht vergessen, dass eine Rechenanlage, die heute von der Leistungsfahigkeit her als "Kleinrechner" einge- stuft wird, vor noch nicht allzu langer Zeit als Rechner "mittlerer Groesse" be- zeichnet wurde. In Abb. 1/4 ist gezeigt, wie man die Klassen der Mikro- und Minicomputer weiter differenzieren kann.

Excel 2021 - Everything you need to know about Excel to go from Beginner to Expert (Paperback): Nora E Wright Excel 2021 - Everything you need to know about Excel to go from Beginner to Expert (Paperback)
Nora E Wright
R512 R419 Discovery Miles 4 190 Save R93 (18%) Ships in 10 - 15 working days
Basic-Wegweiser Fur Den Commodore 64 - Datenverarbeitung Mit Basic 2.0, Basic 4.0 Und Simon's Basic (German, Paperback,... Basic-Wegweiser Fur Den Commodore 64 - Datenverarbeitung Mit Basic 2.0, Basic 4.0 Und Simon's Basic (German, Paperback, 1984 ed.)
Ekkehard Kaier
R2,173 Discovery Miles 21 730 Ships in 10 - 15 working days

Das Wegweiser-Buch weist Wege zum erfolgreichen Einsatz des Commo- dore 64. Das Wegweiser-Buch vermittelt aktuelles Grundlagenwissen zur Datenver- arbeitung bzw. Informatik: -Was ist Hardware, Software und Firmware? - Was sind Grosscomputer und Mikrocomputer? - Was sind Datenstrukturen und Programmstrukturen? -Was sind Betriebssysteme und Anwenderprogramme? - Was heisst, fertige Programm-Pakete einsetzen'? -Was beinhaltet das eigene Programmieren? Nach der Lekture dieses Abschnitts sind Sie in der Lage, den Commodore 64 in den Gesamtrahmen des Gebiets Datenverarbeitung/Informatik einzu- ordnen. Das Wegweiser-Buch gibt eine erste Bedienungsanleitung: -Wie bediene ich Tastatur, Bildschirm, Floppy bzw. Disketteneinheit und Drucker des Commodore 64? -Wie erstelle ich mein erstes Programm in der Programmiersprache BASIC 2.0? -Welche Befehle umfasst BASIC 2.0 (zu jedem Befehl wird ein Beispiel an- gegeben)? -Welche Moeglichkeiten bieten die drei Sprachversionen BASIC 2.0, BASIC 4.0 und SIMON's BASIC? - Laufen die Programme des Commodore 64 auf anderen Mikrocomputern von Commodore? Nach dem Durcharbeiten dieses Abschnitts koennen Sie Ihren Commodore 64 bedienen, Programme laufen lassen und einfache BASIC-Programme selbst erstellen und speichern. Das Wegweiser-Buch enthalt einen kompletten Programmierkurs mit folgen- den grundlegenden BASIC-Anwendungen: - Programme mit den wichtigen Ablaufstrukturen (Folge-, Auswahl-, W- derholungs-und Unterprogrammstrukturen). - Verarbeitung von Text, Ein-/Ausgabe und Tabellen. - Maschinennahe Programmierung( ... Bit fur Bit). - Suchen, Sortieren, Mischen und Gruppieren von Daten. - Sequentielle Datei und Direktzugriff-Datei mit den Sprachen BASIC 2.0 und BASIC 4.0. Vorwort VI - Normale Grafik mit der Standardsprache BASIC 2.0. - Hl RES-Grafik und Sprite-Grafik mit SIMON's BASIC.

Datenverarbeitung Im Luftverkehr (German, Paperback, 1984 ed.): Gunther Becher Datenverarbeitung Im Luftverkehr (German, Paperback, 1984 ed.)
Gunther Becher
R1,638 Discovery Miles 16 380 Ships in 10 - 15 working days

Die Luftverkehrsgesellschaften gehoeren zu den Pionieren in der An- wendung der automatisierten Datenverarbeitung. Sie haben neben ein- fachen und gut strukturierten Abrechnungsaufgaben schon sehr fruh komplexe und hochintegrierte Anwendungen mit ihrer Hilfe erschlossen. Beispiele hierfur sind die Flugreservierungssysteme der grossen Ge- sellschaften, die bereits seit Anfang der 60er Jahre im praktischen Einsatz sind, die Kommunikation zwischen den Gesellschaften innerhalb der IA T A, die sehr komplexen und aktuellen Dispositionsaufgaben, die beim Flugverkehr jeder grossen Gesellschaft wahrzunehmen sind. Aus den Loesungskonzepten fur diese wichtigen Problemstellungen konnten auch fur anspruchsvolle Anwendungsbereiche in Wirtschaftsunterneh- mungen und in der Wissenschaft vielerlei Anregungen gewonnen werden. Ebenfalls wurden dadurch die Entwicklungen bei den Herstellern von ADV- Systemen massgebend beeinflusst. VI Das vorliegende Buch gibt einen UEberblick uber die Aufgabenstellungen der Informationsverarbeitung bei Luftverkehrsgesellschaften. Ins- besondere wird auch die Nutzung der Informationsverarbeitung aller grossen internationalen Gesellschaften in sehr ubersichtlicher Form herausgearbeitet, und es werden die verschiedenen Loesungen gegen- ubergestellt; ausserdem erhalt der Leser einen UEberblick uber die historische Entwicklung in diesem wichtigen Anwendungsbereich. Der Verfasser ist seit zwanzig Jahren fur die Datenverarbeitung bei der Deutschen Lufthansa AG verantwortlich und jetzt im Vorstand dieser Ge- sellschaft, wo er fur die Bereiche Planung und Steuerung, Datenverar- beitung und Fernmeldedienste, Rechnungswesen, Finanzen und das Justitiariat zustandig ist. Fur den Themenbereich des Buches besitzt er eine sehr hohe Kompetenz. Wir sind uberzeugt, dass er mit seiner Schrift auch ausserhalb des Bereichs der Luftverkehrsgesellschaften auf sehr grosses Interesse stossen wird.

Basic-Wegweiser Fur Den Apple II - Datenverarbeitung Mit Applesoft--Basic Fur Apple II/IIe Und Kompatible Mikrocomputer... Basic-Wegweiser Fur Den Apple II - Datenverarbeitung Mit Applesoft--Basic Fur Apple II/IIe Und Kompatible Mikrocomputer (German, Paperback, 1984 ed.)
Ekkehard Kaier
R1,649 Discovery Miles 16 490 Ships in 10 - 15 working days

Das vorliegende Wegweiser-Buch weist Wege zum erfolgreichen Einsatz von Mikrocomputern der Apple II-Famiiie wie Apple lle, Apple li-Plus und sprachgleicher Systeme. Das Wegweiser-Buch vermittelt aktuelles Grundlagenwissen zur Datenver- arbeitung: -Was ist Hardware, Software und Firmware? - Was sind Grosscomputer und Mikrocomputer? -Was sind Datenstrukturen und Programmstrukturen? - Was sind Betriebssysteme und Anwenderprogramme? -Was heisst, fertige Programm-Pakete einsetzen'? -Was beinhaltet das eigene Programmieren? Das Wegweiser-Buch gibt eine erste Benutzungsanleitung: -Wie bedient man den Apple II? -Wie erstellt man das erste eigene Anwenderprogramm? -Wie setzt man die verfugbaren Systemprogramme ein? Das Wegweiser-Buch enthalt einen kompletten Programmierkurs in der Programmiersprache Applesoft-BASIC, der Wege zu grundlegenden Anwen- dungsmoeglichkeiten weist: - Programme mit Schleifen und Unterprogrammen. -Text-, Tabellen-und Grafikverarbeitung. - Formen der Tastatureingabe und Druckausgabe. - Maschinennahe Programmierung in Assembler. -Suchen, Sortieren, Mischen und Gruppieren von Daten. -Sequentielle, direkte/random, index-sequentielle und verkettete Organisation einer Datei. - Datei mit zeigerverketteter Liste und binarem Baum. Das Wegweiser-Buch soll die vom Hersteller gelieferten System-Handbucher keinesfalls ersetzen, sondern erganzen: ln den Apple-Handbuchern werden Programmiersprachen beschrieben (z.B. Applesoft-BASIC Programmierhandbuch), Betriebssysteme (z. B. DOS- Handbuch), technische Eigenschaften (z.B. Apple lle Benutzer Handbuch) oder spezielle Gerate (z. B. Grafik Tablett Handbuch) und Software (z. B. Apple Writer). VI Vorwort Das Wegweiser-Buch hingegen beschreibt die Grundlagen der Datenver- arbeitung, um sie an zahlreichen Anwendungsmoeglichkeiten fur den Apple II zu demonstrieren und zu veranschaulichen. Im Wegweiser-Buch sind 80 Programm-Beispiele als Codierung in Applesoft- BASIC (List) und als Ausfuhrung (Run) wiedergegeben sowie vollstandig beschrieben.

Limitless Analytics with Azure Synapse - An end-to-end analytics service for data processing, management, and ingestion for BI... Limitless Analytics with Azure Synapse - An end-to-end analytics service for data processing, management, and ingestion for BI and ML requirements (Paperback)
Prashant Kumar Mishra, Mukesh Kumar
R1,347 Discovery Miles 13 470 Ships in 10 - 15 working days

Leverage the Azure analytics platform's key analytics services to deliver unmatched intelligence for your data Key Features Learn to ingest, prepare, manage, and serve data for immediate business requirements Bring enterprise data warehousing and big data analytics together to gain insights from your data Develop end-to-end analytics solutions using Azure Synapse Book DescriptionAzure Synapse Analytics, which Microsoft describes as the next evolution of Azure SQL Data Warehouse, is a limitless analytics service that brings enterprise data warehousing and big data analytics together. With this book, you'll learn how to discover insights from your data effectively using this platform. The book starts with an overview of Azure Synapse Analytics, its architecture, and how it can be used to improve business intelligence and machine learning capabilities. Next, you'll go on to choose and set up the correct environment for your business problem. You'll also learn a variety of ways to ingest data from various sources and orchestrate the data using transformation techniques offered by Azure Synapse. Later, you'll explore how to handle both relational and non-relational data using the SQL language. As you progress, you'll perform real-time streaming and execute data analysis operations on your data using various languages, before going on to apply ML techniques to derive accurate and granular insights from data. Finally, you'll discover how to protect sensitive data in real time by using security and privacy features. By the end of this Azure book, you'll be able to build end-to-end analytics solutions while focusing on data prep, data management, data warehousing, and AI tasks. What you will learn Explore the necessary considerations for data ingestion and orchestration while building analytical pipelines Understand pipelines and activities in Synapse pipelines and use them to construct end-to-end data-driven workflows Query data using various coding languages on Azure Synapse Focus on Synapse SQL and Synapse Spark Manage and monitor resource utilization and query activity in Azure Synapse Connect Power BI workspaces with Azure Synapse and create or modify reports directly from Synapse Studio Create and manage IP firewall rules in Azure Synapse Who this book is forThis book is for data architects, data scientists, data engineers, and business analysts who are looking to get up and running with the Azure Synapse Analytics platform. Basic knowledge of data warehousing will be beneficial to help you understand the concepts covered in this book more effectively.

Endliche Koerper - Verstehen, Rechnen, Anwenden (German, Paperback, 2nd 2., Uberarb. Aufl. 2008 ed.): Hans Kurzweil Endliche Koerper - Verstehen, Rechnen, Anwenden (German, Paperback, 2nd 2., Uberarb. Aufl. 2008 ed.)
Hans Kurzweil
R635 Discovery Miles 6 350 Ships in 12 - 17 working days

In jedem Handy, CD-Player und Computer steckt ein Chip, der lineare Gleichungssysteme uber einem endlichen Korper blitzschnell lost, um fehlerbehaftetes Datenmaterial zu korrigieren; dieses Buch erklart das mathematische Innenleben eines solchen Chips. Endliche Korper sind Zahlenbereiche (sog. Galoisfelder) mit nur endlich vielen Zahlen, die man aber addieren, subtrahieren, multiplizieren und dividieren kann. Das Hauptanliegen des Buches ist es, auf elementare Weise zu erklaren und zu uben, wie diese Rechungen ausgefuhrt werden. Es wendet sich an jeden, dem die mathematischen Sprache nicht fremd ist und der wissen mochte, wie endliche Korper funktionieren. Vorausgesetzt wird eine gewisse Vertrautheit mit Grundbegriffen der linearen Algebra, wie sie etwa in einer Vorlesung Ingenieurmathematik geubt werden. Obwohl der Text zielgerichtet ist, bietet er auch eine elementare Einfuhrung in die Algebra, denn endliche Korper konnen ohne algebraische Begriffe Gruppe, Vektorraum, Ring, Korper und Polynom nicht erklart werden."

Linear Regression Analysis 2e (Hardcover, 2nd Edition): G.A.F. Seber Linear Regression Analysis 2e (Hardcover, 2nd Edition)
G.A.F. Seber
R4,563 Discovery Miles 45 630 Ships in 9 - 15 working days

An extensive treatment of a key method in the statistician’s toolbox

For more than two decades, the First Edition of Linear Regression Analysis has been an authoritative resource for one of the most common methods of handling statistical data. There have been many advances in the field over the last twenty years, including the development of more efficient and accurate regression computer programs, new ways of fitting regressions, and new methods of model selection and prediction. Linear Regression Analysis, Second Edition, revises and expands this standard text, providing extensive coverage of state-of-the-art theory and applications of linear regression analysis.

Requiring no specialized knowledge beyond a good grasp of matrix algebra and some acquaintance with straight-line regression and simple analysis of variance models, this new edition features:

  • Up-to-date accounts of computational methods and algorithms currently in use without getting entrenched in minor computing details
  • A careful and detailed survey of the research literature, making this a highly useful reference
  • Expanded coverage of diagnostics, and more discussion of methods of model fitting, model selection and prediction
  • More than 200 problems throughout the book plus outline solutions

Concise, mathematically clear, and comprehensive, Linear Regression Analysis, Second Edition, serves as both a reliable reference for the practitioner and a valuable textbook for the student.

Theta-Funktionen Und Elliptische Funktionen Fur Ti-59 - Mathematische Routinen Der Physik, Chemie Und Technik Teil IV (German,... Theta-Funktionen Und Elliptische Funktionen Fur Ti-59 - Mathematische Routinen Der Physik, Chemie Und Technik Teil IV (German, Paperback, 1983 ed.)
Peter Kahlig
R1,632 Discovery Miles 16 320 Ships in 10 - 15 working days

Die speziellen Funktionen, insbesondere die elliptischen Funktionen, hatten in Physik und Technik stets ungeheure Bedeutung und wurden in den Vorlesungen der Hochschulen im 19. und be- ginnenden 20. Jahrhundert entsprechend berucksichtigt. Spater sind die speziellen Funktionen aus den Vorlesungen verschwunden, und es traten die strukturellen Gesichtspunkte der Mathematik in den Vordergrund. Dies hat zu einer gewissen Entfremdung zwischen Anwendungen und theoretischer Ausbildung im Vorlesungsbetrieb gefuhrt. Allerdings haben im angelsachsischen Sprachraum die speziellen Funktionen stets durch Bucher und Tafelwerke Berucksichtigung gefunden, entsprechend ihrer Bedeutung sowohl fur Anwendungen der Mathematik wie auch fur theoretische Begriffsbildungen. Durch den Computer (bzw. seinen kleinen Bruder, den Taschenrechner) ist die Freude am numerischen Rechnen ganz bedeutend gestiegen. Wegen erhoehter Genauigkeitsanspruche in Physik- und Technik kann man sich heute nicht mehr mit linearen Naherungen begnugen (beruhmtes Beispiel: das Pendel); dadurch braucht auch der Ingenieur und Physiker die speziellen Funktionen, im besonderen die elliptischen Funktionen. Es ist zu begrussen, dass der Autor P. Kahlig, von dem schon analoge Veroeffentlichungen vorliegen, einen Band uber die elliptischen Funktionen und die Theta-Funktionen herausbringt, deren Bedeutung auch fur die Warmeleitung und Diffusion ja wohlbekannt ist. Genaue Funktionswerte sind somit jedem Anwender schnell und leicht zuganglich. Es ist diesem Band weite Verbreitung zu wunschen. Univ. Prof. Dr. Dr. h.c. Edmund Hlawka, Institut fur Analysis, Technische Mathematik und Versicherungs- mathematik der Technischen Universitat sowie Institut fur Mathematik der Universitat Wien. Wirkl. Mitglied der OEsterr. Akademie der Wissenschaften, Mitglied der Deutschen Akademie der Naturforscher, korrespond.

Grundzuge Der Datenverarbeitung - Methoden Und Konzepte Fur Die Anwendungen (German, Paperback, 2nd 2. Aufl. 1983 ed.): Kurt... Grundzuge Der Datenverarbeitung - Methoden Und Konzepte Fur Die Anwendungen (German, Paperback, 2nd 2. Aufl. 1983 ed.)
Kurt Bauknecht, Carl-August Zehnder
R1,691 Discovery Miles 16 910 Ships in 10 - 15 working days

Die Datenverarbeitung spielt seit vie len Jahren eine wesentliche Rolle in einer Vielzahl von Anwendungen in Dienstleistung, Verwal tung und Industrie; in der Forschung ist die Verwendung des Com puters nicht mehr wegzudenken. Es ist daher erstaunlich, wie schmal dennoch vielerorts die Kenntnisse Uber die Computerwelt sind: An wender sind froh, dass der Computer produktiv fUr sie arbeitet, und sie wahren vorsichtige Distanz zur "geheimnisvollen" und sich rasch andernden Computertechnik. Die Computer-Fachleute ihrerseits leben in der Welt der Spezialisten und pflegen ihre technische Sprache, erhaben Uber die Alltagsprobleme des Anwenders. Die beiden Autoren erleben diese Einseitigkeiten seit Jahren in ihrer Tatigkeit als Dozenten einerseits, als engagierte Praktiker anderseits. Dennoch hoffen sie, dass es gerade mit diesem neuen EinfUhrungsbuch gelingt, die einseitigen Positionen abzubauen. Denn auch hinter der schnellen technischen Entwicklung des Compu ters stecken bleibende und einfache Prinzipien der Informatik, die es darzustellen und zu verstehen gilt. Zwei Interessentenkreise sind damit primar angesprochen: Studenten verschiedenster Richtungen (Ingenieure, Oekonomen, Fachinformatiker) sollen erkennen, welche Konzepte des Compu ters und der Datentechnik fUr die Anwendung eine direkte Rolle spielen. Anwender (aus kommerzieller oder technisch-wissenschaftlicher Umgebung) sollen die grundsatzlichen Methoden und Strukturen sehen, welche hinter ihren taglichen Computer-Anwendungen stehen."

Distributed Data Systems with Azure Databricks - Create, deploy, and manage enterprise data pipelines (Paperback): Alan... Distributed Data Systems with Azure Databricks - Create, deploy, and manage enterprise data pipelines (Paperback)
Alan Bernardo Palacio
R1,288 Discovery Miles 12 880 Ships in 10 - 15 working days

Quickly build and deploy massive data pipelines and improve productivity using Azure Databricks Key Features Get to grips with the distributed training and deployment of machine learning and deep learning models Learn how ETLs are integrated with Azure Data Factory and Delta Lake Explore deep learning and machine learning models in a distributed computing infrastructure Book DescriptionMicrosoft Azure Databricks helps you to harness the power of distributed computing and apply it to create robust data pipelines, along with training and deploying machine learning and deep learning models. Databricks' advanced features enable developers to process, transform, and explore data. Distributed Data Systems with Azure Databricks will help you to put your knowledge of Databricks to work to create big data pipelines. The book provides a hands-on approach to implementing Azure Databricks and its associated methodologies that will make you productive in no time. Complete with detailed explanations of essential concepts, practical examples, and self-assessment questions, you'll begin with a quick introduction to Databricks core functionalities, before performing distributed model training and inference using TensorFlow and Spark MLlib. As you advance, you'll explore MLflow Model Serving on Azure Databricks and implement distributed training pipelines using HorovodRunner in Databricks. Finally, you'll discover how to transform, use, and obtain insights from massive amounts of data to train predictive models and create entire fully working data pipelines. By the end of this MS Azure book, you'll have gained a solid understanding of how to work with Databricks to create and manage an entire big data pipeline. What you will learn Create ETLs for big data in Azure Databricks Train, manage, and deploy machine learning and deep learning models Integrate Databricks with Azure Data Factory for extract, transform, load (ETL) pipeline creation Discover how to use Horovod for distributed deep learning Find out how to use Delta Engine to query and process data from Delta Lake Understand how to use Data Factory in combination with Databricks Use Structured Streaming in a production-like environment Who this book is forThis book is for software engineers, machine learning engineers, data scientists, and data engineers who are new to Azure Databricks and want to build high-quality data pipelines without worrying about infrastructure. Knowledge of Azure Databricks basics is required to learn the concepts covered in this book more effectively. A basic understanding of machine learning concepts and beginner-level Python programming knowledge is also recommended.

Python for Data Analysis - From the Beginner to Expert Crash Course 3.0 that will Change your Life as a Digital Programmer... Python for Data Analysis - From the Beginner to Expert Crash Course 3.0 that will Change your Life as a Digital Programmer Thanks to the Minimalism of this Manual. Deep Machine Learning and Big Data (Paperback)
Mik Arduino
R459 Discovery Miles 4 590 Ships in 10 - 15 working days
Thick Big Data - Doing Digital Social Sciences (Hardcover): Dariusz Jemielniak Thick Big Data - Doing Digital Social Sciences (Hardcover)
Dariusz Jemielniak
R2,772 Discovery Miles 27 720 Ships in 10 - 15 working days

The social sciences are becoming datafied. The questions once considered the domain of sociologists are now answered by data scientists operating on large datasets and breaking with methodological tradition, for better or worse. The traditional social sciences, such as sociology or anthropology, are under the double threat of becoming marginalized or even irrelevant, both from new methods of research which require more computational skills and from increasing competition from the corporate world which gains an additional advantage based on data access. However, unlike data scientists, sociologists and anthropologists have a long history of doing qualitative research. The more quantified datasets we have, the more difficult it is to interpret them without adding layers of qualitative interpretation. Big Data therefore needs Thick Data. This book presents the available arsenal of new methods and tools for studying society both quantitatively and qualitatively, opening ground for the social sciences to take the lead in analysing digital behaviour. It shows that Big Data can and should be supplemented and interpreted through thick data as well as cultural analysis. Thick Big Data is critically important for students and researchers in the social sciences to understand the possibilities of digital analysis, both in the quantitative and qualitative area, and to successfully build mixed-methods approaches.

Data For Executives - How to Influence Stakeholders and Achieve Success (Paperback): Nick Hobbie Data For Executives - How to Influence Stakeholders and Achieve Success (Paperback)
Nick Hobbie
R759 R617 Discovery Miles 6 170 Save R142 (19%) Ships in 10 - 15 working days
Introduction to Simula 67 (German, Paperback, 1981 ed.): Gunther Lamprecht Introduction to Simula 67 (German, Paperback, 1981 ed.)
Gunther Lamprecht
R1,661 Discovery Miles 16 610 Ships in 10 - 15 working days
Managing Data Quality - A practical guide (Paperback): Tim King, Julian Schwarzenbach Managing Data Quality - A practical guide (Paperback)
Tim King, Julian Schwarzenbach
R902 R819 Discovery Miles 8 190 Save R83 (9%) Ships in 9 - 15 working days

Data is an increasingly important business asset and enabler for organisational activities. Data quality is a key aspect of data management and failure to understand it increases organisational risk and decreases efficiency and profitability. This book explains data quality management in practical terms, focusing on three key areas - the nature of data in enterprises, the purpose and scope of data quality management, and implementing a data quality management system, in line with ISO 8000-61.

Azure Databricks Cookbook - Accelerate and scale real-time analytics solutions using the Apache Spark-based analytics service... Azure Databricks Cookbook - Accelerate and scale real-time analytics solutions using the Apache Spark-based analytics service (Paperback)
Phani Raj, Vinod Jaiswal
R1,478 Discovery Miles 14 780 Ships in 10 - 15 working days

Get to grips with building and productionizing end-to-end big data solutions in Azure and learn best practices for working with large datasets Key Features Integrate with Azure Synapse Analytics, Cosmos DB, and Azure HDInsight Kafka Cluster to scale and analyze your projects and build pipelines Use Databricks SQL to run ad hoc queries on your data lake and create dashboards Productionize a solution using CI/CD for deploying notebooks and Azure Databricks Service to various environments Book DescriptionAzure Databricks is a unified collaborative platform for performing scalable analytics in an interactive environment. The Azure Databricks Cookbook provides recipes to get hands-on with the analytics process, including ingesting data from various batch and streaming sources and building a modern data warehouse. The book starts by teaching you how to create an Azure Databricks instance within the Azure portal, Azure CLI, and ARM templates. You'll work through clusters in Databricks and explore recipes for ingesting data from sources, including files, databases, and streaming sources such as Apache Kafka and EventHub. The book will help you explore all the features supported by Azure Databricks for building powerful end-to-end data pipelines. You'll also find out how to build a modern data warehouse by using Delta tables and Azure Synapse Analytics. Later, you'll learn how to write ad hoc queries and extract meaningful insights from the data lake by creating visualizations and dashboards with Databricks SQL. Finally, you'll deploy and productionize a data pipeline as well as deploy notebooks and Azure Databricks service using continuous integration and continuous delivery (CI/CD). By the end of this Azure book, you'll be able to use Azure Databricks to streamline different processes involved in building data-driven apps. What you will learn Read and write data from and to various Azure resources and file formats Build a modern data warehouse with Delta Tables and Azure Synapse Analytics Explore jobs, stages, and tasks and see how Spark lazy evaluation works Handle concurrent transactions and learn performance optimization in Delta tables Learn Databricks SQL and create real-time dashboards in Databricks SQL Integrate Azure DevOps for version control, deploying, and productionizing solutions with CI/CD pipelines Discover how to use RBAC and ACLs to restrict data access Build end-to-end data processing pipeline for near real-time data analytics Who this book is forThis recipe-based book is for data scientists, data engineers, big data professionals, and machine learning engineers who want to perform data analytics on their applications. Prior experience of working with Apache Spark and Azure is necessary to get the most out of this book.

Mathematische Routinen Der Physik, Chemie Und Technik Fur Aos-Rechner (German, Paperback, 1979 ed.): Peter Kahlig Mathematische Routinen Der Physik, Chemie Und Technik Fur Aos-Rechner (German, Paperback, 1979 ed.)
Peter Kahlig
R1,653 Discovery Miles 16 530 Ships in 10 - 15 working days
iPhone 12 USER GUIDE - A Complete Step By Step Manual On How To Use The 2020 iPhone 12 Series For Beginners And Seniors To... iPhone 12 USER GUIDE - A Complete Step By Step Manual On How To Use The 2020 iPhone 12 Series For Beginners And Seniors To Master Your New Device Like A Pro. With iOS 14 Updates. (Paperback)
Donald L McGuire
R335 Discovery Miles 3 350 Ships in 10 - 15 working days
Microsoft Power BI Demystified - step by step guide on how to create interactive dashboard and reports using Power BI... Microsoft Power BI Demystified - step by step guide on how to create interactive dashboard and reports using Power BI (Paperback)
Elijah Falode
R806 Discovery Miles 8 060 Ships in 10 - 15 working days
Excel 2021 (Paperback): Jiayi Simonds Excel 2021 (Paperback)
Jiayi Simonds
R454 Discovery Miles 4 540 Ships in 10 - 15 working days
Data Analytics Made Easy - Analyze and present data to make informed decisions without writing any code (Paperback): Andrea De... Data Analytics Made Easy - Analyze and present data to make informed decisions without writing any code (Paperback)
Andrea De Mauro; Foreword by Francesco Marzoni, Andrew J. Walter
R966 Discovery Miles 9 660 Ships in 10 - 15 working days

Learn how to gain insights from your data as well as machine learning and become a presentation pro who can create interactive dashboards Key Features Enhance your presentation skills by implementing engaging data storytelling and visualization techniques Learn the basics of machine learning and easily apply machine learning models to your data Improve productivity by automating your data processes Book DescriptionData Analytics Made Easy is an accessible beginner's guide for anyone working with data. The book interweaves four key elements: Data visualizations and storytelling - Tired of people not listening to you and ignoring your results? Don't worry; chapters 7 and 8 show you how to enhance your presentations and engage with your managers and co-workers. Learn to create focused content with a well-structured story behind it to captivate your audience. Automating your data workflows - Improve your productivity by automating your data analysis. This book introduces you to the open-source platform, KNIME Analytics Platform. You'll see how to use this no-code and free-to-use software to create a KNIME workflow of your data processes just by clicking and dragging components. Machine learning - Data Analytics Made Easy describes popular machine learning approaches in a simplified and visual way before implementing these machine learning models using KNIME. You'll not only be able to understand data scientists' machine learning models; you'll be able to challenge them and build your own. Creating interactive dashboards - Follow the book's simple methodology to create professional-looking dashboards using Microsoft Power BI, giving users the capability to slice and dice data and drill down into the results. What you will learn Understand the potential of data and its impact on your business Import, clean, transform, combine data feeds, and automate your processes Influence business decisions by learning to create engaging presentations Build real-world models to improve profitability, create customer segmentation, automate and improve data reporting, and more Create professional-looking and business-centric visuals and dashboards Open the lid on the black box of AI and learn about and implement supervised and unsupervised machine learning models Who this book is forThis book is for beginners who work with data and those who need to know how to interpret their business/customer data. The book also covers the high-level concepts of data workflows, machine learning, data storytelling, and visualizations, which are useful for managers. No previous math, statistics, or computer science knowledge is required.

Game of Colors: Moderne Bewegtbildproduktion - Theorie und Praxis fur Film, Video und Fernsehen (German, Hardcover, 1. Aufl.... Game of Colors: Moderne Bewegtbildproduktion - Theorie und Praxis fur Film, Video und Fernsehen (German, Hardcover, 1. Aufl. 2016)
Eberhard Hasche, Patrick Ingwer
R1,837 Discovery Miles 18 370 Ships in 12 - 17 working days

Die Umstellung auf die Digitaltechnik kommt einer Revolution in der Film- und TV-Produktion gleich, fur die neue Techniken eingesetzt werden: Scene-linear Color Workflow, digitale Kameratechnik, Digital Compositing, Depth- und Deep-Compositing, Stereo3D, 3D-Modelling und Rendering zur Verwendung in Live-Action-Footage sowie Lidar-unterstutztes Matchmoving und Keying von Greenscreen-Aufnahmen sind Kernthemen dieses Buchs, die zu neuen Workflow-bezogenen Produktionsketten fuhren. Die Autoren erlautern die Grundlagen dieser modernen Produktionsketten in Film, Fernsehen und VFX fur professionelle Anwender.

Python Natural Language Processing Cookbook - Over 50 recipes to understand, analyze, and generate text for implementing... Python Natural Language Processing Cookbook - Over 50 recipes to understand, analyze, and generate text for implementing language processing tasks (Paperback)
Zhenya Antic
R1,098 Discovery Miles 10 980 Ships in 10 - 15 working days

Get to grips with solving real-world NLP problems, such as dependency parsing, information extraction, topic modeling, and text data visualization Key Features Analyze varying complexities of text using popular Python packages such as NLTK, spaCy, sklearn, and gensim Implement common and not-so-common linguistic processing tasks using Python libraries Overcome the common challenges faced while implementing NLP pipelines Book DescriptionPython is the most widely used language for natural language processing (NLP) thanks to its extensive tools and libraries for analyzing text and extracting computer-usable data. This book will take you through a range of techniques for text processing, from basics such as parsing the parts of speech to complex topics such as topic modeling, text classification, and visualization. Starting with an overview of NLP, the book presents recipes for dividing text into sentences, stemming and lemmatization, removing stopwords, and parts of speech tagging to help you to prepare your data. You'll then learn ways of extracting and representing grammatical information, such as dependency parsing and anaphora resolution, discover different ways of representing the semantics using bag-of-words, TF-IDF, word embeddings, and BERT, and develop skills for text classification using keywords, SVMs, LSTMs, and other techniques. As you advance, you'll also see how to extract information from text, implement unsupervised and supervised techniques for topic modeling, and perform topic modeling of short texts, such as tweets. Additionally, the book shows you how to develop chatbots using NLTK and Rasa and visualize text data. By the end of this NLP book, you'll have developed the skills to use a powerful set of tools for text processing. What you will learn Become well-versed with basic and advanced NLP techniques in Python Represent grammatical information in text using spaCy, and semantic information using bag-of-words, TF-IDF, and word embeddings Perform text classification using different methods, including SVMs and LSTMs Explore different techniques for topic modeling such as K-means, LDA, NMF, and BERT Work with visualization techniques such as NER and word clouds for different NLP tools Build a basic chatbot using NLTK and Rasa Extract information from text using regular expression techniques and statistical and deep learning tools Who this book is forThis book is for data scientists and professionals who want to learn how to work with text. Intermediate knowledge of Python will help you to make the most out of this book. If you are an NLP practitioner, this book will serve as a code reference when working on your projects.

Free Delivery
Pinterest Twitter Facebook Google+
You may like...
New Methods of Market Research and…
G. Scott Erickson Hardcover R2,900 Discovery Miles 29 000
Data Visualization with Excel Dashboards…
D Kusleika Paperback R769 Discovery Miles 7 690
ISE Data Analytics for Accounting
Vernon Richardson, Katie Terrell, … Paperback R1,858 Discovery Miles 18 580
Ethics of Data and Analytics - Concepts…
Kirsten Martin Paperback R1,799 Discovery Miles 17 990
Functional Aesthetics for Data…
V Setlur Paperback R738 Discovery Miles 7 380
Value-Driven Data - Identifying…
Edosa Odaro Hardcover R2,732 Discovery Miles 27 320
SQL for Data Scientists - A Beginner's…
RMP Teat Paperback R862 Discovery Miles 8 620
Cancer Prediction for Industrial IoT 4.0…
Meenu Gupta, Rachna Jain, … Hardcover R3,992 Discovery Miles 39 920
The Data Warehouse Toolkit, Third…
R. Kimball Paperback R1,657 R1,529 Discovery Miles 15 290
Fundamentals of Data Engineering - Plan…
Joe Reis Paperback R1,353 Discovery Miles 13 530

 

Partners