

Conceptual Structures: From Information to Intelligence - 18th International Conference on Conceptual Structures, ICCS 2010, Kuching, Sarawak, Malaysia, July 26-30, 2010, Proceedings (Paperback, 2010 ed.)
Madalina Croitoru, Sebastien Ferre, Dickson Lukose
R1,533 Discovery Miles 15 330 Ships in 10 - 15 working days

The 18th International Conference on Conceptual Structures (ICCS 2010) was the latest in a series of annual conferences that have been held in Europe, Australia, and North America since 1993. The focus of the conference has been the representation and analysis of conceptual knowledge for research and practical application. ICCS brings together researchers and practitioners in information and computer sciences as well as social science to explore novel ways that conceptual structures can be deployed. Arising from the research on knowledge representation and reasoning with conceptual graphs, over the years ICCS has broadened its scope to include innovations from a wider range of theories and related practices, among them other forms of graph-based reasoning systems like RDF or existential graphs, formal concept analysis, Semantic Web technologies, ontologies, concept mapping and more. Accordingly, ICCS represents a family of approaches related to conceptual structures that build on the successes with techniques derived from artificial intelligence, knowledge representation and reasoning, applied mathematics and lattice theory, computational linguistics, conceptual modeling and design, diagrammatic reasoning and logic, intelligent systems and knowledge management. The ICCS 2010 theme "From Information to Intelligence" hints at unveiling the reasoning capabilities of conceptual structures. Indeed, improvements in storage capacity and performance of computing infrastructure have also affected the nature of knowledge representation and reasoning (KRR) systems, shifting their focus toward representational power and execution performance. Therefore, KRR research is now faced with a challenge of developing knowledge representation and reasoning structures optimized for such reasonings.

Case-Based Reasoning - 18th International Conference, ICCBR 2010, Alessandria, Italy, July 19-22, 2010 Proceedings (Paperback, 2010 ed.)
Isabelle Bichindaritz, Stefania Montani
R3,037 Discovery Miles 30 370 Ships in 10 - 15 working days

The International Conference on Case-Based Reasoning (ICCBR) is the preeminent international meeting on case-based reasoning (CBR). Through 2009, ICCBR (http://www.iccbr.org) had been a biennial conference, held in alternation with its sister conference, the European Conference on Case-Based Reasoning (http://www.eccbr.org), which was located in Europe. At the 2009 ICCBR, the ICCBR Program Committee elected to extend an offer of consolidation with ECCBR. The offer was accepted by the ECCBR 2010 organizers and they have considered it approved by the ECCBR community, as the two conferences share a majority of Program Committee members. ICCBR and ECCBR have been the leading conferences on CBR. From 2010, ICCBR and ECCBR will be merged in a single conference series, called ICCBR. As there had been eight previous ICCBR events and nine previous ECCBR events, the combined series is considered the 18th ICCBR. ICCBR 2010 (http://www.iccbr.org/iccbr10) was therefore the 18th in this series of international conferences highlighting the most significant contributions to the field of CBR. The conference took place during July 19-22, 2010 in the city of Alessandria, Italy, on the beautiful campus of the University of Piemonte Orientale "A. Avogadro." Previous ICCBR conferences were held in Sesimbra, Portugal (1995), Providence, Rhode Island, USA (1997), Seeon Monastery, Germany (1999), Vancouver, BC, Canada (2001), Trondheim, Norway (2003), Chicago, Illinois, USA (2005), Belfast, Northern Ireland (2007), and Seattle, Washington, USA (2009). Day 1 of the conference hosted an Applications Track, the second Doctoral Consortium, and the third Computer Cooking Contest. The Applications Track featured fielded applications and CBR systems demos in industrial and scientific settings with an emphasis on discussion and networking between researchers and industrials.

Advances in Data Mining: Applications and Theoretical Aspects - 10th Industrial Conference, ICDM 2010, Berlin, Germany, July 12-14, 2010. Proceedings (Paperback, 2010 ed.)
Petra Perner
R3,072 Discovery Miles 30 720 Ships in 10 - 15 working days

These are the proceedings of the tenth event of the Industrial Conference on Data Mining ICDM held in Berlin (www.data-mining-forum.de). For this edition the Program Committee received 175 submissions. After the peer-review process, we accepted 49 high-quality papers for oral presentation that are included in this book. The topics range from theoretical aspects of data mining to applications of data mining such as on multimedia data, in marketing, finance and telecommunication, in medicine and agriculture, and in process control, industry and society. Extended versions of selected papers will appear in the international journal Transactions on Machine Learning and Data Mining (www.ibai-publishing.org/journal/mldm). Ten papers were selected for poster presentations and are published in the ICDM Poster Proceeding Volume by ibai-publishing (www.ibai-publishing.org). In conjunction with ICDM four workshops were held on special hot application-oriented topics in data mining: Data Mining in Marketing DMM, Data Mining in LifeScience DMLS, the Workshop on Case-Based Reasoning for Multimedia Data CBR-MD, and the Workshop on Data Mining in Agriculture DMA. The Workshop on Data Mining in Agriculture ran for the first time this year. All workshop papers will be published in the workshop proceedings by ibai-publishing (www.ibai-publishing.org). Selected papers of CBR-MD will be published in a special issue of the international journal Transactions on Case-Based Reasoning (www.ibai-publishing.org/journal/cbr).

Natural Language Processing and Information Systems - 15th International Conference on Applications of Natural Language to Information Systems, NLDB 2010, Cardiff, UK, June 23-25, 2010, Proceedings (Paperback, 2010 ed.)
Christina J. Hopfe, Yacine Rezgui, Elisabeth Metais, Alun Preece, Haijiang Li
R1,566 Discovery Miles 15 660 Ships in 10 - 15 working days

The 15th International Conference on Applications of Natural Language to Information Systems (NLDB 2010) took place during June 23-25 in Cardiff (UK). Since the first edition in 1995, the NLDB conference has been aiming at bringing together researchers, people working in industry and potential users interested in various applications of natural language in the database and information system area. However, in order to reflect the growing importance of accessing information from a diverse collection of sources (Web, Databases, Sensors, Cloud) in an equally wide range of contexts (including mobile and tethered), the theme of the 15th International Conference on Applications of Natural Language to Information Systems 2010 was "Communicating with Anything, Anywhere in Natural Language." Natural languages and databases are core components in the development of information systems. Natural language processing (NLP) techniques may substantially enhance most phases of the information system lifecycle, starting with requirement analysis, specification and validation, and going up to conflict resolution, result processing and presentation. Furthermore, natural language-based query languages and user interfaces facilitate the access to information for all and allow for new paradigms in the usage of computerized services. Hot topics such as information retrieval (IR), software engineering applications, hidden Markov models, natural language interfaces and semantic networks and graphs imply a complete fusion of databases, IR and NLP techniques.

Agent and Multi-Agent Systems: Technologies and Applications - 4th KES International Symposium, KES-AMSTA 2010, Gdynia, Poland, June 23-25, 2010. Proceedings, Part I (Paperback, 2010 ed.)
Piotr Jedrzejowicz, Ngoc Thanh Nguyen, Robert J. Howlett, Lakhmi C. Jain
R1,608 Discovery Miles 16 080 Ships in 10 - 15 working days

Simulation and Decision Making, Multi-Agent Applications, Management and e-Business, Mobile Agents and Robots, and Machine Learning. In addition to the main tracks of the symposium there were the following five special sessions: Agent-Based Optimization (ABO2010), Agent-Enabled Social Computing (AESC2010), Digital Economy (DE2010), Using Intelligent Systems for Information Technology Assessment (ISITA2010) and a Doctoral Track. Accepted and presented papers highlight new trends and challenges in agent and multi-agent research. We hope these results will be of value to the research community working in the fields of artificial intelligence, collective computational intelligence, robotics, machine learning and, in particular, agent and multi-agent systems technologies and applications. We would like to express our sincere thanks to the Honorary Chairs, Romuald Cwilewicz, President of the Gdynia Maritime University, Poland, and Lakhmi C. Jain, University of South Australia, Australia, for their support. Our special thanks go to the Local Organizing Committee chaired by Ireneusz Czarnowski, who did very solid and excellent work. Thanks are due to the Program Co-chairs, all Program and Reviewer Committee members and all the additional reviewers for their valuable efforts in the review process, which helped us to guarantee the highest quality of selected papers for the conference. We cordially thank the organizers and chairs of special sessions, which essentially contributed to the success of the conference.

Agent and Multi-Agent Systems: Technologies and Applications - 4th KES International Symposium, KES-AMSTA 2010, Gdynia, Poland, June 23-25, 2010. Proceedings, Part II (Paperback, 2010 ed.)
Piotr Jedrzejowicz, Ngoc Thanh Nguyen, Robert J. Howlett, Lakhmi C. Jain
R1,598 Discovery Miles 15 980 Ships in 10 - 15 working days

The presented experiments show that using an evolutionary approach to feature reduction is justified. Feature selection as well as construction gives good results. It is noticeable that the best results of attribute construction yield higher classification accuracy than feature selection alone. That is why carrying out selection before construction, to decrease the search space, is a good solution. Because of the nondeterministic behavior of neural networks, it was difficult to reduce the feature set when using them to evaluate candidate results. For example, a neural network learnt very well on data that was described by the full attribute set, but when this set was decreased it had huge problems doing so within the required number of epochs. That suggests that using C4.5 is much preferred. Numerous experiments have been performed and observed. Analysis of the above results allows us to put forward the hypothesis that it is worth using the Constructor module for feature set reduction. But experiments show that the Constructor module does not work so well when it uses the whole initial set of features - the search space is too large. So it is worth using the Selector first and the Constructor next. The second important issue is that the Constructor destroys the semantic meaning of the features. Newly constructed features are not understandable for users. In some real-life problems measuring feature values is quite expensive; for such problems the Selector seems to be more suitable because it reduces the number of real features. To construct one feature, a number of real (measured) features can be required. The obtained results have encouraged us to extend our system, especially the Constructor module. We plan to develop an enlarged set of functions F which allows the system to be used with data containing different types of features, not only numerical. Such a system will be verified using a greater number of benchmark data sets as well as real data.
Acknowledgments. This work is partially financed from the Ministry of Science and Higher Education Republic of Poland resources in 2008-2010 years as a Poland-Singapore joint research project 65/N-SINGAPORE/2007/0.

User Modeling, Adaptation, and Personalization - 18th International Conference, UMAP 2010, Big Island, HI, USA, June 20-24, 2010, Proceedings (Paperback, Edition.)
Paul De Bra, Alfred Kobsa, David Chin
R1,603 Discovery Miles 16 030 Ships in 10 - 15 working days

The 18th International Conference on User Modeling, Adaptation and Personalization (UMAP 2010) took place on Big Island, Hawaii during June 20-24, 2010. It was the second conference after UMAP 2009 in Trento, Italy, which merged the successful biannual User Modeling (UM) and Adaptive Hypermedia (AH) conference series. The Research Paper track of the conference was chaired by Paul De Bra from the Eindhoven University of Technology and Alfred Kobsa from the University of California, Irvine. They were assisted by an international Program Committee of 80 leading figures in the AH and UM communities as well as highly promising younger researchers. Papers in the Research Paper track were generally reviewed by three and sometimes even four reviewers, with one of them acting as a lead who initiates a discussion between reviewers and reconciles their opinions in a meta-review. The conference solicited Long Research Papers of up to 12 pages in length, which represent original reports of substantive new research. In addition, the conference accepted Short Research Papers of up to six pages in length, whose merit was assessed more in terms of originality and importance than maturity and technical validation. The Research Paper track received 161 submissions, with 112 in the long and 49 in the short paper category. Of these, 26 long and 6 short papers were accepted, resulting in an acceptance rate of 23.2% for long papers and 19.9% overall. Many authors of rejected papers were encouraged to resubmit to the Poster and Demo track of the conference. Following the example of UMAP 2009, the conference also had an Industry Paper track chaired by Bhaskar Mehta from Google, Zürich, Switzerland and Kurt Partridge from PARC, Palo Alto, USA.

Combinatorial Pattern Matching - 21st Annual Symposium, CPM 2010, New York, NY, USA, June 21-23, 2010, Proceedings, (Paperback, Edition.)
Amihood Amir, Laxmi Parida
R1,581 Discovery Miles 15 810 Ships in 10 - 15 working days

The papers contained in this volume were presented at the 21st Annual Symposium on Combinatorial Pattern Matching (CPM 2010) held at NYU-Poly, Brooklyn, New York during June 21-23, 2010. All the papers presented at the conference are original research contributions. We received 53 submissions from 21 countries. Each paper was reviewed by at least three reviewers. The committee decided to accept 28 papers. The program also includes three invited talks by Zvi Galil from Tel Aviv University, Israel, Richard M. Karp from University of California at Berkeley, USA, and Jeffrey S. Vitter from Texas A&M University, USA. The objective of the annual CPM meetings is to provide an international forum for research in combinatorial pattern matching and related applications. It addresses issues of searching and matching strings and more complicated patterns such as trees, regular expressions, graphs, point sets, and arrays. The goal is to derive non-trivial combinatorial properties of such structures and to exploit these properties in order to either achieve superior performance for the corresponding computational problems or pinpoint conditions under which searches cannot be performed efficiently. The meeting also deals with problems in computational biology, data compression and data mining, coding, information retrieval, natural language processing and pattern recognition. The Annual Symposium on Combinatorial Pattern Matching started in 1990, and has since taken place every year. Previous CPM meetings were held in Paris, London, Tucson, Padova, Asilomar, Helsinki, Laguna Beach, Aarhus, Piscataway, Warwick, Montreal, Jerusalem, Fukuoka, Morelia, Istanbul, Jeju Island, Barcelona, London, Ontario, Pisa, and Lille.

Intelligence and Security Informatics - Pacific Asia Workshop, PAISI 2010, Hyderabad, India, June 21, 2010 Proceedings (Paperback, Edition.)
Hsinchun Chen, Michael Chau, Shu-Hsing Li, Shalini Urs, Srinath Srinivasa, …
R1,522 Discovery Miles 15 220 Ships in 10 - 15 working days

Intelligence and security informatics (ISI) is concerned with the study of the development and use of advanced information technologies and systems for national, international, and societal security-related applications. The annual IEEE International Conference series on ISI (http://www.isiconference.org/) was started in 2003. In 2006, the Workshop on ISI (http://isi.se.cuhk.edu.hk/2006/) was held in Singapore in conjunction with the Pacific Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2006), with over 100 contributors and participants from all over the world. This would become the start of a new series of ISI meetings in the Pacific Asia region. PAISI 2007 (http://isi.se.cuhk.edu.hk/2007/) was then held in Chengdu, China. PAISI 2008 (http://isi.se.cuhk.edu.hk/2008/) was held in Taipei, Taiwan, in conjunction with IEEE ISI 2008. PAISI 2009 (http://www.business.hku.hk/paisi/2009/) was held in Bangkok, Thailand, in conjunction with PAKDD 2009. These past ISI conferences and workshops brought together academic researchers, law enforcement and intelligence experts, information technology consultants and practitioners to discuss their research and practice related to various ISI topics. These topics include ISI data management, data and text mining for ISI applications, terrorism informatics, deception and intent detection, terrorist and criminal social network analysis, public health and bio-security, crime analysis, cyber-infrastructure protection, transportation infrastructure security, policy studies and evaluation, information assurance, enterprise risk management, information systems security, among others.

Practical Google Analytics and Google Tag Manager for Developers (Paperback, 1st ed.)
Jonathan Weber
R2,350 R2,219 Discovery Miles 22 190 Save R131 (6%) Ships in 9 - 15 working days

Whether you're a marketer with development skills or a full-on web developer/analyst, Practical Google Analytics and Google Tag Manager for Developers shows you how to implement Google Analytics using Google Tag Manager to jumpstart your web analytics measurement. There's a reason that so many organizations use Google Analytics. Effective collection of data with Google Analytics can reduce customer acquisition costs, provide priceless feedback on new product initiatives, and offer insights that will grow a customer or client base. So where does Google Tag Manager fit in? Google Tag Manager allows for unprecedented collaboration between marketing and technical teams, lightning fast updates to your site, and standardization of the most common tags for on-site tracking and marketing efforts. To achieve the rich data you're really after to better serve your users' needs, you'll need the tools Google Tag Manager provides for a best-in-class implementation of Google Analytics measurement on your site. Written by data evangelist and Google Analytics expert Jonathan Weber and the team at LunaMetrics, this book offers foundational knowledge, a collection of practical Google Tag Manager recipes, well-tested best practices, and troubleshooting tips to get your implementation in tip-top condition. It covers topics including:
* Google Analytics implementation via Google Tag Manager
* How to customize Google Analytics for your unique situation
* Using Google Tag Manager to track and analyze interactions across multiple devices and touch points
* How to extract data from Google Analytics and use Google BigQuery to analyze Big Data questions
What You'll Learn:
* Implementation approaches for Google Analytics, including common pitfalls and troubleshooting strategies.
* How to use tools like Google Tag Manager and jQuery to jumpstart your Google Analytics implementation.
* How to track metrics beyond page views to other critical user interactions, such as clicks on outbound links or downloads, scrolling and page engagement, usage of AJAX forms, and much more.
* How to incorporate additional, customized data into Google Analytics to track individual users or enrich data about their behavior.
Who This Book Is For: Web developers, data analysts, and marketers with a basic familiarity with Google Analytics from an end-user perspective, as well as some knowledge of HTML and JavaScript.

Computational Linguistics and Intelligent Text Processing - 11th International Conference, CICLing 2010, Iasi, Romania, March 21-27, 2010, Proceedings (Paperback, Edition.)
Alexander Gelbukh
R3,108 Discovery Miles 31 080 Ships in 10 - 15 working days

CICLing 2010 was the 11th Annual Conference on Intelligent Text Processing and Computational Linguistics. The CICLing conferences provide a wide-scope forum for discussion of the art and craft of natural language processing research as well as the best practices in its applications. This volume contains three invited papers and the regular papers accepted for oral presentation at the conference. The papers accepted for poster presentation were published in a special issue of another journal (see information on the website). Since 2001, the proceedings of CICLing conferences have been published in Springer's Lecture Notes in Computer Science series, as volumes 2004, 2276, 2588, 2945, 3406, 3878, 4394, 4919, and 5449. The volume is structured into 12 sections:
- Lexical Resources
- Syntax and Parsing
- Word Sense Disambiguation and Named Entity Recognition
- Semantics and Dialog
- Humor and Emotions
- Machine Translation and Multilingualism
- Information Extraction
- Information Retrieval
- Text Categorization and Classification
- Plagiarism Detection
- Text Summarization
- Speech Generation
The 2010 event received a record high number of submissions in the history of the CICLing series. A total of 271 papers by 565 authors from 47 countries were submitted for evaluation by the International Program Committee (see Tables 1 and 2). This volume contains revised versions of 61 papers, by 152 authors, selected for oral presentation; the acceptance rate was 23%.

Demand Prediction in Retail - A Practical Guide to Leverage Data and Predictive Analytics (Hardcover, 1st ed. 2022)
Maxime C. Cohen, Paul-Emile Gras, Arthur Pentecoste, Renyu Zhang
R2,381 R2,001 Discovery Miles 20 010 Save R380 (16%) Ships in 9 - 15 working days

From data collection to evaluation and visualization of prediction results, this book provides a comprehensive overview of the process of predicting demand for retailers. Each step is illustrated with the relevant code and implementation details to demystify how historical data can be leveraged to predict future demand. The tools and methods presented can be applied to most retail settings, both online and brick-and-mortar, such as fashion, electronics, groceries, and furniture. This book is intended to help students in business analytics and data scientists better master how to leverage data for predicting demand in retail applications. It can also be used as a guide for supply chain practitioners who are interested in predicting demand. It enables readers to understand how to leverage data to predict future demand, how to clean and pre-process the data to make it suitable for predictive analytics, what the common caveats are in terms of implementation and how to assess prediction accuracy.

Database Systems for Advanced Applications. DASFAA 2022 International Workshops - BDMS, BDQM, GDMA, IWBT, MAQTDS, and PMBD, Virtual Event, April 11-14, 2022, Proceedings (Paperback, 1st ed. 2022)
Uday Kiran Rage, Vikram Goyal, P. Krishna Reddy
R2,198 Discovery Miles 21 980 Ships in 12 - 17 working days

This volume constitutes the papers of several workshops which were held in conjunction with the 27th International Conference on Database Systems for Advanced Applications, DASFAA 2022, held as a virtual event in April 2022. The 30 revised full papers presented in this book were carefully reviewed and selected from 65 submissions. DASFAA 2022 presents the following six workshops:
* First workshop on Pattern mining and Machine learning in Big complex Databases (PMBD 2021)
* 6th International Workshop on Graph Data Management and Analysis (GDMA 2022)
* First International Workshop on Blockchain Technologies (IWBT2022)
* 8th International Workshop on Big Data Management and Service (BDMS 2022)
* First workshop on Managing Air Quality Through Data Science
* 7th International Workshop on Big Data Quality Management (BDQM 2022)

Data Mining for the Social Sciences - An Introduction (Paperback)
Paul Attewell, David Monaghan
R1,013 R875 Discovery Miles 8 750 Save R138 (14%) Ships in 12 - 17 working days

We live in a world of big data: the amount of information collected on human behavior each day is staggering, and exponentially greater than at any time in the past. Additionally, powerful algorithms are capable of churning through seas of data to uncover patterns. Providing a simple and accessible introduction to data mining, Paul Attewell and David B. Monaghan discuss how data mining substantially differs from conventional statistical modeling familiar to most social scientists. The authors also empower social scientists to tap into these new resources and incorporate data mining methodologies in their analytical toolkits. Data Mining for the Social Sciences demystifies the process by describing the diverse set of techniques available, discussing the strengths and weaknesses of various approaches, and giving practical demonstrations of how to carry out analyses using tools in various statistical software packages.

Beginning Apache Spark 2 - With Resilient Distributed Datasets, Spark SQL, Structured Streaming and Spark Machine Learning library (Paperback, 1st ed.)
Hien Luu
R992 Discovery Miles 9 920 Ships in 9 - 15 working days

Develop applications for the big data landscape with Spark and Hadoop. This book also explains the role of Spark in developing scalable machine learning and analytics applications with Cloud technologies. Beginning Apache Spark 2 gives you an introduction to Apache Spark and shows you how to work with it. Along the way, you'll discover resilient distributed datasets (RDDs); use Spark SQL for structured data; and learn stream processing and build real-time applications with Spark Structured Streaming. Furthermore, you'll learn the fundamentals of Spark ML for machine learning and much more. After you read this book, you will have the fundamentals to become proficient in using Apache Spark and know when and how to apply it to your big data applications.
What You Will Learn:
* Understand Spark unified data processing platform
* How to run Spark in Spark Shell or Databricks
* Use and manipulate RDDs
* Deal with structured data using Spark SQL through its operations and advanced functions
* Build real-time applications using Spark Structured Streaming
* Develop intelligent applications with the Spark Machine Learning library
Who This Book Is For: Programmers and developers active in big data, Hadoop, and Java but who are new to the Apache Spark platform.

People Skills for Analytical Thinkers (Paperback)
Gilbert Eijkelenboom
R516 Discovery Miles 5 160 Ships in 9 - 15 working days

Handbook of Statistical Analysis and Data Mining Applications (Hardcover, 2nd edition)
Robert Nisbet, Gary D. Miner, Ken Yale
R2,746 R2,261 Discovery Miles 22 610 Save R485 (18%) Ships in 12 - 17 working days

Handbook of Statistical Analysis and Data Mining Applications, Second Edition, is a comprehensive professional reference book that guides business analysts, scientists, engineers and researchers, both academic and industrial, through all stages of data analysis, model building and implementation. The handbook helps users discern technical and business problems, understand the strengths and weaknesses of modern data mining algorithms and employ the right statistical methods for practical application. This book is an ideal reference for users who want to address massive and complex datasets with novel statistical approaches and be able to objectively evaluate analyses and solutions. It has clear, intuitive explanations of the principles and tools for solving problems using modern analytic techniques and discusses their application to real problems in ways accessible and beneficial to practitioners across several areas, from science and engineering to medicine, academia and commerce.

The Data and Analytics Playbook - Proven Methods for Governed Data and Analytic Quality (Paperback)
Lowell Fryman, Gregory Lampshire, Dan Meers
R1,248 Discovery Miles 12 480 Ships in 12 - 17 working days

The Data and Analytics Playbook: Proven Methods for Governed Data and Analytic Quality explores the way in which data continues to dominate budgets, along with the varying efforts made across a variety of business enablement projects, including applications, web and mobile computing, big data analytics, and traditional data integration. The book teaches readers how to use proven methods and accelerators to break through data obstacles to provide faster, higher quality delivery of mission critical programs. Drawing upon years of practical experience, and using numerous examples and an easy to understand playbook, Lowell Fryman, Gregory Lampshire, and Dan Meers discuss a simple, proven approach to the execution of multiple data oriented activities. They present a clear set of methods to provide reliable governance, controls, risk, and exposure management for enterprise data and the programs that rely upon it. They also discuss a cost-effective approach to providing sustainable governance and quality outcomes that enhance project delivery, while also ensuring ongoing controls. Example activities, templates, outputs, resources, and roles are explored, along with different organizational models in common use today and the ways they can be mapped to leverage playbook data governance throughout the organization.

Data Mining and Predictive Analytics 2e (Hardcover, 2nd Edition)
DT Larose
R3,786 R3,463 Discovery Miles 34 630 Save R323 (9%) Ships in 12 - 17 working days

Learn methods of data analysis and their application to real-world data sets.
This updated second edition serves as an introduction to data mining methods and models, including association rules, clustering, neural networks, logistic regression, and multivariate analysis. The authors apply a unified white box approach to data mining methods and models. This approach is designed to walk readers through the operations and nuances of the various methods, using small data sets, so readers can gain an insight into the inner workings of the method under review. Chapters provide readers with hands-on analysis problems, representing an opportunity for readers to apply their newly acquired data mining expertise to solving real problems using large, real-world data sets.
Data Mining and Predictive Analytics, Second Edition:
* Offers comprehensive coverage of association rules, clustering, neural networks, logistic regression, multivariate analysis, and the R statistical programming language
* Features over 750 chapter exercises, allowing readers to assess their understanding of the new material
* Provides a detailed case study that brings together the lessons learned in the book
* Includes access to the companion website, www.dataminingconsultant.com, with exclusive password-protected instructor content
Data Mining and Predictive Analytics, Second Edition will appeal to computer science and statistics students, as well as students in MBA programs, and chief executives.

Building the Snowflake Data Cloud - Monetizing and Democratizing Your Data (Paperback, 1st ed.)
Andrew Carruthers
R1,733 R1,363 Discovery Miles 13 630 Save R370 (21%) Ships in 10 - 15 working days

Implement the Snowflake Data Cloud using best practices and reap the benefits of scalability and low cost from the industry-leading, cloud-based data warehousing platform. This book provides a detailed how-to explanation, and assumes familiarity with Snowflake core concepts and principles. It is a project-oriented book with a hands-on approach to designing, developing, and implementing your Data Cloud with security at the center. As you work through the examples, you will develop the skill, knowledge, and expertise to expand your capability by incorporating additional Snowflake features, tools, and techniques. Your Snowflake Data Cloud will be fit for purpose, extensible, and at the forefront of Direct Share, Data Exchange, and Snowflake Marketplace. Building the Snowflake Data Cloud helps your organization monetize the value locked up within your data. As the digital economy takes hold, with data volume, velocity, and variety growing at exponential rates, you need tools and techniques to quickly categorize, collate, summarize, and aggregate data. You also need the means to distribute that data seamlessly to release value. This book shows how Snowflake provides all these things and how to use them to your advantage. The book helps you succeed by delivering faster than you can deliver with legacy products and techniques. You will learn how to leverage what you already know, and what you don't, all applied in a Snowflake Data Cloud context. After reading this book, you will discover and embrace the future where the Data Cloud is central. You will be able to position your organization to take advantage by identifying, adopting, and preparing your tooling for the coming wave of opportunity around sharing and monetizing valuable corporate data.
What You Will Learn
* Understand why Data Cloud is important to the success of your organization
* Up-skill and adopt Snowflake, leveraging the benefits of cloud platforms
* Articulate the Snowflake Marketplace and identify opportunities to monetize data
* Identify tools and techniques to accelerate integration with Data Cloud
* Manage data consumption by monitoring and controlling access to datasets
* Develop data load and transform capabilities for use in future projects
Who This Book Is For
Solution architects seeking implementation patterns to integrate with a Data Cloud; data warehouse developers looking for tips, tools, and techniques to rapidly deliver data pipelines; sales managers who want to monetize their datasets and understand the opportunities that Data Cloud presents; and anyone who wishes to unlock value contained within their data silos.

The Visual Imperative - Creating a Visual Culture of Data Discovery (Paperback)
Lindy Ryan
R1,078 Discovery Miles 10 780 Ships in 12 - 17 working days

Data is powerful. It separates leaders from laggards and it drives business disruption, transformation, and reinvention. Today's most progressive companies are using the power of data to propel their industries into new areas of innovation, specialization, and optimization. The horsepower of new tools and technologies has provided more opportunities than ever to harness, integrate, and interact with massive amounts of disparate data for business insights and value - something that will only continue in the era of the Internet of Things. And, as a new breed of tech-savvy and digitally native knowledge workers rise to the ranks of data scientist and visual analyst, the needs and demands of the people working with data are changing, too. The world of data is changing fast. And, it's becoming more visual. Visual insights are becoming increasingly dominant in information management, and with the reinvigorated role of data visualization, this imperative is a driving force in creating a visual culture of data discovery. The traditional standards of data visualizations are making way for richer, more robust and more advanced visualizations and new ways of seeing and interacting with data. However, while data visualization is a critical tool for exploring and understanding bigger and more diverse and dynamic data, by understanding and embracing our human hardwiring for visual communication and storytelling and properly incorporating key design principles and evolving best practices, we take the next step forward to transform data visualizations from tools into unique visual information assets.

SQL QuickStart Guide - The Simplified Beginner's Guide to Managing, Analyzing, and Manipulating Data With SQL (Paperback)
Walter Shields
R754 R661 Discovery Miles 6 610 Save R93 (12%) Ships in 10 - 15 working days
The Art of Feature Engineering - Essentials for Machine Learning (Paperback)
Pablo Duboue
R1,255 Discovery Miles 12 550 Ships in 12 - 17 working days

When machine learning engineers work with data sets, they may find the results aren't as good as they need. Instead of improving the model or collecting more data, they can use the feature engineering process to help improve results by modifying the data's features to better capture the nature of the problem. This practical guide to feature engineering is an essential addition to any data scientist's or machine learning engineer's toolbox, providing new ideas on how to improve the performance of a machine learning solution. Beginning with the basic concepts and techniques, the text builds up to a unique cross-domain approach that spans data on graphs, texts, time series, and images, with fully worked out case studies. Key topics include binning, out-of-fold estimation, feature selection, dimensionality reduction, and encoding variable-length data. The full source code for the case studies is available on a companion website as Python Jupyter notebooks.

Big Data Analytics in Earth, Atmospheric and Ocean Sciences (Hardcover)
T. Huang
R3,982 Discovery Miles 39 820 Ships in 12 - 17 working days

Applying tools for data analysis to the rapidly increasing volume of data about the Earth.
An ever-increasing volume of Earth data is being gathered. These data are "big" not only in size but also in their complexity, different formats, and varied scientific disciplines. As such, big data are disrupting traditional research. New methods and platforms, such as the cloud, are tackling these new challenges. Big Data Analytics in Earth, Atmospheric, and Ocean Sciences explores new tools for the analysis and display of the rapidly increasing volume of data about the Earth.
Volume highlights include:
* An introduction to the breadth of big earth data analytics
* Architectures developed to support big earth data analytics
* Different analysis and statistical methods for big earth data
* Current applications of analytics to Earth science data
* Challenges to fully implementing big data analytics
The American Geophysical Union promotes discovery in Earth and space science for the benefit of humanity. Its publications disseminate scientific knowledge and provide resources for researchers, students, and professionals.

Data Analytics - The Ultimate Beginner's Guide (Paperback)
Lee Maxwell
R295 Discovery Miles 2 950 Ships in 10 - 15 working days
Free Delivery
You may like...
Cancer Prediction for Industrial IoT 4.0…
Meenu Gupta, Rachna Jain, … Hardcover R3,992 Discovery Miles 39 920
Research Analytics - Boosting University…
Francisco J. Cantu-Ortiz Paperback R1,389 Discovery Miles 13 890
Bayesian Analysis with Excel and R
Conrad Carlberg Paperback R977 Discovery Miles 9 770
The Elements of Statistical Learning…
Trevor Hastie, Robert Tibshirani, … Hardcover R1,977 Discovery Miles 19 770
Fundamentals of Data Engineering - Plan…
Joe Reis Paperback R1,544 R1,353 Discovery Miles 13 530
Becoming a Data Head - How to Think…
AJ Gutman Paperback R755 Discovery Miles 7 550
The Top Ten Algorithms in Data Mining
Xindong Wu, Vipin Kumar Hardcover R2,970 Discovery Miles 29 700
Functional Aesthetics for Data…
V Setlur Paperback R738 Discovery Miles 7 380
Handbook of Educational Data Mining
Cristobal Romero, Sebastian Ventura, … Hardcover R4,637 Discovery Miles 46 370
Big Data and Analytics Applications in…
Gregory Richards Paperback R1,355 Discovery Miles 13 550

 

Partners