0
Your cart

Your cart is empty

Books > Computing & IT > Computer communications & networking

Buy Now

The Four Generations of Entity Resolution (Paperback) Loot Price: R1,702
Discovery Miles 17 020
The Four Generations of Entity Resolution (Paperback): George Papadakis, Ekaterini Ioannou, Emanouil Thanos, Themis Palpanas

The Four Generations of Entity Resolution (Paperback)

George Papadakis, Ekaterini Ioannou, Emanouil Thanos, Themis Palpanas

Series: Synthesis Lectures on Data Management

 (sign in to rate)
Loot Price R1,702 Discovery Miles 17 020 | Repayment Terms: R160 pm x 12*

Bookmark and Share

Expected to ship within 10 - 15 working days

Entity Resolution (ER) lies at the core of data integration and cleaning and, thus, a bulk of the research examines ways for improving its effectiveness and time efficiency. The initial ER methods primarily target Veracity in the context of structured (relational) data that are described by a schema of well-known quality and meaning. To achieve high effectiveness, they leverage schema, expert, and/or external knowledge. Part of these methods are extended to address Volume, processing large datasets through multi-core or massive parallelization approaches, such as the MapReduce paradigm. However, these early schema-based approaches are inapplicable to Web Data, which abound in voluminous, noisy, semi-structured, and highly heterogeneous information. To address the additional challenge of Variety, recent works on ER adopt a novel, loosely schema-aware functionality that emphasizes scalability and robustness to noise. Another line of present research focuses on the additional challenge of Velocity, aiming to process data collections of a continuously increasing volume. The latest works, though, take advantage of the significant breakthroughs in Deep Learning and Crowdsourcing, incorporating external knowledge to enhance the existing words to a significant extent. This synthesis lecture organizes ER methods into four generations based on the challenges posed by these four Vs. For each generation, we outline the corresponding ER workflow, discuss the state-of-the-art methods per workflow step, and present current research directions. The discussion of these methods takes into account a historical perspective, explaining the evolution of the methods over time along with their similarities and differences. The lecture also discusses the available ER tools and benchmark datasets that allow expert as well as novice users to make use of the available solutions.

General

Imprint: Springer International Publishing AG
Country of origin: Switzerland
Series: Synthesis Lectures on Data Management
Release date: March 2021
First published: 2021
Authors: George Papadakis • Ekaterini Ioannou • Emanouil Thanos • Themis Palpanas
Dimensions: 235 x 191 x 17mm (L x W x T)
Format: Paperback
Pages: 152
ISBN-13: 978-3-03-100750-7
Languages: English
Subtitles: English
Categories: Books > Computing & IT > General theory of computing > Data structures
Books > Computing & IT > Computer programming > Algorithms & procedures
Books > Computing & IT > Computer communications & networking > General
Promotions
LSN: 3-03-100750-6
Barcode: 9783031007507

Is the information for this product incomplete, wrong or inappropriate? Let us know about it.

Does this product have an incorrect or missing image? Send us a new image.

Is this product missing categories? Add more categories.

Review This Product

No reviews yet - be the first to create one!

You might also like..

CISA - Certified Information Systems…
Cannon Paperback R1,774 R1,415 Discovery Miles 14 150
CompTIA Security+ Guide To Network…
Mark Ciampa Paperback R1,389 R1,290 Discovery Miles 12 900
Managing Business Projects - The…
Frank Einhorn Paperback R515 Discovery Miles 5 150
Guide to Networking Essentials
Greg Tomsho Paperback R1,423 R1,319 Discovery Miles 13 190
Network+ Guide to Networks
Jill West, Jean Andrews, … Paperback R1,285 Discovery Miles 12 850
Scale-Free Networks - Complex Webs in…
Guido Caldarelli Hardcover R4,274 Discovery Miles 42 740
CCNA 200-301 Network Simulator
Sean Wilkins Digital product license key R5,302 R3,116 Discovery Miles 31 160
The Gathering Cloud
J. R. Carpenter Paperback R429 Discovery Miles 4 290
PCI Dss: A Pocket Guide
IT Governance Paperback R425 Discovery Miles 4 250
Two-Factor Authentication
Mark Stanislav Paperback R543 Discovery Miles 5 430
Loose Leaf for Data Communications and…
Behrouz A. Forouzan Loose-leaf R2,766 R2,208 Discovery Miles 22 080
ISO27001/ISO27002 - A Pocket Guide
Alan Calder Paperback R695 Discovery Miles 6 950

See more

Partners