0
Your cart

Your cart is empty

Books > Computing & IT > Applications of computing > Databases > Data mining

Buy Now

Multidimensional Mining of Massive Text Data (Paperback) Loot Price: R1,509
Discovery Miles 15 090
Multidimensional Mining of Massive Text Data (Paperback): Chao Zhang, Jiawei Han

Multidimensional Mining of Massive Text Data (Paperback)

Chao Zhang, Jiawei Han

Series: Synthesis Lectures on Data Mining and Knowledge Discovery

 (sign in to rate)
Loot Price R1,509 Discovery Miles 15 090 | Repayment Terms: R141 pm x 12*

Bookmark and Share

Expected to ship within 18 - 22 working days

Unstructured text, as one of the most important data forms, plays a crucial role in data-driven decision making in domains ranging from social networking and information retrieval to scientific research and healthcare informatics. In many emerging applications, people's information need from text data is becoming multidimensional-they demand useful insights along multiple aspects from a text corpus. However, acquiring such multidimensional knowledge from massive text data remains a challenging task. This book presents data mining techniques that turn unstructured text data into multidimensional knowledge. We investigate two core questions. (1) How does one identify task-relevant text data with declarative queries in multiple dimensions? (2) How does one distill knowledge from text data in a multidimensional space? To address the above questions, we develop a text cube framework. First, we develop a cube construction module that organizes unstructured data into a cube structure, by discovering latent multidimensional and multi-granular structure from the unstructured text corpus and allocating documents into the structure. Second, we develop a cube exploitation module that models multiple dimensions in the cube space, thereby distilling from user-selected data multidimensional knowledge. Together, these two modules constitute an integrated pipeline: leveraging the cube structure, users can perform multidimensional, multigranular data selection with declarative queries; and with cube exploitation algorithms, users can extract multidimensional patterns from the selected data for decision making. The proposed framework has two distinctive advantages when turning text data into multidimensional knowledge: flexibility and label-efficiency. First, it enables acquiring multidimensional knowledge flexibly, as the cube structure allows users to easily identify task-relevant data along multiple dimensions at varied granularities and further distill multidimensional knowledge. Second, the algorithms for cube construction and exploitation require little supervision; this makes the framework appealing for many applications where labeled data are expensive to obtain.

General

Imprint: Springer International Publishing AG
Country of origin: Switzerland
Series: Synthesis Lectures on Data Mining and Knowledge Discovery
Release date: March 2019
First published: 2019
Authors: Chao Zhang • Jiawei Han
Dimensions: 235 x 191mm (L x W)
Format: Paperback
Pages: 183
ISBN-13: 978-3-03-100786-6
Languages: English
Subtitles: English
Categories: Books > Science & Mathematics > Mathematics > Probability & statistics
Books > Computing & IT > Applications of computing > Databases > Data mining
Promotions
LSN: 3-03-100786-7
Barcode: 9783031007866

Is the information for this product incomplete, wrong or inappropriate? Let us know about it.

Does this product have an incorrect or missing image? Send us a new image.

Is this product missing categories? Add more categories.

Review This Product

No reviews yet - be the first to create one!

Partners