Mining of massive datasets / Jure Leskovec, Anand Rajaraman, Jeffrey David Ullman, Standford University.

Leskovec, Jurij
Call Number
006.312
Author
Leskovec, Jurij, author.
Title
Mining of massive datasets / Jure Leskovec, Anand Rajaraman, Jeffrey David Ullman, Standford University.
Edition
Second edition.
Physical Description
1 online resource (xi, 467 pages) : digital, PDF file(s).
Notes
Title from publisher's bibliographic system (viewed on 05 Oct 2015).
Contents
Data mining -- MapReduce and the new software stack -- Finding similar items -- Mining data streams -- Link analysis -- Frequent itemsets -- Clustering -- Advertising on the Web -- Recommendation systems -- Mining social-network graphs -- Dimensionality reduction -- Large-scale machine learning.
Summary
Written by leading authorities in database and Web technologies, this book is essential reading for students and practitioners alike. The popularity of the Web and Internet commerce provides many extremely large datasets from which information can be gleaned by data mining. This book focuses on practical algorithms that have been used to solve key problems in data mining and can be applied successfully to even the largest datasets. It begins with a discussion of the map-reduce framework, an important tool for parallelizing algorithms automatically. The authors explain the tricks of locality-sensitive hashing and stream processing algorithms for mining data that arrives too fast for exhaustive processing. Other chapters cover the PageRank idea and related tricks for organizing the Web, the problems of finding frequent itemsets and clustering. This second edition includes new and extended coverage on social networks, machine learning and dimensionality reduction.
Added Author
Rajaraman, Anand, author.
Ullman, Jeffrey D., 1942- author.
Subject
DATA MINING.
BIG DATA.
Multimedia
Total Ratings: 0
No records found to display.
 
 
 
02611nam a22003978i 4500
001
 
 
vtls001585571
003
 
 
VRT
005
 
 
20200921122800.0
006
 
 
m|||||o||d||||||||
007
 
 
cr||||||||||||
008
 
 
200921s2014||||enk     o     ||1 0|eng|d
020
$a 9781139924801 (ebook)
020
$z 9781107077232 (hardback)
035
$a (UkCbUP)CR9781139924801
039
9
$y 202009211228 $z santha
040
$a UkCbUP $b eng $e rda $c UkCbUP
050
0
0
$a HF5415.125 $b .L46 2014
082
0
0
$a 006.312 $2 23
100
1
$a Leskovec, Jurij, $e author.
245
1
0
$a Mining of massive datasets / $c Jure Leskovec, Anand Rajaraman, Jeffrey David Ullman, Standford University.
250
$a Second edition.
264
1
$a Cambridge : $b Cambridge University Press, $c 2014.
300
$a 1 online resource (xi, 467 pages) : $b digital, PDF file(s).
336
$a text $b txt $2 rdacontent
337
$a computer $b c $2 rdamedia
338
$a online resource $b cr $2 rdacarrier
500
$a Title from publisher's bibliographic system (viewed on 05 Oct 2015).
505
0
$a Data mining -- MapReduce and the new software stack -- Finding similar items -- Mining data streams -- Link analysis -- Frequent itemsets -- Clustering -- Advertising on the Web -- Recommendation systems -- Mining social-network graphs -- Dimensionality reduction -- Large-scale machine learning.
520
$a Written by leading authorities in database and Web technologies, this book is essential reading for students and practitioners alike. The popularity of the Web and Internet commerce provides many extremely large datasets from which information can be gleaned by data mining. This book focuses on practical algorithms that have been used to solve key problems in data mining and can be applied successfully to even the largest datasets. It begins with a discussion of the map-reduce framework, an important tool for parallelizing algorithms automatically. The authors explain the tricks of locality-sensitive hashing and stream processing algorithms for mining data that arrives too fast for exhaustive processing. Other chapters cover the PageRank idea and related tricks for organizing the Web, the problems of finding frequent itemsets and clustering. This second edition includes new and extended coverage on social networks, machine learning and dimensionality reduction.
650
0
$a DATA MINING.
650
0
$a BIG DATA.
700
1
$a Rajaraman, Anand, $e author.
700
1
$a Ullman, Jeffrey D., $d 1942- $e author.
776
0
8
$i Print version: $z 9781107077232
856
4
0
$u https://doi.org/10.1017/CBO9781139924801
999
$a VIRTUA               
No Reviews to Display
Summary
Written by leading authorities in database and Web technologies, this book is essential reading for students and practitioners alike. The popularity of the Web and Internet commerce provides many extremely large datasets from which information can be gleaned by data mining. This book focuses on practical algorithms that have been used to solve key problems in data mining and can be applied successfully to even the largest datasets. It begins with a discussion of the map-reduce framework, an important tool for parallelizing algorithms automatically. The authors explain the tricks of locality-sensitive hashing and stream processing algorithms for mining data that arrives too fast for exhaustive processing. Other chapters cover the PageRank idea and related tricks for organizing the Web, the problems of finding frequent itemsets and clustering. This second edition includes new and extended coverage on social networks, machine learning and dimensionality reduction.
Notes
Title from publisher's bibliographic system (viewed on 05 Oct 2015).
Contents
Data mining -- MapReduce and the new software stack -- Finding similar items -- Mining data streams -- Link analysis -- Frequent itemsets -- Clustering -- Advertising on the Web -- Recommendation systems -- Mining social-network graphs -- Dimensionality reduction -- Large-scale machine learning.
Subject
DATA MINING.
BIG DATA.
Multimedia