Hurtig levering
Fremragende Trustpilot
Op til 20% Rabat på nye medlemsordrer
Kurv
Mastering Large Datasets
Af: John T. Wolohan Engelsk Paperback
SPAR
kr 70
Mastering Large Datasets
Af: John T. Wolohan Engelsk Paperback

Summary


Modern data science solutions need to be clean, easy to read, and scalable. In Mastering Large Datasets with Python, author J.T. Wolohan teaches you how to take a small project and scale it up using a functionally influenced approach to Python coding. You’ll explore methods and built-in Python tools that lend themselves to clarity and scalability, like the high-performing parallelism method, as well as distributed technologies that allow for high data throughput. The abundant hands-on exercises in this practical tutorial will lock in these essential skills for any large-scale data science project.

 

Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.

About the technology


Programming techniques that work well on laptop-sized data can slow to a crawl—or fail altogether—when applied to massive files or distributed datasets. By mastering the powerful map and reduce paradigm, along with the Python-based tools that support it, you can write data-centric applications that scale efficiently without requiring codebase rewrites as your requirements change.

About the book


Mastering Large Datasets with Python teaches you to write code that can handle datasets of any size. You’ll start with laptop-sized datasets that teach you to parallelize data analysis by breaking large tasks into smaller ones that can run simultaneously. You’ll then scale those same programs to industrial-sized datasets on a cluster of cloud servers. With the map and reduce paradigm firmly in place, you’ll explore tools like Hadoop and PySpark to efficiently process massive distributed datasets, speed up decision-making with machine learning, and simplify your data storage with AWS S3.

What's inside


  • An introduction to the map and reduce paradigm
  • Parallelization with the multiprocessing module and pathos framework
  • Hadoop and Spark for distributed computing
  • Running AWS jobs to process large datasets


About the reader


For Python programmers who need to work faster with more data.

About the author


J. T. Wolohan is a lead data scientist at Booz Allen Hamilton, and a PhD researcher at Indiana University, Bloomington.

 

Table of Contents:

PART 1

1 ¦ Introduction

2 ¦ Accelerating large dataset work: Map and parallel computing

3 ¦ Function pipelines for mapping complex transformations

4 ¦ Processing large datasets with lazy workflows

5 ¦ Accumulation operations with reduce

6 ¦ Speeding up map and reduce with advanced parallelization

PART 2

7 ¦ Processing truly big datasets with Hadoop and Spark

8 ¦ Best practices for large data with Apache Streaming and mrjob

9 ¦ PageRank with map and reduce in PySpark

10 ¦ Faster decision-making with machine learning and PySpark

PART 3

11 ¦ Large datasets in the cloud with Amazon Web Services and S3

12 ¦ MapReduce in the cloud with Amazon’s Elastic MapReduce
Eksklusiv medlemspris 371 kr
Medlemspris 378 kr
Eksklusiv medlemspris og medlemspris er kun for medlemmer. Du bliver automatisk medlem når du køber til eksklusiv medlemspris eller medlemspris. Få 7 dages gratis medlemskab (herefter automatisk 89 kr/30 dage). Læs mere om fordelene
Gratis fragt
23 - 25 hverdage
10 kr
Lavt pakkegebyr
Normalpris 441 kr
Fragt: 59 kr
23 - 25 hverdage
20 kr
Pakkegebyr
Spar 70 kr
Se vores konkurrenters priser her
God 15.826 anmeldelser på

Summary


Modern data science solutions need to be clean, easy to read, and scalable. In Mastering Large Datasets with Python, author J.T. Wolohan teaches you how to take a small project and scale it up using a functionally influenced approach to Python coding. You’ll explore methods and built-in Python tools that lend themselves to clarity and scalability, like the high-performing parallelism method, as well as distributed technologies that allow for high data throughput. The abundant hands-on exercises in this practical tutorial will lock in these essential skills for any large-scale data science project.

 

Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.

About the technology


Programming techniques that work well on laptop-sized data can slow to a crawl—or fail altogether—when applied to massive files or distributed datasets. By mastering the powerful map and reduce paradigm, along with the Python-based tools that support it, you can write data-centric applications that scale efficiently without requiring codebase rewrites as your requirements change.

About the book


Mastering Large Datasets with Python teaches you to write code that can handle datasets of any size. You’ll start with laptop-sized datasets that teach you to parallelize data analysis by breaking large tasks into smaller ones that can run simultaneously. You’ll then scale those same programs to industrial-sized datasets on a cluster of cloud servers. With the map and reduce paradigm firmly in place, you’ll explore tools like Hadoop and PySpark to efficiently process massive distributed datasets, speed up decision-making with machine learning, and simplify your data storage with AWS S3.

What's inside


  • An introduction to the map and reduce paradigm
  • Parallelization with the multiprocessing module and pathos framework
  • Hadoop and Spark for distributed computing
  • Running AWS jobs to process large datasets


About the reader


For Python programmers who need to work faster with more data.

About the author


J. T. Wolohan is a lead data scientist at Booz Allen Hamilton, and a PhD researcher at Indiana University, Bloomington.

 

Table of Contents:

PART 1

1 ¦ Introduction

2 ¦ Accelerating large dataset work: Map and parallel computing

3 ¦ Function pipelines for mapping complex transformations

4 ¦ Processing large datasets with lazy workflows

5 ¦ Accumulation operations with reduce

6 ¦ Speeding up map and reduce with advanced parallelization

PART 2

7 ¦ Processing truly big datasets with Hadoop and Spark

8 ¦ Best practices for large data with Apache Streaming and mrjob

9 ¦ PageRank with map and reduce in PySpark

10 ¦ Faster decision-making with machine learning and PySpark

PART 3

11 ¦ Large datasets in the cloud with Amazon Web Services and S3

12 ¦ MapReduce in the cloud with Amazon’s Elastic MapReduce
Produktdetaljer
Sprog: Engelsk
Sider: 312
ISBN-13: 9781617296239
Indbinding: Paperback
Udgave:
ISBN-10: 1617296236
Udg. Dato: 30 mar 2020
Længde: 24mm
Bredde: 187mm
Højde: 234mm
Oplagsdato: 30 mar 2020
Forfatter(e): John T. Wolohan
Forfatter(e) John T. Wolohan


Kategori Databaseprogrammering


Sprog Engelsk


Indbinding Paperback


Sider 312


Udgave


Længde 24mm


Bredde 187mm


Højde 234mm

MEDLEMSFORDELE
GRATIS FRAGT
SPAR OP TIL 90%
Andre har også købt
BOG (INDBUNDET)
Eksklusiv medlemspris kr 380

kr 499
Normalpris
kr 392
Medlemspris
SPAR
kr 119
BOG (HÆFTET)
Eksklusiv medlemspris kr 199

kr 299
Normalpris
kr 209
Medlemspris
SPAR
kr 100
BOG (INDBUNDET)
Eksklusiv medlemspris kr 859

kr 1.499
Normalpris
kr 923
Medlemspris
SPAR
kr 640
BOG (INDBUNDET)
Eksklusiv medlemspris kr 214

kr 320
Normalpris
kr 225
Medlemspris
SPAR
kr 106
BOG (INDBUNDET)
Eksklusiv medlemspris kr 203

kr 300
Normalpris
kr 213
Medlemspris
SPAR
kr 97
BOG (INDBUNDET)
Eksklusiv medlemspris kr 170

kr 270
Normalpris
kr 180
Medlemspris
SPAR
kr 100
BOG (INDBUNDET)
Eksklusiv medlemspris kr 262

kr 349
Normalpris
kr 271
Medlemspris
SPAR
kr 87
BOG (HÆFTET)
Eksklusiv medlemspris kr 275

kr 320
Normalpris
kr 280
Medlemspris
SPAR
kr 45
BOG (HÆFTET)
Eksklusiv medlemspris kr 211

kr 299
Normalpris
kr 220
Medlemspris
SPAR
kr 88
BOG (PAPERBACK)
Eksklusiv medlemspris kr 165

kr 198
Normalpris
kr 168
Medlemspris
SPAR
kr 33
BOG (INDBUNDET)
Eksklusiv medlemspris kr 175

kr 299
Normalpris
kr 187
Medlemspris
SPAR
kr 124
BOG (INDBUNDET)
Eksklusiv medlemspris kr 240

kr 349
Normalpris
kr 251
Medlemspris
SPAR
kr 109
BOG (HÆFTET)
Eksklusiv medlemspris kr 149

kr 249
Normalpris
kr 159
Medlemspris
SPAR
kr 100
BOG (INDBUNDET)
Eksklusiv medlemspris kr 329

kr 499
Normalpris
kr 346
Medlemspris
SPAR
kr 170
BOG (INDBUNDET)
Eksklusiv medlemspris kr 118

kr 129
Normalpris
kr 119
Medlemspris
SPAR
kr 11
BOG (HÆFTET)
Eksklusiv medlemspris kr 152

kr 199
Normalpris
kr 157
Medlemspris
SPAR
kr 47
BOG (HARDBACK)
Eksklusiv medlemspris kr 25

kr 227
Normalpris
kr 45
Medlemspris
SPAR
kr 202
BOG (PAPERBACK)
Eksklusiv medlemspris kr 852

kr 1.403
Normalpris
kr 907
Medlemspris
SPAR
kr 551
BOG (INDBUNDET)
Eksklusiv medlemspris kr 124

kr 299
Normalpris
kr 142
Medlemspris
SPAR
kr 175
BOG (INDBUNDET)
Eksklusiv medlemspris kr 216

kr 279
Normalpris
kr 222
Medlemspris
SPAR
kr 63
Vi anbefaler også
BOG (PAPERBACK)
Eksklusiv medlemspris kr 514

kr 597
Normalpris
kr 522
Medlemspris
SPAR
kr 83
BOG (PAPERBACK)
Eksklusiv medlemspris kr 367

kr 450
Normalpris
kr 375
Medlemspris
SPAR
kr 83
BOG (PAPERBACK)
Eksklusiv medlemspris kr 347

kr 407
Normalpris
kr 353
Medlemspris
SPAR
kr 60
BOG (PAPERBACK)
Eksklusiv medlemspris kr 446

kr 506
Normalpris
kr 452
Medlemspris
SPAR
kr 60
BOG (PAPERBACK)
Eksklusiv medlemspris kr 470

kr 540
Normalpris
kr 477
Medlemspris
SPAR
kr 70
BOG (PAPERBACK)
Eksklusiv medlemspris kr 479

kr 507
Normalpris
kr 482
Medlemspris
SPAR
kr 28
BOG (PAPERBACK)
Eksklusiv medlemspris kr 330

kr 360
Normalpris
kr 333
Medlemspris
SPAR
kr 30
BOG (PAPERBACK)
Eksklusiv medlemspris kr 485

kr 619
Normalpris
kr 498
Medlemspris
SPAR
kr 134
BOG (PAPERBACK)
Eksklusiv medlemspris kr 286

kr 404
Normalpris
kr 298
Medlemspris
SPAR
kr 118
BOG (PAPERBACK)
Eksklusiv medlemspris kr 461

kr 529
Normalpris
kr 468
Medlemspris
SPAR
kr 68
BOG (PAPERBACK)
Eksklusiv medlemspris kr 446

kr 563
Normalpris
kr 458
Medlemspris
SPAR
kr 117
BOG (PAPERBACK)
Eksklusiv medlemspris kr 474

kr 607
Normalpris
kr 487
Medlemspris
SPAR
kr 133
BOG (PAPERBACK)
Eksklusiv medlemspris kr 365

kr 540
Normalpris
kr 383
Medlemspris
SPAR
kr 175
BOG (PAPERBACK)
Eksklusiv medlemspris kr 398

kr 597
Normalpris
kr 418
Medlemspris
SPAR
kr 199
BOG (PAPERBACK)
Eksklusiv medlemspris kr 365

kr 540
Normalpris
kr 383
Medlemspris
SPAR
kr 175
BOG (PAPERBACK)
Eksklusiv medlemspris kr 319

kr 369
Normalpris
kr 324
Medlemspris
SPAR
kr 50
BOG (PAPERBACK)
Eksklusiv medlemspris kr 427

kr 518
Normalpris
kr 436
Medlemspris
SPAR
kr 91
BOG (PAPERBACK)
Eksklusiv medlemspris kr 294

kr 419
Normalpris
kr 307
Medlemspris
SPAR
kr 125
BOG (HARDBACK)
Eksklusiv medlemspris kr 419

kr 474
Normalpris
kr 425
Medlemspris
SPAR
kr 55
BOG (PAPERBACK)
Eksklusiv medlemspris kr 442

kr 524
Normalpris
kr 450
Medlemspris
SPAR
kr 82