Description
Book Synopsis: Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of the open-source cluster-computing framework. With an emphasis on improvements and new features in Spark 2.0, authors Bill Chambers and Matei Zaharia break down Spark topics into distinct sections, each with unique goals. You'll explore the basic operations and common functions of Spark's structured APIs, as well as Structured Streaming, a new high-level API for building end-to-end streaming applications. Developers and system administrators will learn the fundamentals of monitoring, tuning, and debugging Spark, and explore machine learning techniques and scenarios for employing MLlib, Spark's scalable machine-learning library.
Get a gentle overview of big data and Spark.
Learn about DataFrames, SQL, and Datasets (Spark's core APIs) through worked examples.
Dive into Spark's low-level APIs, RDDs, and execution of SQL and DataFrames.
Understand how Spark runs on a cluster.
Debug, monitor, and tune Spark clusters and applications.
Learn the power of Structured Streaming, Spark's stream-processing engine.
Learn how you can apply MLlib to a variety of problems, including classification or recommendation.
Details
Looking to revolutionize your big data processing? Look no further than Spark: The Definitive Guide. This comprehensive guide, written by the creators of the open-source cluster-computing framework, will take you on a journey into the world of Apache Spark. With a strong emphasis on the latest improvements and new features in Spark 2.0, authors Bill Chambers and Matei Zaharia provide a wealth of knowledge that will make big data processing simple and accessible.
With Spark: The Definitive Guide, you'll gain a deep understanding of Spark's structured APIs, including DataFrames, SQL, and Datasets. Through a series of practical examples, you'll learn how to use Spark's core APIs to perform basic operations and common functions. You'll also dive into Spark's low-level APIs, including RDDs, and see how Spark executes SQL and DataFrames.
The book also goes beyond the basics, explaining how Spark runs on a cluster and offering tips on debugging, monitoring, and tuning Spark clusters and applications.
One of the highlights of Spark: The Definitive Guide is its coverage of Structured Streaming, Spark's powerful stream-processing engine. Learn how to build end-to-end streaming applications with ease, thanks to the step-by-step instructions and real-world examples provided by Chambers and Zaharia.
And if you're looking to apply machine learning techniques to your big data problems, Spark: The Definitive Guide has you covered. Discover how to utilize MLlib, Spark's scalable machine-learning library, to tackle a wide range of problems, from classification to recommendation.
With Spark: The Definitive Guide by your side, you'll gain the knowledge and confidence to tackle big data processing head-on. So why wait? Start your journey to becoming a Spark expert today!
Get your copy of Spark: The Definitive Guide and unlock the true potential of big data processing.