If you've successfully used Apache Spark to solve medium-sized problems, but still struggle to realize the "Spark promise" of unparalleled performance on big data, this book is for you. High Performance Spark shows you how to take advantage of Spark at scale, so you can grow beyond the novice level. It's ideal for software engineers, data engineers, developers, and system administrators working with large-scale data applications.
Learn how to make Spark jobs run faster. Productionize exploratory data science with Spark. Handle even larger data sets with Spark. Reduce pipeline running times for faster insights.
Edition: 1st Edition
ISBN: 978-1-49194-320-5
Posted on: 6/25/2016
Format: PDF
Page Count: 175 Pages
Authors: Holden Karau, Rachel Warren