Getting Started with Apache Spark
From Inception to Production

Apache Spark is a powerful, multi-purpose execution engine for big data that enables rapid application development and high performance. Jim Scott's in-depth ebook goes beyond the first steps and shows how to get this technology into production on Hadoop.

The ebook features guides and tutorials covering a wide range of use cases and topics, along with whiteboard videos, infographics, and more. Start reading now and learn about:

  • What Spark is and isn't
  • How Spark and Hadoop work together
  • How Spark works in production
  • In-depth use cases for Spark (including working code)