Getting Started with Apache Spark
From Inception to Production
Apache Spark is a powerful, multi-purpose execution engine for big data that enables rapid application development and high performance. Jim Scott's in-depth ebook goes beyond the first steps and shows how to take Spark into production on Hadoop.
The ebook features guides and tutorials on a wide range of use cases and topics, along with whiteboard videos, infographics, and more. Start reading now and learn about:
- What Spark is and isn't
- How Spark and Hadoop work together
- How Spark works in production
- In-depth use cases for Spark (including working code)