Apache Spark Sheet The Essential Apache Spark Developer Cheat Sheet

Apache Spark is a powerful open source processing engine built around speed, ease of use, and sophisticated analytics. This developer cheat sheet dives into resources for Spark developers, and includes a list of Spark transformations, actions, and persistence methods. You’ll also find information on broadcast and accumulator variables, links to MLlib reference sites, and other handy references to help you get up to speed using Spark for data exploration, analysis, and building big data applications.