Open source commercial distributions for Hadoop fall under one of the three categories of innovations shown below. All distributions, as seen in the figure, are essentially packaging the same set of projects from the Apache Hadoop ecosystem. MapR, represented by the third category on the right, includes the latest revisions as well as innovations in a Converged Data Platform that integrates Hadoop and Spark, real-time database capabilities, and global event streaming with big data enterprise storage, for developing and running innovative data applications. Containerized applications can further leverage the MapR Persistent Application Client Containers to securely access and leverage MapR platform services (MapR-FS, MapR-DB, MapR Streams) as a persistent data store.
Hadoop Ecosystem Innovation