MapR allows you to do more with Hadoop by combining Apache Hadoop with architectural innovations focused on operational excellence in the data center. MapR is the only distribution that is built from the ground up for business-critical production applications.

MapR is a complete distribution for Apache Hadoop that packages more than a dozen projects from the Hadoop ecosystem to provide a broad set of Big Data capabilities for the user. Projects such as Apache HBase, Hive, Pig, Mahout, Flume, Avro, Sqoop, Oozie and Whirr are included along with non-Apache projects such as Cascading and Impala.

MapR supports these Apache projects on an advanced technology platform that not only provides enterprise-grade features such as high availability, disaster recovery, security, and full data protection but also allows Hadoop to be easily accessed as traditional network attached storage (NAS) with read-write capabilities.


Hadoop Production Success

MapR was engineered for the data center with IT operations in mind. MapR enables Hadoop to serve business-critical needs for Big Data applications that cannot afford to lose data, must run on a 24x7 basis, require immediate recovery from node and site failures – all with a smaller data center footprint. MapR supports these capabilities for the broadest set of Hadoop applications from batch analytics to interactive querying and real-time streaming.

Architecture Matters

MapR delivers business-critical production success because of the advanced architecture of the MapR Data Platform, which is 100% binary compatible with the Apache Hadoop distributed file system (HDFS) to ensure plug-and-play compatibility and no vendor lock-in. The MapR Data Platform is a modern, true read-write capable, NFS-mountable distributed file-system written in C++ that directly accesses storage hardware – dramatically improving performance and ease of administration. Unlike other Hadoop distributions that require separate clusters for multiple applications, the data platform is built to process both distributed files and database tables in one unified layer – an engineering feat in its own right. This enables organizations to support both operational (e.g., HBase) and analytic apps (e.g., Apache Drill, Hive, or Impala) on one cluster, significantly reducing costs as you grow your Hadoop deployment.

Open Source Commitment

Along with the architectural innovations that provide business critical success, MapR constantly tests, validates and hardens the core Apache projects before including in its distribution. MapR typically upgrades to the latest versions of the newly released Apache OSS projects within 90 days of their release, and monthly releases to ensure you always have the latest innovations. MapR also makes code and libraries available through GitHub and Maven repositories. In addition, MapR also contributes significantly to open-source community through contributions to projects such as Apache Mahout and Apache Drill.

One Platform for Big Data Applications

MapR provides an enterprise data hub for Big Data with Hadoop at its center. Hadoop provides a general purpose platform for a variety of workloads including data storage, integration from multiple sources, database operations, analytics, search, and real-time stream processing. MapR provides the most advanced distribution for Hadoop with native file and table support and dynamic workload management to support more applications with a smaller data center footprint and lowest TCO.