MapR Technologies and EMC Announce Technology Licensing Agreement for Next Generation Hadoop Distribution
San Jose, CA

Game-changing innovations bring unmatched performance, reliability, and manageability to Apache Hadoop
MapR Technologies, Inc. today announced a software licensing agreement with EMC Corporation (NYSE:EMC) in which MapR Technologies will be part of the recently announced EMC® Greenplum® HD Enterprise Edition, a 100 percent interface-compatible implementation of the Apache Hadoop software stack. The new EMC system will incorporate MapR Technologies' pre-integrated, tested and hardened distribution for Apache Hadoop.
"EMC is focused on delivering the best-in-class solutions for Big Data. We evaluated the various Hadoop software offerings and believe that MapR is the clear enterprise- class innovation leader. With MapR, we are able to provide an unmatched solution for high availability, fault tolerance, and enterprise-class support and service. Combined with the EMC Greenplum Database we will enable the co-processing of both structured and unstructured data within a single, seamless solution," said Scott Yara, Co-Founder of Greenplum and Vice President of Products, Data Computing Division, EMC.
Although a number of Hadoop distributions are available, they fail to address underlying customer concerns such as single points of failure, lack of snapshots and mirroring, and poor performance.
"This is a major advancement for Hadoop users everywhere. MapR's innovations coupled with EMC's big data analytics capabilities and service will allow more people to use the power of big data analytics and enable substantial market growth," said John Webster, Senior Analyst, Evaluator Group. "MapR has managed to innovate on performance, cost reduction, dependability and ease-of-use all at once. This marks a major shift for the Hadoop market."
MapR's innovations transform Hadoop into a dependable compute platform while also increasing performance. Specific MapR advances make Hadoop, easy, dependable and fast and include:

  • NFS direct access allows users to use the NFS protocol to simply load and access data directly in a Hadoop cluster and enables standard tools and utilities to work directly on data contained in Hadoop.
  • Heatmap user interface provides full cluster visibility and control.


  • All single points of failure are eliminated in the Hadoop stack
  • JobTracker High Availability ensures continuous job execution.
  • Distributed NameNode with High Availability addresses major reliability issue while also improving performance and scale.
  • Snapshots allow point-in-time data protection and recovery.
  • Mirroring for business continuity includes wide area replication support.


  • Significant speed and efficiency improvements result in faster execution with half the hardware required by other distributions.

"Today marks an exciting milestone as we announce our partnership with EMC and unveil the industry's best distribution for Apache Hadoop that will advance and grow the entire market," said John Schroeder, CEO and Co-Founder, MapR Technologies. "We listened to customers, partners and the community about where Hadoop needed major investment and addressed those areas by delivering breakthrough innovations."
About the Data Computing Division of EMC
EMC's Data Computing Division is driving the future of data warehousing and analytics with breakthrough products including the EMC Greenplum Data Computing Appliance, EMC Greenplum Database, EMC Greenplum Community Edition, EMC Greenplum HD – Enterprise ready Apache Hadoop, and EMC Greenplum Chorus™-the industry's first Enterprise Data Cloud platform. The division's products embody the power of open systems, cloud computing, virtualization and social collaboration-enabling global organizations to gain greater insight and value from their data than ever before possible.

MapR is a trademark of MapR Technologies, Inc. EMC and Greenplum are trademarks or registered trademarks of EMC Corporation in the U.S. and other countries. Apache Hadoop and Hadoop is a trademark of the Apache Software Foundation. All other trademarks are the property of their respective owners.

About MapR Technologies

About MapR Technologies

MapR enables organizations to create disruptive advantage and long-term value from their data with the industry’s only Converged Data Platform, which delivers distributed processing, real-time analytics, and enterprise-grade requirements across cloud and on-premise environments–while leveraging the significant ongoing development in open source technologies including Spark and Hadoop. Organizations with the most demanding production needs, including sub-second response for fraud prevention, secure and highly available data-driven insights for better healthcare, petabyte analysis for threat detection, and integrated operational and analytic processing for improved customer experiences, run on MapR. A majority of customers achieves payback in fewer than 12 months and realizes greater than 5X ROI. MapR ensures customer success through world-class professional services and with free on-demand training that 50,000 developers, data analysts and administrators have used to close the big data skills gap. Amazon, Cisco, Google, HPE, Microsoft, SAP, and Teradata are part of the worldwide MapR partner ecosystem. Investors include Future Fund, Google Capital, Lightspeed Venture Partners, Mayfield Fund, NEA, Qualcomm Ventures and Redpoint Ventures. Connect with MapR on LinkedIn, and Twitter.

Media Contacts

Beth Winkowski
MapR Technologies, Inc.
(978) 649-7189

Kim Pegnato
MapR Technologies, Inc.
(781) 620-0016