MapR 5.0 Extends Hadoop for New Class of Real-Time Applications
San Jose, CA

Auto-synchronizes storage, database and search within and across data centers

MapR Technologies, Inc., provider of the top-ranked distribution for Apache™ Hadoop®, today unveiled at Hadoop Summit version 5.0 of the MapR Distribution including Hadoop, extending its lead in real-time Hadoop, security, and self-service data exploration and agility.

MapR 5.0 is architected for processing big and fast data on a single data platform that enables a new class of real-time applications. Organizations are increasingly deploying multiple applications on a single Hadoop cluster, with 18% of MapR customers deploying more than 50 separate applications on a single cluster. The latest MapR release auto-synchronizes storage, database and search indices to support complex, real-time applications that increase revenue, reduce operational costs and mitigate risk. MapR 5.0 also includes comprehensive security auditing, Apache Drill support, and the latest Hadoop 2.7 and YARN features.

“With the newest release of the MapR Distribution, we continue to lead the market in delivering reliable and real-time Hadoop to the enterprise,” said Anil Gadre, senior vice president of product management, MapR Technologies. “We help enable the ‘as-it-happens’ business where organizations can shorten their data-to-action cycle. Our product is deployed at customer sites and industries that are highly regulated due to their use of sensitive data, which proves that MapR is architected for enterprise-grade security requirements.”

“Designed as a large-scale batch data analysis system, Hadoop is not often associated with operational analytics or transaction processing,” said Carl W. Olofson, research vice president, data management software research, IDC. “Hadoop adds tremendous value for decision management at the strategic and operational levels, but still is emerging as a framework for making tactical decisions ‘in the moment.’ With Hadoop innovations, such as those in MapR 5.0, happening every day, enterprises should consider using Hadoop as a ‘Decision Data Platform’ that functions as a single platform for handling both live operational data and real-time analytics.”

The MapR Distribution including Hadoop, version 5.0 feature overview:

  • Extends the MapR real-time, reliable data transport framework, used in the MapR-DB Table Replication capability, to deliver and synchronize data in real time to external compute engines. The first supported external compute engine is Elasticsearch, which allows full-text search indexes to be kept synchronized automatically without writing custom code.
  • Adds support for Hadoop 2.7, including YARN 2.7, to enable new features such as YARN application rolling upgrades, which complement the platform-level rolling upgrades already supported by MapR, as well as integrated Docker container support.
  • Enhances MapR industry-leading data governance and security:
    • Comprehensive auditing for all data accesses via log files in JSON format, enabling extensive reporting and validation and quick analysis with Apache Drill. This adds to the trusted security capabilities MapR already provides for authentication and authorization.
    • Support for Apache Drill 1.x, including Drill Views. This innovative feature delivers secure access to field-level data in files to ensure only authorized data can be analyzed by specific analysts. Analysts can also be given data governance privileges that let them share their data sets with other analysts, an important capability for retaining agility in a big data environment.

“We are pleased to be working with MapR on integrating their real-time delivery framework with Elasticsearch,” said Jobi George, global partner director, Elastic. “Customers want search indexes automatically synchronized with the latest data updates.  The MapR architecture makes this easier for application developers who need to let their end users search for data almost immediately after it is updated.”


Version 5.0 of the MapR Distribution will be available in 30 days.

MapR is a platinum sponsor showcasing its top-ranked, utility-grade Hadoop Distribution architected for real-time, data-centric enterprises this week at Hadoop Summit in booth #P10.

About MapR Technologies

MapR enables organizations to create disruptive advantage and long-term value from their data with the industry’s only Converged Data Platform, which delivers distributed processing, real-time analytics, and enterprise-grade requirements across cloud and on-premises environments, while leveraging the significant ongoing development in open source technologies including Spark and Hadoop. Organizations with the most demanding production needs, including sub-second response for fraud prevention, secure and highly available data-driven insights for better healthcare, petabyte analysis for threat detection, and integrated operational and analytic processing for improved customer experiences, run on MapR. A majority of customers achieves payback in fewer than 12 months and realizes greater than 5X ROI. MapR ensures customer success through world-class professional services and with free on-demand training that 50,000 developers, data analysts and administrators have used to close the big data skills gap. Amazon, Cisco, Google, HPE, Microsoft, SAP, and Teradata are part of the worldwide MapR partner ecosystem. Investors include Future Fund, Google Capital, Lightspeed Venture Partners, Mayfield Fund, NEA, Qualcomm Ventures and Redpoint Ventures. Connect with MapR on LinkedIn and Twitter.

Media Contacts

Beth Winkowski
MapR Technologies, Inc.
(978) 649-7189

Kim Pegnato
MapR Technologies, Inc.
(781) 620-0016