MapR Performance Benchmark Exceeds 100 Million Data Points Per Second Ingest

Breakthrough results on small cluster opens the door for new IoT applications and real-time data analysis

MapR Technologies, Inc., provider of the top-ranked distribution for Apache™ Hadoop®, today announced at the Tableau Conference, breakthrough performance results achieved using standard open source software, OpenTSDB, running on the MapR Distribution.  Using only four-nodes of a 10-node cluster, the MapR Distribution with its in-Hadoop NoSQL database, MapR-DB, ingested over 100 million data points per second.

By accelerating OpenTSDB performance by 1,000 times on such a small cluster, MapR opens the doors to cost-effectively manage massive volumes of data and enable new applications such as Internet of Things (IoT) and other real-time data analysis applications, including industrial monitoring of manufacturing facilities, predictive maintenance of distributed hardware systems and datacenter monitoring.

“The accelerated performance for OpenTSDB validates the differentiated efficiency and scale that MapR brings to the table,” said Ted Dunning, chief application architect for MapR Technologies.  “OpenTSDB is a widely used database intended to store and analyze time-series data.  Originally designed for only data center monitoring, poor ingest performance had limited the expansion of its use. This benchmark demonstrates a viable option for new applications, such as IoT and other real-time data-analysis applications, using OpenTSDB running on MapR.”

According to estimates from Cisco, there will be approximately 50 billion devices connected by 2020.  These IoT devices include sensors and other embedded data capturing devices that are communicating information continuously and pushing the boundaries of traditional data management platforms.  Healthcare, manufacturing and utilities are examples of industries where decisions based on continuous data analysis can improve business operations.  These devices will be phoning home and sending data.  Time series databases will be critical to store and analyze these data sets.

MapR has published the details required to replicate these tests in the MapR App Gallery and on GitHub.


About MapR Technologies

About MapR Technologies

MapR enables organizations to create disruptive advantage and long-term value from their data with the industry’s only Converged Data Platform, which delivers distributed processing, real-time analytics, and enterprise-grade requirements across cloud and on-premise environments–while leveraging the significant ongoing development in open source technologies including Spark and Hadoop. Organizations with the most demanding production needs, including sub-second response for fraud prevention, secure and highly available data-driven insights for better healthcare, petabyte analysis for threat detection, and integrated operational and analytic processing for improved customer experiences, run on MapR. A majority of customers achieves payback in fewer than 12 months and realizes greater than 5X ROI. MapR ensures customer success through world-class professional services and with free on-demand training that 50,000 developers, data analysts and administrators have used to close the big data skills gap. Amazon, Cisco, Google, HPE, Microsoft, SAP, and Teradata are part of the worldwide MapR partner ecosystem. Investors include Future Fund, Google Capital, Lightspeed Venture Partners, Mayfield Fund, NEA, Qualcomm Ventures and Redpoint Ventures. Connect with MapR on LinkedIn, and Twitter.

Media Contacts

Beth Winkowski
MapR Technologies, Inc.
(978) 649-7189

Kim Pegnato
MapR Technologies, Inc.
(781) 620-0016