MapR Redefines SQL-on-Hadoop with Apache Drill
San Jose, CA

Distributed ANSI SQL query engine for self-service data exploration and JSON analytics

MapR Technologies, Inc., provider of the top-ranked distribution for Apache™ Hadoop®, today announced the addition of Apache Drill 0.5 to the MapR Distribution including Hadoop. Bringing next-generation ANSI SQL to Hadoop, Apache Drill provides instant, self-service data exploration across multiple data sources including modern applications.

“Organizations want to provide access to data stored in Hadoop and NoSQL databases to a broader set of users with existing SQL analysis skills,” said Matt Aslett, research director, data platforms and analytics, 451 Research. "Apache Drill's ability to provide access to data in Hadoop without the need for centralized schemas and also to NoSQL datasets with complex data structures including nested and repeated fields differentiates it from traditional approaches to SQL-on-Hadoop."

Apache Drill provides the flexibility to immediately query complex data in native formats, such as schema-less data, nested data, and data with rapidly-evolving schemas, with minimal IT involvement.  Because SQL queries can run directly on various file formats, live data can be explored as it is coming in, versus spending weeks preparing and managing schemas and setting up ETL tasks.   Additionally, Apache Drill supports ANSI SQL so users can easily leverage their SQL skills and existing investments in business intelligence (BI) tools.

“The vision and innovation that the Apache Drill community has brought to the marketplace heralds a new era of data exploration,” said John Schroeder, CEO and cofounder of MapR Technologies.  “The agility to directly query self-describing data and the flexibility to process complex data types push the envelope in big data analysis and insight.  We are extremely excited by the potential of Drill to transform data-driven companies.”

Organizations that use Apache Drill benefit from:

  • High-performance analysis of data in its native format including self-describing data such as Parquet, JSON files and HBase tables
  • Direct querying of data in HBase tables without defining and maintaining a parallel/overlay schema in the Hive metastore
  • Intuitive SQL extensions to query and work with semi-structured/nested data, such as data from NoSQL stores like MongoDB and online REST APIs
  • Queries that simultaneously combine different Hadoop data sources such as files, HBase tables, and Hive tables

Developers and analysts can leverage existing SQL skillsets and BI tools to:

  • Minimize switching costs and the learning curve for users via the familiar ANSI SQL syntax
  • Continue using familiar BI/analytics tools such as Excel, Tableau and a host of others using standard ODBC/JDBC drivers
  • Enable ad-hoc/low-latency queries on existing Hive tables. Reuse Hive metadata, hundreds of file formats and user defined functions (UDFs) out of the box


Apache Drill 0.5 with the MapR Distribution including Hadoop is currently available.  To see Drill in action, download the software or find more information, visit here

Supporting Quotes

"As an IT organization serving thousands of Cisco internal users who know SQL, we are excited at the prospects of Apache Drill,” said Piyush Bhargava, distinguished engineer, IT, Cisco Systems.  “It fills in a key gap in our Hadoop architecture, opening up end users to explore data directly. Drill's unique approach to do interactive SQL queries without the need for extensions across a variety of data types brings together the familiarity of SQL with the flexibility of Hadoop on the scale of big data."

“We are very excited about the flexibility and performance Drill promises for ad-hoc data exploration,” said Scott Russman, director, software development at Solutionary. “Enabling additional business units to use our Hadoop big data and correlating across our other big data sources from a single ANSI SQL interface will be incredibly powerful and require very little retooling.  This will result in less custom application work and more efficient knowledge sharing throughout the organization.”

“We are excited to work with the Apache Drill community in integrating the latest SQL-on-Hadoop technology into the MicroStrategy Analytics Platform™,” said Paul Zolfaghari, president, MicroStrategy Incorporated.  “Apache Drill provides an interesting alternative for how we can connect our analytics platform to applications running in big data environments. MicroStrategy looks forward to testing the project in the coming weeks.”

“Drill enables new and efficient means for querying data stored in Hadoop,” said Dan Jewett, vice president of product management, Tableau Software. “We’re excited to see how customers take advantage of this technology in their environments.”

"Simba proudly developed the first enterprise-class ODBC drivers for Hadoop, NoSQL, and big data," noted Simba Technologies CEO Amyn Rajan. "We're excited to work again with MapR to mark the next milestone: introducing Drill with ODBC 3.8 connectivity. We share with MapR a commitment to open data on Hadoop since we first partnered three years ago. Now, Drill's support for schema-less data models, best-in-class connectivity, and dynamic SQL means enterprises can easily leverage big data across multiple platforms to their analytic tools, whether they're using Tableau, Excel, Lumira, or another application."



About MapR Technologies

About MapR Technologies

MapR enables organizations to create disruptive advantage and long-term value from their data with the industry’s only Converged Data Platform, which delivers distributed processing, real-time analytics, and enterprise-grade requirements across cloud and on-premise environments–while leveraging the significant ongoing development in open source technologies including Spark and Hadoop. Organizations with the most demanding production needs, including sub-second response for fraud prevention, secure and highly available data-driven insights for better healthcare, petabyte analysis for threat detection, and integrated operational and analytic processing for improved customer experiences, run on MapR. A majority of customers achieves payback in fewer than 12 months and realizes greater than 5X ROI. MapR ensures customer success through world-class professional services and with free on-demand training that 50,000 developers, data analysts and administrators have used to close the big data skills gap. Amazon, Cisco, Google, HPE, Microsoft, SAP, and Teradata are part of the worldwide MapR partner ecosystem. Investors include Future Fund, Google Capital, Lightspeed Venture Partners, Mayfield Fund, NEA, Qualcomm Ventures and Redpoint Ventures. Connect with MapR on LinkedIn, and Twitter.

Media Contacts

Beth Winkowski
MapR Technologies, Inc.
(978) 649-7189

Kim Pegnato
MapR Technologies, Inc.
(781) 620-0016