Apache Drill is a distributed system for interactive ad-hoc analysis of large-scale datasets. Designed to handle up to petabytes of data spread across thousands of servers, the goal of Drill is to respond to ad-hoc queries in a low-latency manner. In this article, Hausenblas and Nadeau introduce Drill's architecture, discuss its extensibility points, and put it into the context of the emerging offerings in the interactive analytics realm.
With Apache Drill, the open source community has taken an important step to make the innovations introduced by Google's Dremel available to a large audience, under a free license. Apache Drill represents a huge leap forward for organizations looking to augment their Big Data processing with interactive queries across massive data sets.