David is a solutions architect with MapR Technologies, specializing in cloud deployments and performance issues. He joined MapR in 2012 from HP's Enterprise Servers organization, where he had worked with strategic software partners for over 15 years.
I’m very pleased to announce the release of a custom EMR bootstrap action to deploy Apache Drill on a MapR cluster. MapR is the only commercial Hadoop distribution available for Amazon’s Elastic MapReduce service (EMR), and this addition allows EMR users to easily deploy and evaluate the powerful Drill query engine.
Clush is an open source tool that allows you to execute commands in parallel across the nodes in your cluster. This blog describes how to install clush and use it to generate a report detailing the configuration of every node in your cluster (what MapR support classifies as a “cluster audit”).
Like most commercial cloud platforms, Google Cloud offers a range of different storage options. The most common options are persistent disk volumes attached to Virtual Machine instances or object store buckets accessed via the Google Storage APIs. Until recently, disk volumes were the only supported storage for Hadoop deployments in the Google Cloud. That situation changed for the better with the release of the Google Cloud Storage Connector for Hadoop.
Blog Sign Up
Sign up and get the top posts from each week delivered to your inbox every Friday!