Featured Author

Jim Bates
Senior Systems Engineer, MapR

Jim brings over 12 years of experience in systems and support engineering to his role as a Senior Systems Engineer at MapR. He was previously a Senior Systems Engineer at WANdisco and, before that, at Spirent Communications, a multinational telecommunications test company. He also held various systems engineering roles at Mu Dynamics. Earlier in his career, Jim held engineering positions at Tellabs and Advanced Fibre Communications. He began his career as an officer in the US Army, where he designed and implemented mobile routing networks and jumped out of airplanes with Cisco routers to get the internet working over single-channel tactical satellite networks. In addition to his technical roles, Jim has worked as a house framer, a fireman, a janitor, and a paratrooper. His very first job was as a farmer and rancher, working alongside his father, and it was during that time that he learned his most important life lessons. When not working for MapR, Jim enjoys carpentry, welding, kayaking, hiking, and spending time with his family.

Jim holds a BS in Electrical Engineering from Texas A&M University (Kingsville).

Author's Posts

Posted on December 22, 2014 by Jim Bates

Over the last few releases, the options for how you store data in Hive have advanced in many ways. In this post, let’s take a look at how to determine which Hive table storage format is best for the data you are using. Starting with a basic table, we’ll create duplicate tables for each of the storage format options, and then compare queries and data compression. Just keep in mind that the goal of this post is to talk about ways of comparing table formats and compression options, not to define the fastest Hive setup for all things data. After all, the fun is in...

Posted on December 11, 2014 by Jim Bates

There are many great examples out there for using the Hive shell, as well as examples of ways to automate many of the animals in our Hadoop zoo. However, if you’re just getting started, or need something fast that won’t stay around long, then all you need to do is throw a few lines of code together with some existing programs in order to avoid re-inventing the workflow. In this blog post, I’ll share a few quick tips on using the Hive shell inside scripts. We’ll take a look at a simple script that needs to pull an item or count, and then look at two ways to use the Hive shell to get an...

Posted on November 21, 2014 by Jim Bates

One of the challenges with Hadoop is getting value out of it without having to learn all the new skill sets that you need to truly harness Hadoop’s power. The reality of using the MapR Distribution including Hadoop is… you don’t have to know Hadoop to use Hadoop! I recently came up against this again and thought I would share it here and hopefully make someone’s journey to their first Hadoop job a no-brainer.
