DEV 3100 - HBase for Analysts and Architects


VIRTUAL ONLY - 6.5 Hours Online

About this course

This course introduces the concepts of NoSQL technologies, HBase architecture, schema design, performance tuning, bulk-loading of data, and the storing of complex data structures to Data Analysts, DBAs, and developers.

Prerequisites for Success in the Course

Review the following prerequisites carefully and decide if you are ready to succeed in this programming-oriented course. The Instructor will move forward with lab exercises, assuming that you have mastered the skills listed below.

  • Required:
    • Basic Linux knowledge, including familiarity with basic command-line options such a mv, cp, cd, ls, ssh, and scp
    • Access to, and the ability to use, a laptop with a terminal program installed (such as terminal on the Mac, or PuTTY and WinSCP Windows).
  • Recommended:
  • Optional: Basic Hadoop and database knowledge

Right for you?

  • For Data Analysts and Enterprise Architects interested in designing HBase application schemas
  • This is a non-programming course


This course helps prepare you for the MapR Certified HBase Developer (MCHBD) certification exam.


Included in this 1-day course are

  • Access to a multi-node Amazon Web Services (AWS) cluster
  • Slide Guide pdf
  • Lab Guide pdf
  • Lab Code
Day 1
  • Introduction to HBase
    • Differentiate between RDBMS and HBase
    • Identify typical HBase Use Cases
  • HBase Data Model
    • Describe the HBase data model and data model components
    • Describe how logical data model maps physical storage on disk
    • Use data model operations
    • Create an HBase table
  • HBase Architecture
    • Identify the components of an HBase cluster
    • Describe how the HBase components work together
    • Describe how regions work and their benefits
    • Define the function of minor and major compactions
    • Describe Region Server splits
    • Describe how HBase handles fault tolerance
    • Differentiate MapRDB from HBase
  • Basic Schema Design
    • List the elements of schema design
    • Design row keys for data access patterns
    • Design table shape & column families for data access patterns
    • Define column family properties
    • Design schema for given scenario
  • Design Schemas for Complex Data Structures
    • Transition from relational model to HBase
    • Use intelligent keys
    • Use secondary indexes or Lookup tables
    • Design for other complex data structures
    • Evolve schemas over time
  • Using Hive to Query HBase
    • Use Hive to query HBase/MapR tables