DEV 325 - HBase Schema Design

About this course

This course is aimed at data analysts, data architects, and application developers. Its goal is to enable you to design HBase schemas that follow established design guidelines. You will learn about the elements of schema design and how to design for data access patterns. The course takes an in-depth look at designing row keys, avoiding hot-spotting, and designing column families. It also covers how to transition from a relational model to an HBase model and explains the differences between tall tables and wide tables. Concepts are conveyed through lectures, hands-on labs, and analysis of scenarios.

Right for you?

  • For data analysts, data architects, and application developers

Are you ready?

Yes, if you have:
  • Taken DEV 320 – HBase Data Model & Architecture
  • Basic Hadoop knowledge (helpful but not required)
  • Prior experience with SQL (preferred but not required)
  • A Linux, Windows, or macOS computer with the MapR Sandbox installed. The lab exercises for this class include installing Hive components on a MapR cluster and executing various commands and sample programs.

What's next?

Certification

This course helps prepare you for the MapR Certified HBase Developer (MCHBD) certification exam.


Syllabus

Lesson 4:
Basic Schema Design
  • List the elements of schema design
  • Design row keys for data access patterns
  • Design table shape and column families for data access patterns
  • Define column family properties
  • Design a schema for a given scenario
Lesson 5:
Design Schemas for Complex Data Structures
  • Transition from relational model to HBase
  • Use intelligent keys
  • Use secondary indexes or lookup tables
  • Design for other complex data structures
  • Evolve schemas over time
Lesson 6:
Use Hive to Query HBase
  • Use Hive to query HBase/MapR tables