DA 440 - Apache Hive Essentials

register-html: 

Register

About this course

DA 440 is an introductory-level course designed for data analysts and developers. You will learn how Apache Hive fits in the Hadoop ecosystem, how to create and load tables in Hive, and how to query data using the Hive Query Language.

Are you ready?

  • Required:
    • Familiarity with a command-line interface, such as a Unix shell
    • Familiarity with RDBMS database tools, such as SQL
    • Access to, and the ability to use, a laptop with an internet connection and a terminal program installed (such as terminal on the Mac, or PuTTY on Windows).
  • Recommended:

Right for you?

  • For data analysts and developers interested in the data pipeline
  • For data scientists and business analysts who are familiar with SQL and want to use data on an HDFS
  • This is a programming course; you must have some programming experience to do the exercises

What's next?

Syllabus

Apache Hive Essentials
  • Hive in the Hadoop Ecosystem
    • Use cases of Hive
    • Steps in the data pipeline
  • Create and Load Data
    • Create databases, internal tables, external tables, and partitioned tables
    • Learn about data types and casting in Hive
    • Load data into tables and databases
  • Query and Manipulate Data
    • Query, sort, and filter data
    • Manipulate data with user-defined functions

       


Related Resources
SANDBOX

MapR Sandbox with Drill
Get started

BLOG

Advice from the front.
Read

Other Resources

Hive Documentation
Visit

Apache Hive Website
Visit

YOU MAY ALSO LIKE

On-demand Training
DA 410 - Apache Drill Essentials
Learn more

Instructor-led Training
DA 4000 - Apache Drill
Learn more