Course Outline

  • Section 1: Introduction to Big Data & NoSQL
    • Big Data ecosystem
    • NoSQL overview
    • CAP theorem
    • When to use NoSQL
    • Columnar storage
    • HBase and NoSQL
  • Section 2: HBase Intro
    • Concepts and Design
    • Architecture (HMaster and RegionServer)
    • Data integrity
    • HBase ecosystem
    • Lab: Exploring HBase
  • Section 3: HBase Data model
    • Namespaces, Tables, and Regions
    • Rows, columns, column families, and versions
    • HBase Shell and Admin commands
    • Lab: HBase Shell
  • Section 4: Accessing HBase using the Java API
    • Introduction to the Java API
    • Read/Write path
    • Time Series data
    • Scans
    • MapReduce
    • Filters
    • Counters
    • Coprocessors
    • Labs (multiple): Using the HBase Java API to implement time series, MapReduce, filters, and counters
  • Section 5: HBase schema Design: Group session
    • Real-world use cases are presented to students
    • Students work in groups to develop design solutions
    • Discussion, critique, and learning from multiple designs
    • Labs: Implementing a scenario in HBase
  • Section 6: HBase Internals
    • Understanding HBase under the hood
    • MemStore/HFile/WAL
    • HDFS storage
    • Compactions
    • Splits
    • Bloom Filters
    • Caches
    • Diagnostics
  • Section 7: HBase installation and configuration
    • Hardware selection
    • Install methods
    • Common configurations
    • Lab: Installing HBase
  • Section 8: HBase ecosystem
    • Developing applications using HBase
    • Interacting with other Hadoop stack components (MapReduce, Pig, Hive)
    • Frameworks around HBase
    • Advanced concepts (coprocessors)
    • Labs: Writing HBase applications
  • Section 9: Monitoring and Best Practices
    • Monitoring tools and practices
    • Optimizing HBase
    • HBase in the cloud
    • Real-world use cases of HBase
    • Labs: Checking HBase vitals

Requirements

  • Familiarity with the Java programming language
  • Comfort with the Linux environment, including navigating the command line and editing files using vi or nano
  • A Java IDE such as Eclipse or IntelliJ

Lab environment:

A functional HBase cluster will be provided for student use. Access to the cluster requires an SSH client and a web browser.

Zero Install: There is no need to install HBase software on your personal machines.

Duration: 21 hours
