Вы находитесь на странице: 1из 6

HBase and Apache Phoenix

Duration: 5 Days

Contents

Day 1
Introduction
The Problem with Relational Database Systems
Nonrelational Database Systems, Not-Only SQL or NoSQL?
Dimensions
Scalability
Database (De-)Normalization
Building Blocks
Backdrop
Tables, Rows, Columns, and Cells
Auto-Sharding
Storage API
Implementation
HBase: The Hadoop Database
History
Nomenclature
Client API: The Basics
CRUD Operations
Put Method
Get Method
Delete Method
Batch Operations
Row Locks
Scans
Miscellaneous Features
The HTable Utility Methods
The Bytes Class
Client API: Advanced Features
Introduction
The ResultScanner Class
Caching Versus Batching
Filters
Introduction to Filters
Comparison Filters
Dedicated Filters
Decorating Filters
FilterList
Custom Filters
Filters
Counters
Introduction to Counters
Single Counters
Multiple Counters
Coprocessors
Introduction to Coprocessors
The Coprocessor Class
Coprocessor Loading
The RegionObserver Class
The MasterObserver Class
Endpoints
HTablePool
Connection Handling

Day 2
Client API: Administrative Features
Schema Definition Tables
Table Properties
Column Families
HBaseAdmin
Basic Operations
Table Operations
Schema Operations
Cluster Operations
Cluster Status Information
Available Clients
Introduction to REST
Native Java
REST
Batch Clients
Architecture
Seek Versus Transfer
B+ Trees
Log-Structured Merge-Trees
Storage
Overview
Write Path
Files
HFile Format
KeyValue Format
Write-Ahead Log
Overview
HLog Class
HLogKey Class
WALEdit Class
LogSyncer Class
LogRoller Class
Replay
Durability
Read Path
Region Lookups
The Region Life Cycle
ZooKeeper
Replication
Life of a Log Edit
Internals
Advanced Usage
Key Design
Concepts
Tall-Narrow Versus Flat-Wide Tables
Partial Key Scans
Pagination
Time Series Data
Time-Ordered Relations
Advanced Schemas
Secondary Indexes
Search Integration
Transactions
Bloom Filters
Versioning
Implicit Versioning
Custom Versioning
Day 3
Performance Tuning
Garbage Collection Tuning
Memstore-Local Allocation Buffer
Compression
Available Codecs
Verifying Installation
Enabling Compression
Optimizing Splits and Compactions
Managed Splitting
Region Hotspotting
Presplitting Regions
Load Balancing
Merging Regions
Client API: Best Practices
Configuration
Load Tests
Performance Evaluation
Apache Phoenix
Phoenix in the Hadoop Ecosystem
Phoenix’s Place in Big Data Systems
 Architecture
o Installing Apache Phoenix
o Installing Java
 Installing HBase
 Installing Apache Phoenix
o Start HBase
o Testing Your Phoenix Installation
 Hadoop Ecosystem and the Role of Phoenix
 Brief Description of Phoenix’s Key Features
o Transactions
o User-Defined Functions
o Secondary Indexes
o SkipScan
o Views
CRUD with Phoenix
Data Types in Phoenix
 Primitive Data Types
 Complex Data Types
Data Model
Steps in data modeling
Phoenix Write Path
Phoenix Read Path
Basic Commands
 HELP
 CREATE
 UPSERT
 SELECT
 ALTER.
 DELETE
 DESCRIBE
 LIST
Working with Phoenix API
Querying Data
Constraints NOT NULL Creating Tables Salted Tables
Dropping Tables
ALTERTables
 Adding Columns.
 Deleting or Replacing Columns
 Renaming a Column
Clauses
 LIMIT
 WHERE
 GROUP BY
 HAVING
 ORDER BY
Logical Operators
 AND
 OR
 IN
 LIKE
 BETWEEN.
Summary.

Day 4
Advanced Querying
Joins
Inner Join
Outer Join
 Left Outer Join.
 Right Outer Join.
 Full Outer Join
Grouped Joins Hash Join
Sort Merge Join.
Join Query Optimizations
 Optimizing Through Configuration Properties
 Optimizing Query.
Subqueries
 IN and NOT IN in Subqueries
 EXISTS and NOT EXISTS Clauses
 ANY, SOME, and ALL Operators with Subqueries
 UPSERT Using Subqueries.
Views.
 Creating Views.
 Dropping Views
Paged Queries
 LIMIT and OFFSET
 Row Value Constructor

Transactions
SQL Transactions.
Transaction Properties
 Atomicity
 Consistency
 Isolation
 Durability
Transaction Control
 COMMIT
 ROLLBACK
 SAVEPOINT
 SET TRANSACTION
Transactions in HBase
 Integrating HBase with Transaction Manager
 Components of Transaction Manager
 Transaction Lifecycle
 Concurrency Control
 Multiversion Concurrency Control
 Optimistic Concurrency Control.
Apache Tephra As a Transaction Manager.
Phoenix Transactions
 Enabling Transactions for Tables
 Committing Transactions
Transaction Limitations in Phoenix
Summary.
Advanced Phoenix Concepts
Secondary Indexes
 Global Index
 Local Index
 Covered Index
 Functional Indexes
 Index Consistency
User Defined Functions Writing Custom User Defined Functions
Phoenix Query Server
 Download.
 Installation
 Setup
 Starting PQS
 Client
 Usage
 Additional PQS Features
Apache Spark Integration
 Setup
 Reading and Writing Using Dataframe

Day 5
Tools & Tuning
Phoenix Tracing Server
 Trace
 Span
 Span Receivers
 Setup
Phoenix Bulk Loading
 Setup
 Gotchas
Advanced Phoenix
 Salted Tables
 Views
HBase-Phoenix Integration with Spark
 JDBC connectivity through PySpark
 CRUD operations from PySpark

Вам также может понравиться