Duration: 5 Days
Contents
Day 1
Introduction
The Problem with Relational Database Systems
Nonrelational Database Systems, Not-Only SQL or NoSQL?
Dimensions
Scalability
Database (De-)Normalization
Building Blocks
Backdrop
Tables, Rows, Columns, and Cells
Auto-Sharding
Storage API
Implementation
HBase: The Hadoop Database
History
Nomenclature
Client API: The Basics
CRUD Operations
Put Method
Get Method
Delete Method
Batch Operations
Row Locks
Scans
Miscellaneous Features
The HTable Utility Methods
The Bytes Class
Client API: Advanced Features
Introduction
The ResultScanner Class
Caching Versus Batching
Filters
Introduction to Filters
Comparison Filters
Dedicated Filters
Decorating Filters
FilterList
Custom Filters
Filters Summary
Counters
Introduction to Counters
Single Counters
Multiple Counters
Coprocessors
Introduction to Coprocessors
The Coprocessor Class
Coprocessor Loading
The RegionObserver Class
The MasterObserver Class
Endpoints
HTablePool
Connection Handling
Day 2
Client API: Administrative Features
Schema Definition
Tables
Table Properties
Column Families
HBaseAdmin
Basic Operations
Table Operations
Schema Operations
Cluster Operations
Cluster Status Information
Available Clients
Introduction to REST
Native Java
REST
Batch Clients
Architecture
Seek Versus Transfer
B+ Trees
Log-Structured Merge-Trees
Storage
Overview
Write Path
Files
HFile Format
KeyValue Format
Write-Ahead Log
Overview
HLog Class
HLogKey Class
WALEdit Class
LogSyncer Class
LogRoller Class
Replay
Durability
Read Path
Region Lookups
The Region Life Cycle
ZooKeeper
Replication
Life of a Log Edit
Internals
Advanced Usage
Key Design
Concepts
Tall-Narrow Versus Flat-Wide Tables
Partial Key Scans
Pagination
Time Series Data
Time-Ordered Relations
Advanced Schemas
Secondary Indexes
Search Integration
Transactions
Bloom Filters
Versioning
Implicit Versioning
Custom Versioning
Day 3
Performance Tuning
Garbage Collection Tuning
Memstore-Local Allocation Buffer
Compression
Available Codecs
Verifying Installation
Enabling Compression
Optimizing Splits and Compactions
Managed Splitting
Region Hotspotting
Presplitting Regions
Load Balancing
Merging Regions
Client API: Best Practices
Configuration
Load Tests
Performance Evaluation
Apache Phoenix
Phoenix in the Hadoop Ecosystem
Phoenix’s Place in Big Data Systems
Architecture
Installing Apache Phoenix
Installing Java
Installing HBase
Installing Apache Phoenix
Start HBase
Testing Your Phoenix Installation
Hadoop Ecosystem and the Role of Phoenix
Brief Description of Phoenix’s Key Features
Transactions
User-Defined Functions
Secondary Indexes
SkipScan
Views
CRUD with Phoenix
Data Types in Phoenix
Primitive Data Types
Complex Data Types
Data Model
Steps in data modeling
Phoenix Write Path
Phoenix Read Path
Basic Commands
HELP
CREATE
UPSERT
SELECT
ALTER
DELETE
DESCRIBE
LIST
Working with Phoenix API
Querying Data
Constraints
NOT NULL
Creating Tables
Salted Tables
Dropping Tables
ALTER Tables
Adding Columns
Deleting or Replacing Columns
Renaming a Column
Clauses
LIMIT
WHERE
GROUP BY
HAVING
ORDER BY
Logical Operators
AND
OR
IN
LIKE
BETWEEN
Summary
Day 4
Advanced Querying
Joins
Inner Join
Outer Join
Left Outer Join
Right Outer Join
Full Outer Join
Grouped Joins
Hash Join
Sort Merge Join
Join Query Optimizations
Optimizing Through Configuration Properties
Optimizing Query
Subqueries
IN and NOT IN in Subqueries
EXISTS and NOT EXISTS Clauses
ANY, SOME, and ALL Operators with Subqueries
UPSERT Using Subqueries
Views
Creating Views
Dropping Views
Paged Queries
LIMIT and OFFSET
Row Value Constructor
Transactions
SQL Transactions
Transaction Properties
Atomicity
Consistency
Isolation
Durability
Transaction Control
COMMIT
ROLLBACK
SAVEPOINT
SET TRANSACTION
Transactions in HBase
Integrating HBase with Transaction Manager
Components of Transaction Manager
Transaction Lifecycle
Concurrency Control
Multiversion Concurrency Control
Optimistic Concurrency Control
Apache Tephra As a Transaction Manager
Phoenix Transactions
Enabling Transactions for Tables
Committing Transactions
Transaction Limitations in Phoenix
Summary
Advanced Phoenix Concepts
Secondary Indexes
Global Index
Local Index
Covered Index
Functional Indexes
Index Consistency
User-Defined Functions
Writing Custom User-Defined Functions
Phoenix Query Server
Download
Installation
Setup
Starting PQS
Client
Usage
Additional PQS Features
Apache Spark Integration
Setup
Reading and Writing Using Dataframe
Day 5
Tools & Tuning
Phoenix Tracing Server
Trace
Span
Span Receivers
Setup
Phoenix Bulk Loading
Setup
Gotchas
Advanced Phoenix
Salted Tables
Views
HBase-Phoenix Integration with Spark
JDBC connectivity through PySpark
CRUD operations from PySpark