HBase Support
• Relational databases such as MS SQL and Oracle have been the mainstay
of businesses so far.
• These RDBMSs have handled relational and transactional structured
data efficiently.
• However, the advent of web-based applications and social networks has
made much of the data unstructured.
• This has changed the landscape for data management.
• Hence, RDBMSs have ceased to be a natural fit, as the data is no longer
confined to a particular structure.
Need for NoSQL
{
"1" : "x",
"aaaaa" : "y",
"aaaab" : "world",
"xyz" : "hello",
"zzzzz" : "woot"
}
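The keys in the map above are listed in lexicographic order, which is how HBase keeps row keys. A minimal Python sketch of that idea follows, assuming plain ASCII string keys (HBase itself compares raw bytes); the `scan` helper is a hypothetical name used only for illustration, not an HBase API.

```python
from bisect import bisect_left

# The same key/value pairs as the map above, deliberately inserted out of order.
data = {"xyz": "hello", "1": "x", "zzzzz": "woot", "aaaab": "world", "aaaaa": "y"}

# Sorting the keys reproduces the order shown in the map.
sorted_keys = sorted(data)

def scan(start, stop):
    """Return (key, value) pairs with start <= key < stop, in key order."""
    lo = bisect_left(sorted_keys, start)
    hi = bisect_left(sorted_keys, stop)
    return [(k, data[k]) for k in sorted_keys[lo:hi]]

print(sorted_keys)
print(scan("aaaaa", "b"))
```

Because the keys are sorted, a range of adjacent keys can be read with two binary searches and one slice rather than a full pass over the map.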
HBase Data Model: Row and Column Families
{
"1" : {
"A" : "x"
},
"aaaaa" : {
"A" : "y"
},
"aaaab" : {
"A" : "world"
},
"xyz" : {
"A" : "hello"
}
}
Each top-level key/map pair is called a row. "A" is called a column family.
HBase Data Model: Columns and Column Families
{
"1" : {
"A" : {
"foo" : "x"
}
},
"aaaaa" : {
"A" : {
"foo" : "y"
}
},
"aaaab" : {
"A" : {
"foo" : "world"
}
},
"xyz" : {
"A" : {
"foo" : "hello"
}
}
}
"foo" is a column of column family "A".
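The nested map can be read as row key, then column family, then column. The following Python sketch models that lookup with ordinary dictionaries; `get_cell` is a hypothetical helper for illustration and not part of any HBase client library.

```python
# The nested map from the slide: row key -> column family -> column -> value.
table = {
    "1":     {"A": {"foo": "x"}},
    "aaaaa": {"A": {"foo": "y"}},
    "aaaab": {"A": {"foo": "world"}},
    "xyz":   {"A": {"foo": "hello"}},
}

def get_cell(table, row_key, family, column):
    """Look up one value by row key, column family, and column; None if absent."""
    return table.get(row_key, {}).get(family, {}).get(column)

print(get_cell(table, "aaaab", "A", "foo"))
print(get_cell(table, "xyz", "A", "bar"))
```

A missing row, family, or column simply yields `None` here, which mirrors the idea that rows in the same table need not all carry the same columns.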
HBase Data Model: Logical Representation
HBase Architecture: Regions and Region Servers
• HBase tables are divided horizontally by row key range into what are called “regions.”
• A region contains all rows in the table between the region’s start key and end key.
• Regions are assigned to nodes in the cluster, which run on commodity machines, and are managed
by region server daemons. These daemons serve the data reads and writes.
• A region server can serve about 1000 regions.
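Splitting a table by row key range can be sketched as a binary search over region start keys. The example below is a minimal illustration in Python, assuming each region covers keys from its start key (inclusive) up to the next region's start key (exclusive); the split points and region names are made up for the example.

```python
from bisect import bisect_right

# Hypothetical split points: region i covers [start_keys[i], start_keys[i + 1]).
# The empty string marks the start of the first region.
region_start_keys = ["", "g", "n", "t"]
region_names = ["region-0", "region-1", "region-2", "region-3"]

def find_region(row_key):
    """Return the name of the region whose key range contains row_key."""
    idx = bisect_right(region_start_keys, row_key) - 1
    return region_names[idx]

for key in ["1", "aaaab", "g", "xyz"]:
    print(key, "->", find_region(key))
```

Since every row key falls into exactly one range, a read or write for a given key only ever needs to reach the one region server that hosts that region.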
HBase Architecture: HBase Master
• The META table (RowKey, RegionServer) holds the location of regions in the cluster.
• ZooKeeper stores the location of the META table.
• The client gets from ZooKeeper a reference to the region server that hosts the META table.
• It then queries the META table to find the region server for the row key it wants to access.
• It caches this information along with the META table location.
• The client then reads the data from that region server.
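The lookup steps above can be simulated with in-memory structures. This Python sketch is only a model of the flow, not HBase client code: the `ZOOKEEPER`, `META_ROWS`, and `REGION_SERVERS` dictionaries, the server names, and the `Client` class are all hypothetical stand-ins.

```python
# Hypothetical cluster state, for illustration only.
ZOOKEEPER = {"meta-region-server": "rs-meta"}
# META rows as (region start key, region end key, region server); None = no upper bound.
META_ROWS = [("", "g", "rs-1"), ("g", "n", "rs-2"), ("n", None, "rs-3")]
REGION_SERVERS = {
    "rs-1": {"aaaab": "world"},
    "rs-2": {"hbase": "rocks"},
    "rs-3": {"xyz": "hello"},
}

def covers(region, row_key):
    """True if row_key lies in the region's [start, end) key range."""
    start, end, _server = region
    return start <= row_key and (end is None or row_key < end)

class Client:
    def __init__(self):
        self.meta_server = None    # cached META table location
        self.cached_regions = []   # cached META rows already looked up
        self.meta_queries = 0      # counts trips to the META table

    def locate(self, row_key):
        # Use the cache first.
        for region in self.cached_regions:
            if covers(region, row_key):
                return region[2]
        # Ask ZooKeeper where META lives (only once), then query META.
        if self.meta_server is None:
            self.meta_server = ZOOKEEPER["meta-region-server"]
        self.meta_queries += 1
        region = next(r for r in META_ROWS if covers(r, row_key))
        self.cached_regions.append(region)
        return region[2]

    def get(self, row_key):
        # Read the row from the region server that hosts its region.
        return REGION_SERVERS[self.locate(row_key)].get(row_key)

client = Client()
print(client.get("xyz"), client.meta_queries)
print(client.get("zzzzz"), client.meta_queries)
print(client.get("aaaab"), client.meta_queries)
```

The second read falls in a region that is already cached, so it does not query the META table again; only a key in a region not seen before triggers another lookup.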
Topic 4: Region Server Components
HBase Writes
MemStore
This concludes the lesson “HBase Support.”