Академический Документы
Профессиональный Документы
Культура Документы
Ans. The data in a node undergoes replication. The data is copied from one
node to another to ensure fault tolerance. The replication factor is the number
of copies of the data that are sent to different nodes.
Ans. These strategies define the technique how the replicas are placed in a
cluster. There are mainly two types of Replication Strategy:
Simple strategy
Network Topology Strategy
High Scalability
High fault tolerant
Flexible Data storage
Easy data distribution
Tunable Consistency
Efficient Wires
Cassandra Query Language
Key-Value Store – It has a Big Hash Table of keys & values {Example- Riak,
Amazon S3 (Dynamo)}
Column-based Store- Each storage block contains data from only one
column, {Example- HBase, Cassandra}
Graph-based-A network database that uses edges and nodes to represent
and store data. {Example- Neo4J}
Ans. A keyspace contains many column families. They basically represent the
table. Furthermore, it basically defines titles or application specific tables.
What is a Keyspace?
Describe Memtable.
Ans. Memtables are basically a cache space containing content in key and
column format.
Define SSTable.
Ans. SSTable is Sorted String Table. It is a data file that accepts regular Mem
Tables.
Ans. When the user tries to execute any disk I/O for any data, bloom filter
checks the existence of the data.
Ans. The user can use the following syntax to create the keyspace.
1. CREATE KEYSPACE <keyspace name>
2. WITH REPLICATION={'class':'<Replication Strategy>','replication factor':'<No. of replicas>'}
3. AND DURABLE_WRITES='<TRUE/FALSE>';
Both elements work on the principle of tuple having name and value.
However, the former‘s value is a string while the value in latter is a Map of
Columns with different data types.
Unlike Columns, Super Columns do not contain the third component of
timestamp.