Вы находитесь на странице: 1из 4

Define replication factor.

Ans. The data in a node undergoes replication. The data is copied from one
node to another to ensure fault tolerance. The replication factor is the number
of copies of the data that are sent to different nodes.

Define replication strategy.

Ans. These strategies define the technique how the replicas are placed in a
cluster. There are mainly two types of Replication Strategy:
Simple strategy
Network Topology Strategy

Define Simple Strategy.

Ans. It is a strategy for replication. It places the replicas in a clockwise


order of the nodes without considering Data Centre location.

Describe Network Topology Strategy.

Ans. It is a type of Replication strategy in which a cluster is deployed across


Multiple Datacentres.

Name some features of Apache Cassandra.

 High Scalability
 High fault tolerant
 Flexible Data storage
 Easy data distribution
 Tunable Consistency
 Efficient Wires
 Cassandra Query Language

Name different types of NoSQL database.

 Key-Value Store – It has a Big Hash Table of keys & values {Example- Riak,
Amazon S3 (Dynamo)}

 Document-based Store- It stores documents made up of tagged elements.


{Example- CouchDB}

 Column-based Store- Each storage block contains data from only one
column, {Example- HBase, Cassandra}
 Graph-based-A network database that uses edges and nodes to represent
and store data. {Example- Neo4J}

Define NoSQL Database.

Ans. It is a database that deals with the non-relational database. It is referred


to as a Not only SQL database. It provides a mechanism to store and retrieve
the different type of data that includes images, sounds etc.

Give key features of any NoSQL database.


Ans. The features of NoSQL Database are:
 Schema Agnostic
 AutoSharding and Elasticity
 Highly Distributable
 Easily Scalable
 Integrated Caching

Define a column family.

Ans. A keyspace contains many column families. They basically represent the
table. Furthermore, it basically defines titles or application specific tables.

What is a Keyspace?

Ans. Keyspace is the outermost storage unit in a node. It contains many


column families.

Give some advantages of Cassandra.

Ans. These are the advantages if Cassandra:


Since data can be replicated to several nodes, Cassandra is fault tolerant.
Cassandra can handle a large set of data.
Cassandra provides high scalability.

Define commit log.

Ans. It is a mechanism that is used to recover data in case the database


crashes. Every operation that is carried out is saved in the commit log.
Using this the data can be recovered.

Define composite key.


Ans. Composite keys include row key and column name. They are used to
define column family with a concatenation of data of different type.

Describe Memtable.

Ans. Memtables are basically a cache space containing content in key and
column format.

Define SSTable.

Ans. SSTable is Sorted String Table. It is a data file that accepts regular Mem
Tables.

Give a list of Collection data type in Cassandra.

Ans. There are three type of collection data type:


 Set
 List
 Map

Tell the purpose of Bloom Filter in Cassandra.

Ans. When the user tries to execute any disk I/O for any data, bloom filter
checks the existence of the data.

How can a user create a Keyspace in Cassandra.

Ans. The user can use the following syntax to create the keyspace.
1. CREATE KEYSPACE <keyspace name>
2. WITH REPLICATION={'class':'<Replication Strategy>','replication factor':'<No. of replicas>'}
3. AND DURABLE_WRITES='<TRUE/FALSE>';

Define durable writes.


Ans. It commands Cassandra whether to use commit log or not for current
keyspace. The default value is True.

Define Node tool Utility.

Ans. It is a command-line utility that is used for administration and


monitoring.
What is difference between Column and Super Column?

Both elements work on the principle of tuple having name and value.
However, the former‘s value is a string while the value in latter is a Map of
Columns with different data types.
Unlike Columns, Super Columns do not contain the third component of
timestamp.

Explain what is Memtable in Cassandra?


 Cassandra writes the data to a in memory structure known as
Memtable
 It is an in-memory cache with content stored as key/column
 By key Memtable data are sorted
 There is a separate Memtable for each ColumnFamily, and it retrieves
column data from the key

Explain how Cassandra delete Data?


SSTables are immutable and cannot remove a row from SSTables. When a
row needs to be deleted, Cassandra assigns the column value with a special
value called Tombstone. When the data is read, the Tombstone value is
considered as deleted.
Mention what are the main components of Cassandra Data Model?
The main components of Cassandra Data Model are
 Cluster
 Keyspace
 Column
 Column & Family

Вам также может понравиться