Академический Документы
Профессиональный Документы
Культура Документы
BIG DATA ?
• It is a term for collection of data sets so large & complex that becomes difficult
to process using traditional data processing applications.
Big Data
Activities
Normal
Processing
Capabilities
Content Volume
• Social Networking sites like Facebook, LinkedIn, Twitter etc.,
• Mobile device data such as Text messages, Calls data, Apps data etc.,
“Big Data” Sources • Internet Transactions like e-Commerce websites, banking activities etc
• Not Scalable
Business Master data Transactions
Strategy
Business Processes
OLTP
Operations
OLAP
Business Data
Warehouse
Data Mining
Analytics
5 Vs
• Volume
• Velocity
Big Data
Characteristics • Variety
• Value
• Veracity
Apache Hadoop is a framework that allows distributed processing
of large datasets across clusters of commodity of computers using
a simple programming model
Goals of HDFS:
Access to streaming
data
Accommodation of
large data sets
Portability
Phases in Big Data Testing
Performance testing