Академический Документы
Профессиональный Документы
Культура Документы
Problem analysis
Volume of data
o How much data is needed
o Is it a big data or a nano-data problem if so, what is the architecture
Velocity of data
o Can I do the model and application in batch model
o Model in batch and application in real time
o Model and application in real time
Veracity
o How many sources of data, how many are manually collected and how many are
machine collected
o Based on the above, is the data likely to need a lot of cleansing
Variety
o Is the data structured, unstructured?
o Specify all data elements
Variability
o Is the data socially generated and keeps changing the nature frequently or generated in
closed conditions
o Is there going to be a need for non-linear models
Solution analysis