• This data comes in many forms and sizes, ranging from small data to
very big data.
• Any data that fits in RAM (main memory) is considered small data.
Small data is less than tens of GB.
• Any data that fits on a hard disk is considered medium data.
Medium data ranges from tens to thousands of GB.
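The size categories above can be sketched as a small helper. This is only an illustration: the slides give the boundaries as "tens" and "thousands" of GB, so the exact cut-offs of 10 GB and 1000 GB, the function name, and the "big" label for anything larger are assumptions.

```python
def classify_data_size(size_gb: float,
                       small_limit_gb: float = 10.0,
                       medium_limit_gb: float = 1000.0) -> str:
    """Label a dataset as small, medium, or big by its size in GB.

    The default limits are assumed cut-offs, not values from the slides.
    """
    if size_gb < 0:
        raise ValueError("size_gb must be non-negative")
    if size_gb < small_limit_gb:
        return "small"   # expected to fit in RAM
    if size_gb <= medium_limit_gb:
        return "medium"  # expected to fit on a hard disk
    return "big"         # larger than a single disk is assumed to hold


print(classify_data_size(4))
print(classify_data_size(200))
print(classify_data_size(5000))
```

The limits are parameters so they can be adjusted to the hardware actually available.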
• What are the aspects of a particular domain? A domain such as hotels has its own aspects.
Example: for a 100-watt panel, Voc = 20 V or 21 V and Isc = 5 A, thus
P = 100 watts (approx.). (How can it be managed efficiently?)
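The arithmetic in the panel example can be checked with a short sketch. It follows the slide's approximation P ≈ Voc × Isc. The function name is hypothetical, and the product is only a rough figure, since a real panel delivers somewhat less than Voc × Isc at its operating point.

```python
def approx_panel_power(voc_volts: float, isc_amps: float) -> float:
    """Rough panel power in watts, using the slide's estimate Voc * Isc."""
    return voc_volts * isc_amps


# Values taken from the slide: Voc = 20 V or 21 V, Isc = 5 A.
for voc in (20.0, 21.0):
    print(voc, "V ->", approx_panel_power(voc, 5.0), "W")
```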
Why Cloud
• Integrating IoT with big data is not easy; it requires domain knowledge.
• category
• Tweets data
TRC Funding Project: TripAdvisor Web Site Scraping Tool
[Figure: Hadoop-based pipeline. Boxes: Data Collection, Storage & Structure on Hadoop (HDFS), Data Preparation, Data Exploration, Sentiment Analysis, Model Analysis]
• Step 4: Data Exploration Analysis
Explore the most important terms.
This is topic modeling.
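As a first step toward exploring the most important terms, a plain term-frequency count can be sketched as follows. This is simpler than topic modeling and is not the project's actual method. The function name, the stop-word list, and the sample review strings are all hypothetical and serve only as illustration.

```python
import re
from collections import Counter

# Assumed, deliberately tiny stop-word list; a real project would use a fuller one.
STOPWORDS = {"the", "a", "an", "and", "was", "is", "of", "to", "in", "but", "very"}


def top_terms(documents, n=3):
    """Return the n most frequent non-stop-word terms across all documents."""
    counts = Counter()
    for doc in documents:
        tokens = re.findall(r"[a-z']+", doc.lower())
        counts.update(t for t in tokens if t not in STOPWORDS)
    return counts.most_common(n)


# Hypothetical review texts, not data from the project.
sample_reviews = [
    "The room was clean and the staff was friendly",
    "Friendly staff but the room was small",
    "Clean room, great location",
]

print(top_terms(sample_reviews, n=4))
```

The resulting counts could then feed a proper topic model once the data is prepared.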
Is this enough? The answer is no.
• It is sentiment analysis
• It is K-means analysis
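To make the K-means step concrete, here is a minimal pure-Python sketch of the standard Lloyd iteration. It is an illustration under stated assumptions, not the project's implementation: the function name, the naive initialisation from the first k points, and the sample 2-D points are all hypothetical.

```python
from math import dist


def k_means(points, k, iterations=10):
    """Cluster points with basic K-means; returns (centroids, labels)."""
    # Assumed naive initialisation: take the first k points as centroids.
    centroids = [tuple(p) for p in points[:k]]
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        new_centroids = []
        for i, cluster in enumerate(clusters):
            if cluster:
                mean = tuple(sum(axis) / len(cluster) for axis in zip(*cluster))
                new_centroids.append(mean)
            else:
                new_centroids.append(centroids[i])  # keep an empty cluster's centroid
        if new_centroids == centroids:
            break  # converged
        centroids = new_centroids
    labels = [min(range(k), key=lambda i: dist(p, centroids[i])) for p in points]
    return centroids, labels


# Hypothetical 2-D feature vectors, for illustration only.
sample_points = [(1, 1), (1.5, 2), (8, 8), (9, 9.5), (1, 0.5), (8.5, 9)]
centroids, labels = k_means(sample_points, k=2)
print(labels)
```

In practice the input vectors would come from the prepared review data, and a library implementation with better initialisation would usually be preferred.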
Plan
Literature review
• Convince the Ministry of Tourism of the effectiveness of our project
in the domain of IoT + big data.
• Google Cloud will show the future plan.
[Figure: workflow with the stages Data Collection & Scraping, Data Preparation, Data Insight, Data Exploration, Reports]
Thank you! Questions should start from what I will do, not what you did.
Works Cited
• https://data.gov.uk/dataset/road-accidents-safety-data
• https://www.thebalance.com/what-is-crowdsourcing-marketing-and-how-is-it-used-2295467
• http://www.shu.edu/technology/
• http://archive.ics.uci.edu/ml/datasets.html?sort=nameUp&view=list
• http://www.dw.com/en/big-data-reveals-shakespeare-co-authored-17-of-his-plays/a-36145979