Akhlak Hossain
ID:100421977
Computing and Mathematics, UoD
• Big Data and its types.
• Data Analytics and Business Intelligence.
• Key Technologies.
• Technological Aspects of Data Analytics.
• Analytical Applications.
• Benefits.
• Limitations.
• Which one to use?
• Analytical Algorithm.
• Regression Analysis
• Benefits and Limitations.
• Decision Tree.
• Benefits and Limitations.
• Dataset.
• Conclusion.
Program Outline
• What is Big Data?
• Large volume of data
• Analyzed for better decisions and strategic business moves.
Key Technologies
Big data affects organizations across practically every industry.
Some of the areas where data analytics is carried out are listed
below.
• Banking.
• Education.
• Government.
• Healthcare.
• Manufacturing.
• Retail.
Analytical Applications
R
1. Cost-effective alternative.
2. Counterpart of SAS.
3. It’s free and can be downloaded by anyone.
4. Low-level programming language; straightforward processes can take longer code.
5. Largest online support.
SAS
1. Commercial software and hence not cheap.
2. Secured UI; flexible for people who have a background in SQL.
3. Well-analyzed upgrades, which make it easier to use.
4. Dedicated customer service that provides support with ease.
5. Around 15000 data can be based in SAS, the largest of all.
Analytical Applications
R
1. Good integration between the programming language and the statistical functions.
2. Relatively easy to integrate the application with other languages.
3. Packages include a wide variety of quantitative applications.
SAS
1. SAS has the ability to work on many platforms.
2. The software is reasonable to afford.
3. Sufficiently effective and flexible to meet a user’s demand.
Analytical Algorithm
• Studies the relationships between variables.
• Easy to use and applicable in many situations.
• One of the most commonly used tools for business analysis.
Regression Analysis
Benefits
1. Uses analysis and research to foresee what is likely to happen in the following quarter or year.
2. Can provide an understanding of how changes in customer spending or shifts in the local economy will affect an organization.
3. Supports business decisions.
4. Can reduce a large amount of data to actionable information.
5. Provides new insight for managers by disclosing patterns and relationships that have not been noticed previously.
Limitations
1. Focuses on the relationship between dependent and independent variables.
2. Not correct in most of the cases.
3. Regression assumes that the observations are independent.
4. This assumption is frequently, but not always, reasonable.
5. It does not completely describe the relationship between variables.
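As a minimal sketch of the first benefit above (foreseeing the following year), the R code below fits a straight-line trend with `lm()` and forecasts one year ahead. The `sales` data frame, its column names and all of its figures are made up for illustration; they are not taken from the report or its dataset.

```r
# Made-up yearly figures, for illustration only (not from the report).
sales <- data.frame(
  year    = 2010:2017,
  revenue = c(12.1, 13.4, 14.2, 15.9, 16.8, 18.1, 19.5, 20.2)
)

# Fit a simple linear regression: revenue as a function of year.
fit <- lm(revenue ~ year, data = sales)

# The coefficient on `year` is the estimated change in revenue per year.
print(coef(fit))

# Forecast the following year, with a 95% prediction interval.
next_year <- data.frame(year = 2018)
forecast  <- predict(fit, newdata = next_year, interval = "prediction")
print(forecast)
```

The forecast comes back as a one-row matrix with the columns `fit`, `lwr` and `upr`. The width of that interval is a reminder of the limitations listed above: the straight line is only an approximation of the relationship.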
Decision Tree
• The core algorithm for building decision trees is ID3, developed by
J. R. Quinlan.
• ID3 uses entropy and information gain to construct a decision tree.
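The two quantities ID3 relies on can be sketched in a few lines of base R. This is an illustrative sketch, not a full ID3 implementation: the helper names `entropy` and `information_gain` and the `toy` data frame are assumptions introduced here, and the data is made up.

```r
# Shannon entropy (in bits) of a vector of class labels.
entropy <- function(labels) {
  p <- table(labels) / length(labels)
  p <- p[p > 0]  # drop empty classes so that log2(0) never occurs
  -sum(p * log2(p))
}

# Information gain from splitting `labels` by a categorical `feature`:
# entropy before the split minus the weighted entropy after it.
information_gain <- function(labels, feature) {
  groups  <- split(labels, feature)
  weights <- sapply(groups, length) / length(labels)
  entropy(labels) - sum(weights * sapply(groups, entropy))
}

# Made-up toy data, for illustration only.
toy <- data.frame(
  weather = c("sunny", "sunny", "rainy", "rainy", "sunny", "rainy"),
  peak    = c("yes", "no", "yes", "no", "yes", "no"),
  delay   = c("no", "no", "yes", "yes", "no", "yes")
)

# Gain of each candidate attribute with respect to the target `delay`.
gains <- sapply(c("weather", "peak"), function(col) {
  information_gain(toy$delay, toy[[col]])
})
print(gains)

# ID3 splits first on the attribute with the highest gain.
print(names(which.max(gains)))
```

In this toy table `weather` separates the two `delay` classes completely, so it has the higher gain and would become the root of the tree.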
Dataset
• For my proposed dataset, regression analysis has been
used.
• With the help of regression analysis I will be able to
predict the rate at which incidents are increasing and
explore how to decrease it.
• Moreover, it will also help me to identify in which state
the incidents are taking place and at what hour they
happen most frequently.
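Before any model is fitted, the "which state" and "what hour" questions can be explored with a plain frequency count, which is a simpler technique than the regression the slide proposes. The sketch below assumes hypothetical columns named `state` and `hour`; the real attribute names of the dataset are not given here, and the records are made up.

```r
# Made-up incident records. The column names `state` and `hour` are
# assumptions for illustration, not the dataset's real attributes.
incidents <- data.frame(
  state = c("A", "B", "A", "C", "A", "B", "A", "C"),
  hour  = c(8, 17, 8, 23, 17, 8, 8, 17)
)

# Count incidents per state and per hour of day.
by_state <- table(incidents$state)
by_hour  <- table(incidents$hour)

# State and hour with the most incidents in this toy sample.
top_state <- names(which.max(by_state))
top_hour  <- as.integer(names(which.max(by_hour)))

print(by_state)
print(by_hour)
cat("Most incidents: state", top_state, "at hour", top_hour, "\n")
```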
Attributes of Dataset
• The regression model is fitted on numeric values, so string
columns have to be encoded or removed first.
• In order to progress further we need to remove the string column.
• Command used to remove the string column
“inctype”:
• rti$inctype <- NULL
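The effect of that command can be checked on a small stand-in data frame. In the sketch below only the names `rti` and `inctype` come from the slide; the `hour` column and all values are made up. It also shows one possible alternative, encoding the strings as a factor, for cases where the incident type should stay available to the analysis.

```r
# Made-up stand-in for the `rti` data frame. Only the column name
# `inctype` comes from the slide; everything else is hypothetical.
rti <- data.frame(
  hour    = c(8, 17, 23),
  inctype = c("collision", "breakdown", "collision"),
  stringsAsFactors = FALSE
)

# Option 1 (the slide's approach): drop the string column entirely.
rti_dropped <- rti
rti_dropped$inctype <- NULL
print(names(rti_dropped))

# Option 2: keep the information by encoding the strings as a factor,
# which R stores as integer codes with text labels.
rti_encoded <- rti
rti_encoded$inctype <- factor(rti_encoded$inctype)
print(levels(rti_encoded$inctype))
print(as.integer(rti_encoded$inctype))
```

Assigning `NULL` deletes the column for good, so it is safer to apply it to a copy when the original values may be needed later.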
Further Breakdown
• In this report I have built up an understanding of the current
state of business intelligence, along with an introduction to
analytical algorithms and what they are.
• Analytical techniques such as decision trees and regression
analysis have been examined, together with their benefits and
limitations.
• Analytical applications such as SAS and R have been briefly
described, together with their benefits and limitations.
• Finally, a short description of the dataset for my next
paper has been included in the report.
Conclusion