
• Big Data and its types.

• Data Analytics and Business Intelligence.

• Key Technologies.

• Technological Aspects of Data Analytics.

• Analytical Applications.

• Benefits.

• Limitations.

• Which one to use?

• Analytical Algorithm.

• Regression Analysis

• Benefits and Limitations.

• Decision Tree.

• Benefits and Limitations.

• Dataset.

• Conclusion.

• What is Big Data?

• Large volume of information.

• Analyzed to support better decisions and strategic business moves.

• 13 V’s of Big Data:

• Validity

• Volatility

• Verbosity

• Vulnerability

• Verification

• Volume

• Velocity

• Variety

• Variability

• Value

• Veracity

• Visualization

• Viscosity

• Data analytics (DA) is the science of examining raw data in order to draw conclusions about
that information. Data analytics is used in many industries to allow companies and
organizations to make better business decisions, and in the sciences to verify or disprove
existing models or theories.

• Business intelligence (BI) is a technology-driven process for analyzing data and presenting
actionable information to help corporate executives, business managers, and other end users
make more informed business decisions.

Big data analytics does not rely on any single technology. Advanced analytics can certainly be
applied to big data, but in practice several types of technology work together to get the most
value out of the information. Here are some of them:

• Data management.

• Data mining

• Hadoop

• In-memory analytics

• Predictive analytics

• Text mining

Big data affects organizations across practically every industry. Some of the areas where data
analytics is applied are discussed below.

o Banking.

o Education.

o Government.

o Healthcare.

o Manufacturing.

o Retail.

Banking:

• To understand customers and boost their satisfaction.

• To minimize risk by maintaining regulatory compliance.

• Technologies Used:

• Data Management.

• Predictive Analytics.

Education:

• To make sure students are making adequate progress.

• To evaluate and support teachers.

• Technologies Used:

• Data Management.

• Predictive Analytics.

Healthcare:

• Patient records.

• Treatment plans.

• Prescription information.

• Technologies Used:

• Data management.

• In-memory analytics.

• Predictive analytics.

Retail:

• To understand what customers want and need.

• To handle transactions.

• To build customer relationships.

• Technologies Used:
• Hadoop.

• Data management.

• Predictive analytics.

Government:

• To manage utilities.
• To run agencies.
• To deal with traffic congestion.
• To prevent crime.

• Technologies Used:

• Data management.

• Data mining.

• In-memory analytics.

• Predictive analytics.

Manufacturing:

• To boost quality and output.
• To minimize waste.
• To solve problems faster.
• To make more agile business decisions.

• Technologies Used:

• Data mining.

• In-memory analytics.

• Predictive analytics.

• Analytical applications are enterprise-oriented software.

• They are used to improve the performance of business operations.

• More precisely, these applications are a type of business intelligence solution.

R:

1. Cost-effective alternative.
2. Counterpart of SAS.
3. Free, and can be downloaded by anyone.
4. Low-level programming language, so simple procedures can require longer code.
5. Largest online support.

SAS:

1. Commercial software and hence not cheap.
2. Secure UI, and approachable for people who already know SQL.
3. Well-tested upgrades, which makes it easier to use.
4. Dedicated customer service that provides support readily.
5. Around 15,000 data can be held in SAS, the largest of all.

Benefits of R:

1. Good integration between the programming language and the statistical functions.
2. Relatively easy to integrate the application with other languages.
3. Packages cover a wide variety of quantitative applications.
4. Help from a large community is readily available on almost any topic related to the application.
5. Always up to date.

Benefits of SAS:

1. SAS can work on many platforms.
2. The software is reasonably affordable.
3. Sufficiently effective and flexible to meet a user's demands.
4. It is easy and straightforward to enter data and set up files.
5. Integrated framework with a common architecture shared by its modules.

Limitations of R:

1. Documentation is terse and impenetrable to a non-expert.
2. May not handle large data sets as efficiently as SAS.
3. Quickly consumes a large portion of the available memory while the application is running.
4. Poor choice of tool for data mining.

Limitations of SAS:

1. It is primitive and can therefore seem a bit hard to use for first-time users.
2. No graphical possibilities.
3. Not cheap.
4. Not an open source application.

• As I will be carrying out the following research, I might choose R, because it is free to use and
its logic is somewhat easier.

• However, the TCO (Total Cost of Ownership) of using R may turn out higher than that of SAS. For
instance, an analytics company may decide to use only R on the reasoning that, because it does not
have to pay for SAS licenses, its project delivery costs will go down, leading to better profit
margins, lower billing to clients, and better competitiveness in the market. That reasoning
considers license fees only, while TCO covers other costs as well.

What is an Analytical Algorithm?

• It is a set of heuristics and calculations that builds a data mining model from a given set of
data.

• It analyzes the given data, looking for specific types of patterns.

• It uses the results of the analysis to extract significant patterns and detailed statistics.

Why do we need to analyze?

• To classify problems and algorithms by difficulty, and to predict performance.
• For a better understanding of, and improvements in, implementations of algorithms.

Regression Analysis

• Investigation of relationships between variables.
• Easy to use and applicable in many situations.
• One of the most commonly used tools for business analysis.

Benefits:

1. Can give an understanding of how changes in customer spending or shifts in the local economy
will affect an organization.
2. Uses analysis and research to predict what is likely to happen in the following quarter or year.
3. Helps in making business decisions.
4. Can reduce large amounts of data to meaningful information.
5. Gives new insight to managers by revealing patterns and relationships that had not been noticed
previously.

Limitations:

1. Focuses only on the relationship between dependent and independent variables.
2. The assumed relationship does not hold in many cases.
3. Regression assumes that the data are independent.
4. This is often, but not always, a sensible assumption.
5. It does not completely describe the relationship between variables.
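
As a minimal sketch of the regression idea above, the following base R snippet fits a straight line to made-up yearly violation counts. The data frame `violations`, its values, and the variable names are illustrative assumptions, not taken from the dataset used later in this report.

```r
# Hypothetical data: yearly counts of violations (illustrative values only).
violations <- data.frame(
  year  = c(2010, 2011, 2012, 2013, 2014, 2015),
  count = c(520, 498, 471, 455, 430, 412)
)

# Fit a straight line: count = intercept + slope * year.
fit <- lm(count ~ year, data = violations)

# The slope estimates the average change in violations per year.
slope <- unname(coef(fit)["year"])

# Predict the count for a year just outside the observed range.
predicted_2016 <- unname(predict(fit, newdata = data.frame(year = 2016)))

print(slope)
print(predicted_2016)
```

A negative slope here would indicate a downward trend. The prediction is only as good as the straight-line assumption listed under the limitations.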

Decision Tree

• It breaks a dataset down into smaller subsets while the decision tree is incrementally developed.
• The result is a tree with decision nodes and leaf nodes.
• A decision node has two or more branches.
• A leaf node represents a classification or decision.
• The topmost decision node in a tree, which corresponds to the best predictor, is called the root node.
• Decision trees can handle both categorical and numerical data.
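
The structure described above can be sketched in base R as nested lists. This is a hand-built toy tree, not one learned from data. The attribute names (`road_type`, `hour`), the branch values, and the helper functions `leaf`, `node`, and `classify` are hypothetical and exist only for illustration.

```r
# A hand-built tree: decision nodes test one attribute, leaf nodes hold a decision.
# Attribute names and values are hypothetical, for illustration only.
leaf <- function(decision) list(type = "leaf", decision = decision)
node <- function(attribute, branches) {
  list(type = "node", attribute = attribute, branches = branches)
}

tree <- node("road_type", list(      # root node: the topmost decision node
  highway = node("hour", list(       # a decision node with two branches
    day   = leaf("speeding"),
    night = leaf("no_violation")
  )),
  city = leaf("parking")             # a leaf node: the final decision
))

# Walk from the root to a leaf, following the branch that matches each attribute.
# The sketch assumes every attribute value in the record has a matching branch.
classify <- function(tree, record) {
  while (tree$type == "node") {
    value <- record[[tree$attribute]]
    tree <- tree$branches[[value]]
  }
  tree$decision
}

classify(tree, list(road_type = "highway", hour = "night"))
```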

The core algorithm for building decision trees is called ID3, developed by J. R. Quinlan.

ID3 uses Entropy and Information Gain to construct a decision tree.
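
The two quantities can be sketched in base R as follows. Entropy is computed as -sum(p * log2(p)) over the class proportions, and information gain is the entropy before a split minus the weighted entropy after it. The function names and the toy vectors `citation` and `hour` are assumptions made for this illustration. The snippet shows only the scoring step, not a full ID3 implementation.

```r
# Entropy of a vector of class labels: -sum(p * log2(p)).
entropy <- function(labels) {
  p <- table(labels) / length(labels)
  p <- p[p > 0]                      # drop empty classes to avoid log2(0)
  -sum(p * log2(p))
}

# Information gain from splitting `labels` by the values of `attribute`.
information_gain <- function(labels, attribute) {
  subsets <- split(labels, attribute)
  weights <- sapply(subsets, length) / length(labels)
  h_after <- sum(weights * sapply(subsets, entropy))
  entropy(labels) - h_after
}

# Toy data (hypothetical): does a traffic stop end in a citation?
citation <- c("yes", "yes", "no", "no", "yes", "no", "yes", "no")
hour     <- c("night", "night", "day", "day", "night", "day", "day", "night")

entropy(citation)
information_gain(citation, hour)
```

ID3 would compute this gain for every candidate attribute and place the one with the highest gain at the current decision node.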

Benefits:

1. Can be used alongside other project management tools.
2. A decision tree can easily be modified as new data arrives.
3. Works with continuous attributes.
4. Decision tree documentation can be maintained easily.
5. A decision tree can also represent decision alternatives, possible outcomes, and chance events
schematically.

Limitations:

1. The more decisions there are in a tree, the less accurate any expected outcomes are likely to be.
2. Large trees that include many decision nodes can be convoluted and may have limited value.
3. An unrealistic decision tree could steer you toward a bad decision.
4. Unexpected events may alter decisions.

• The dataset was downloaded from: https://www.data.gov/

• This dataset contains police traffic enforcement activity. It covers all types of enforcement
activity, such as parking violations, traffic violations, and speed-limit violations, that occur
frequently in each state.
• The reason for choosing this dataset is to show how violations have decreased and what impact
this has had on society.

• For my proposed dataset, regression analysis has been used.

• With the help of regression analysis, I will be able to predict the rate at which violations are
increasing and how to decrease it.
• Moreover, it will also help me identify in which state incidents are taking place and at what
hour they happen most frequently.
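
Finding where and when incidents concentrate amounts to counting them by state and by hour, which can be done before fitting any model. The following is a minimal sketch in base R, assuming hypothetical columns `state` and `hour` and made-up records. The real dataset's column names and values may differ.

```r
# Hypothetical incident records; the real dataset's column names may differ.
incidents <- data.frame(
  state = c("MD", "MD", "VA", "MD", "VA", "MD"),
  hour  = c(8, 17, 17, 17, 8, 17)
)

# Count incidents for every state/hour combination.
counts <- as.data.frame(
  table(state = incidents$state, hour = incidents$hour),
  stringsAsFactors = FALSE
)

# The row with the highest count is the busiest state/hour pair.
busiest <- counts[which.max(counts$Freq), ]
print(busiest)
```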

• The regression model needs numeric inputs; a column of string values such as "inctype" cannot be
used as a numeric predictor.
• To progress further, we need to remove the string column.
• Command used to remove the column "inctype":

    rti$inctype <- NULL
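
To show the effect of this command, here is a small sketch on a made-up stand-in for the `rti` data frame. The columns `hour` and `count` and all values are assumptions. The second option, encoding the strings as a factor, is an alternative that is not part of the original workflow. It is shown only because R can treat such a column as categorical instead of discarding it.

```r
# Hypothetical stand-in for the author's `rti` data frame.
rti <- data.frame(
  inctype = c("Parking", "Speeding", "Parking"),
  hour    = c(9, 17, 22),
  count   = c(3, 5, 1),
  stringsAsFactors = FALSE
)

# Option 1 (the command above): drop the string column entirely.
rti_numeric <- rti
rti_numeric$inctype <- NULL

# Option 2 (an alternative, not from the source): keep the information
# by encoding the strings as a categorical factor.
rti_factor <- rti
rti_factor$inctype <- factor(rti_factor$inctype)

str(rti_numeric)
levels(rti_factor$inctype)
```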

• In this report I have built up an understanding of what current business intelligence technology
is, along with an introduction to analytical algorithms and what they are.
• Analytical techniques such as decision trees and regression analysis have been examined, along
with the benefits and limitations of these techniques.
• Analytical applications such as SAS and R have been briefly described, along with their benefits
and limitations.
• Finally, a short description of the dataset for my next paper has been included in the report.
