
MADHAV INSTITUTE OF TECHNOLOGY & SCIENCE, GWALIOR

(A Govt. Aided UGC Autonomous & NAAC Accredited Institute Affiliated to RGPV, Bhopal)

DEPARTMENT OF CSE & IT


(MCSE - 620303)
Presentation On Data Mining
M.TECH-CSE (3rd Sem.)
2019-2020

Submitted To:
Dr. R.K. Gupta
Asst. Prof. Akanchha Tiwari
(Dept. of CSE & IT)

Submitted By:
Shefali Goel
(0901CS18MT15)
An Introduction to
Classifiers in Data Mining
Content

• What is Data Mining
• Applications of Data Mining
• Techniques of Data Mining
• Flow diagram of Data Mining
• Different classifiers
• Ensemble
• Data Mining Tools
• Challenges of Data Mining
• Conclusion
What is Data Mining
• “Data Mining” refers to the extraction of
useful information from a large volume of data
or from a data warehouse.
• Data Mining is an essential step in discovering
knowledge from databases.
Applications Of Data Mining
• Communications
• E-Commerce
• Super Markets
• Crime Investigation
• Bioinformatics
• Disease prediction
• Education
• Fraud Detection
Techniques Of Data Mining
CLUSTERING
• Cluster analysis is a data mining technique that
groups data items that are similar to each other.
• This process helps to understand the differences and
similarities within the data.
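
One way to make the grouping idea concrete is k-means, a common clustering algorithm that the slides do not name. The following is a minimal sketch on one-dimensional data; the function name `kmeans_1d`, the sample values, and the starting centers are all illustrative assumptions.

```python
def kmeans_1d(values, centers, iterations=10):
    """Lloyd-style k-means on 1-D data, starting from the given centers."""
    centers = list(centers)
    clusters = [[] for _ in centers]
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        # Assign every value to its nearest center.
        for v in values:
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            clusters[nearest].append(v)
        # Move each center to the mean of its cluster (keep it if the cluster is empty).
        centers = [sum(c) / len(c) if c else centers[i]
                   for i, c in enumerate(clusters)]
    return centers, clusters


# Hypothetical sample data: two obvious groups of similar values.
values = [1.0, 1.5, 2.0, 9.0, 10.0, 11.0]
centers, clusters = kmeans_1d(values, centers=[0.0, 5.0])
print(centers)
print(clusters)
```

Values that end up in the same cluster are the ones that are "like each other"; the distance between the final centers reflects the difference between the groups.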
REGRESSION
• Regression analysis is the data mining method of
identifying and analyzing the relationship between
variables.
• It is used to identify the likelihood of a specific
variable, given the presence of other variables.
• It is used to predict continuous numeric values.
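
As an illustration of predicting a numeric value from another variable, here is a minimal sketch of simple linear regression by ordinary least squares. The function name `fit_line` and the data points are assumptions made for the example, not taken from the slides.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums around the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data that lies exactly on y = 2x + 1.
xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]
slope, intercept = fit_line(xs, ys)
prediction = slope * 5 + intercept  # predicted y for the unseen x = 5
print(slope, intercept, prediction)
```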
OUTLIER DETECTION
• This data mining technique refers to the
observation of data items in the dataset that do not
match an expected pattern or expected behavior.
• This technique can be used in a variety of domains,
such as intrusion detection and fraud or fault
detection.
• Outlier detection is also called Outlier Analysis or
Outlier Mining.
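
A simple way to flag items that do not match the expected pattern is a z-score test. This is only one of many outlier detection methods, and it is not named in the slides. In the sketch below, the function name `zscore_outliers`, the threshold of 2.0, and the readings are illustrative assumptions.

```python
import statistics


def zscore_outliers(values, threshold=2.0):
    """Return values lying more than `threshold` standard deviations from the mean."""
    mean = statistics.mean(values)
    spread = statistics.pstdev(values)  # population standard deviation
    if spread == 0:
        return []  # all values identical, nothing can be an outlier
    return [v for v in values if abs(v - mean) / spread > threshold]


# Hypothetical sensor readings with one value far from the rest.
readings = [10, 11, 9, 10, 12, 10, 50]
outliers = zscore_outliers(readings)
print(outliers)
```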
SEQUENTIAL PATTERN
• This data mining technique helps to discover or
identify recurring patterns or trends in data over a
certain period.
• Examples: customer shopping sequences, weather
prediction.
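
For the customer shopping example, a very reduced form of sequential pattern mining is to count how many customers bought one item and later another. The sketch below does this for ordered pairs only; the purchase sequences are invented for illustration, and real sequential pattern algorithms handle longer patterns and time constraints.

```python
from collections import Counter

# Hypothetical purchase histories, each in chronological order.
sequences = [
    ["phone", "case", "charger"],
    ["phone", "case"],
    ["laptop", "mouse"],
    ["phone", "charger"],
]

pair_counts = Counter()
for seq in sequences:
    seen = set()
    # Collect every ordered pair (earlier item, later item) once per customer.
    for i, first in enumerate(seq):
        for second in seq[i + 1:]:
            seen.add((first, second))
    pair_counts.update(seen)

print(pair_counts[("phone", "case")])
print(pair_counts[("case", "phone")])
```

A pair with a high count, such as a phone followed by a case, is a candidate sequential pattern; the reverse order never occurs in this data.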
PREDICTION
• Prediction uses a combination of other data
mining techniques, such as trend analysis, sequential
patterns, clustering, and classification.
• It analyzes past events or instances in the right
sequence to predict a future event.
ASSOCIATION RULE
• Association rule mining is a procedure that aims to
find frequently occurring patterns, correlations, or
associations in datasets drawn from various kinds of
databases.
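
Association rules are commonly scored with two measures, support and confidence. The slides do not name them, so they are introduced here as an assumption. The sketch below computes both for the rule "bread implies milk" on a small invented set of market-basket transactions.

```python
# Hypothetical market-basket transactions.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]


def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)


def confidence(antecedent, consequent, transactions):
    """Estimated probability of the consequent given the antecedent."""
    both = set(antecedent) | set(consequent)
    return support(both, transactions) / support(antecedent, transactions)


rule_support = support({"bread", "milk"}, transactions)
rule_confidence = confidence({"bread"}, {"milk"}, transactions)
print(rule_support, rule_confidence)
```

Here bread and milk appear together in 2 of 4 transactions, and 2 of the 3 transactions containing bread also contain milk.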
CLASSIFICATION
• It is used to retrieve important and relevant
information about data.
• This data mining method assigns data items to
different predefined classes.
FLOW DIAGRAM
NAÏVE BAYES CLASSIFIER
The Naive Bayes classifier is based on Bayes'
theorem.
The classifier calculates the most probable output
class based on the input.
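
To show how the output is calculated from the input, here is a minimal sketch of a Naive Bayes classifier for categorical features. It multiplies the class prior by the per-feature conditional frequencies and picks the class with the highest score. The function names, the toy weather-style data, and the absence of smoothing are assumptions of this example; without smoothing, a feature value never seen with a class gives that class a score of zero.

```python
from collections import Counter, defaultdict


def train(rows, labels):
    """Count classes and, per (feature index, class), the feature values seen."""
    class_counts = Counter(labels)
    value_counts = defaultdict(Counter)
    for row, label in zip(rows, labels):
        for i, value in enumerate(row):
            value_counts[(i, label)][value] += 1
    return class_counts, value_counts


def predict(row, class_counts, value_counts):
    """Return the class maximizing P(class) * product of P(value | class)."""
    total = sum(class_counts.values())
    scores = {}
    for label, count in class_counts.items():
        score = count / total  # prior P(class)
        for i, value in enumerate(row):
            # Relative frequency of this value within the class (no smoothing).
            score *= value_counts[(i, label)][value] / count
        scores[label] = score
    return max(scores, key=scores.get)


# Hypothetical training data: (outlook, temperature) -> play?
rows = [("sunny", "hot"), ("sunny", "mild"), ("rainy", "mild"),
        ("rainy", "cool"), ("sunny", "cool")]
labels = ["no", "no", "yes", "yes", "yes"]
model = train(rows, labels)
print(predict(("rainy", "mild"), *model))
print(predict(("sunny", "hot"), *model))
```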
DECISION TREES
• A decision tree is a predictive machine-learning model.
• The internal nodes of a decision tree denote the different
attributes.
• The branches between the nodes represent the possible
values of those attributes, while the terminal nodes give
the final value of the dependent variable.
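
The structure described above can be sketched with nested dictionaries: each internal node names an attribute, each branch is one of its possible values, and each terminal node holds the final class. The tree below is hand-written for illustration, not learned from data, and its attribute names and values are assumptions.

```python
# Hypothetical hand-built tree: internal nodes are dicts, terminal nodes are strings.
tree = {
    "attribute": "outlook",
    "branches": {
        "sunny": {
            "attribute": "windy",
            "branches": {True: "stay in", False: "play"},
        },
        "rainy": "stay in",
        "overcast": "play",
    },
}


def classify(node, record):
    """Follow the branch matching the record's attribute value until a leaf is reached."""
    while isinstance(node, dict):
        node = node["branches"][record[node["attribute"]]]
    return node


print(classify(tree, {"outlook": "sunny", "windy": False}))
print(classify(tree, {"outlook": "rainy", "windy": True}))
```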
RANDOM FOREST
• Random forest is a tree-based classifier.
• The algorithm creates a forest with a large number of
trees.
• It is effective in handling missing values.
ENSEMBLE TECHNIQUES
• An ensemble is a strategy that can be used to improve
the accuracy of a classifier.
• It is a method in which two or more classifiers are
combined.
• The aim of combining multiple classifiers is to obtain
better performance.
TECHNIQUES OF ENSEMBLE
• Bagging
• Voting
• Stacking
• Boosting
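
Of the techniques listed, voting is the simplest to sketch: each classifier predicts a class and the most common prediction wins. The three rule-based classifiers, their thresholds, and the message fields below are hypothetical stand-ins for trained models.

```python
from collections import Counter


def majority_vote(classifiers, x):
    """Return the class predicted by the largest number of classifiers."""
    votes = [clf(x) for clf in classifiers]
    return Counter(votes).most_common(1)[0][0]


# Three hypothetical base classifiers, each looking at a different feature.
classifiers = [
    lambda x: "spam" if x["links"] > 3 else "ham",
    lambda x: "spam" if x["caps_ratio"] > 0.5 else "ham",
    lambda x: "spam" if x["length"] < 20 else "ham",
]

message = {"links": 5, "caps_ratio": 0.2, "length": 10}
result = majority_vote(classifiers, message)
print(result)
```

Using an odd number of classifiers for a two-class problem avoids tied votes. The second classifier is outvoted here, which shows how the combination can correct an individual error.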
Data Mining Tools
• Weka
• Rattle
• RapidMiner
• Orange
• KNIME
• NLTK
Challenges of Data Mining
• Skilled experts are needed to formulate the data
mining queries.
• Overfitting: with a small training database, a
model may fit the training data too closely and fail
to fit future data.
• Data mining needs large databases, which are
sometimes difficult to manage.
• If the data set is not diverse, data mining results
may not be accurate.
CONCLUSION
• Data Mining is about explaining the past and
predicting the future through analysis.
• The main drawback of data mining is that much
analytics software is difficult to operate and requires
advanced training to use.
