Nashit

A
SYNOPSIS
on
Data Mining Techniques
(Session 2019-2020)

Submitted By: Nashit Hussain (PCE16IT038)
Faculty Supervisor/Guide: Mr. Dinesh Chand Gupta
Faculty Coordinators (Seminar): (1) Ms. Shazia Haque, (2) Ms. Sita Gupta

Department of Information Technology
Poornima College of Engineering, Jaipur
30 January, 2020
Abstract:
Data mining is the process of extracting knowledge from databases or data
warehouses, where the knowledge extracted was previously unknown and is valid
and actionable. Data mining is now a modern and powerful IT&C tool that
automates the discovery of relationships and combinations in raw data and feeds
the results into automated decision support.
Data mining on large databases has been a major concern in the research
community because of the difficulty of analyzing huge volumes of data using only
traditional OLAP tools. Such processing demands a great deal of computational
power, memory, and disk I/O, which parallel computers are well suited to
provide. We present a discussion of how database technology can be integrated
with data mining techniques. Finally, we point out several advantages of
addressing data-intensive activities through a tight integration of a parallel
database server and data mining techniques.
This synopsis gives a short overview of data mining techniques. It covers the
need for this technology, its technical applications, some of the methods
available for data mining, its advantages, and some obstacles that future
technological developments will need to overcome. For this, I reviewed 8
research papers from IEEE conferences published between 2016 and 2019.
Uses of Data Mining:
Data mining is used to examine raw data, including sales numbers, prices, and
customer data, in order to develop better marketing strategies, improve
performance, or decrease the cost of running the business. Data mining also
serves to discover new patterns of behavior among consumers.
Data Mining Techniques:
Broadly speaking, there are seven main Data Mining techniques:
1) Data Cleaning: Data cleaning is the process of detecting and correcting
corrupt or inaccurate records in a record set, table, or database. It involves
identifying incomplete, incorrect, inaccurate, or irrelevant parts of the data
and then replacing, modifying, or deleting the dirty or coarse data.
2) Clustering: Clustering is one of the oldest techniques used in Data Mining.
It is the process of grouping data items that are similar to each other.
Clustering is also called segmentation and helps users understand what is
going on within the database.
3) Visualization: Visualization is used at the beginning of the Data Mining
process. It helps turn poor data into good data, so that different kinds of
methods can be used to discover hidden patterns.
4) Decision Tree: A decision tree is a predictive model that, as the name
implies, looks like a tree. In this technique, each branch of the tree is
viewed as a classification question, and the leaves of the tree are
partitions of the dataset related to that particular classification. This
technique can be used for exploratory analysis, data pre-processing, and
prediction work.
5) Classification: Classification is the most commonly used data mining
technique. It uses a set of pre-classified samples to create a model that can
classify a larger set of data. This technique helps in deriving important
information about data and metadata (data about data). Classification is
closely related to cluster analysis, and it commonly uses decision trees or
neural networks.
6) Outlier Detection: This data mining technique refers to the observation of
data items in the dataset that do not match an expected pattern or expected
behavior. It can be used in a variety of domains, such as intrusion detection,
fraud detection, or fault detection. Outlier detection is also called Outlier
Analysis or Outlier Mining.
7) Tracking Patterns: Tracking patterns is intuitive for many people. Unlike
anomalies, patterns are generally reliable, though they're by no means
infallible. Businesses that store and analyze data in order to build buyer
personas and remain competitive have a clear advantage over those that
don't. Not only that, but identifying patterns is arguably the only thing that
makes identifying anomalies a possibility. If a business hasn't noticed a
pattern, how can it notice an anomaly? This means that companies which
don't recognize patterns also miss out on using anomalies to their
advantage.
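Several of the techniques listed above can be illustrated with a minimal
Python sketch. It is only an illustration under stated assumptions, not a
method taken from the reviewed papers: the purchase amounts are made up, the
function names (clean, find_outliers, kmeans_1d, fit_stump) are hypothetical,
the outlier rule is a median-absolute-deviation test with an arbitrary
threshold of 3, the clustering is a one-dimensional k-means, and the decision
tree is reduced to a single question (a "stump").

```python
from statistics import mean, median


def clean(records):
    """Data cleaning: keep only records whose value parses as a number."""
    cleaned = []
    for name, value in records:
        try:
            cleaned.append((name, float(value)))
        except (TypeError, ValueError):
            continue  # missing or malformed value: drop the dirty record
    return cleaned


def find_outliers(values, k=3.0):
    """Outlier detection: flag values more than k median absolute
    deviations (MAD) from the median. k=3.0 is an illustrative choice."""
    centre = median(values)
    mad = median(abs(v - centre) for v in values)
    return [v for v in values if abs(v - centre) > k * mad]


def kmeans_1d(values, k=2, iterations=10):
    """Clustering: a tiny 1-D k-means; assumes a non-empty list of values."""
    ordered = sorted(values)
    # Deterministic start: spread the initial centres over the sorted values.
    centres = [ordered[i * (len(ordered) - 1) // max(k - 1, 1)] for i in range(k)]
    clusters = [[] for _ in centres]
    for _ in range(iterations):
        clusters = [[] for _ in centres]
        for v in values:
            nearest = min(range(k), key=lambda i: abs(v - centres[i]))
            clusters[nearest].append(v)
        new_centres = [mean(c) if c else centres[i] for i, c in enumerate(clusters)]
        if new_centres == centres:
            break  # assignments are stable
        centres = new_centres
    return clusters


def fit_stump(samples):
    """Decision tree / classification: learn one question ("value <= t?")
    from pre-classified (value, label) samples by minimising training
    errors. Assumes at least two distinct values."""
    values = sorted({v for v, _ in samples})
    best = None
    for lo, hi in zip(values, values[1:]):
        t = (lo + hi) / 2  # candidate split between neighbouring values
        left = [lab for v, lab in samples if v <= t]
        right = [lab for v, lab in samples if v > t]
        left_label = max(set(left), key=left.count)
        right_label = max(set(right), key=right.count)
        errors = (sum(lab != left_label for lab in left)
                  + sum(lab != right_label for lab in right))
        if best is None or errors < best[0]:
            best = (errors, t, left_label, right_label)
    _, t, left_label, right_label = best
    return lambda v: left_label if v <= t else right_label


# Hypothetical purchase amounts; None and "n/a" stand in for dirty records.
raw = [("a", "12.0"), ("b", None), ("c", "11.5"), ("d", "n/a"),
       ("e", "13.0"), ("f", "250"), ("g", "40.0"), ("h", "42.5")]
amounts = [value for _, value in clean(raw)]
outliers = find_outliers(amounts)
typical = [v for v in amounts if v not in outliers]
low, high = kmeans_1d(typical, k=2)
classify = fit_stump([(v, "low") for v in low] + [(v, "high") for v in high])
print(outliers, low, high, classify(20.0))
```

The steps are chained in the order the list suggests: dirty records are
removed first, the anomalous amount is set aside, the remaining amounts are
grouped into two segments, and those segments serve as the pre-classified
samples from which the one-question tree is learned.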
Advantages:
• The patterns derived through Data Mining help in better understanding
customer behavior, which leads to better and more productive future decisions.
• Data Mining is used to find hidden facts about parts of the market that
would be beneficial for the business but that it has not yet reached.
• It is also used to identify the area of the market to target in order to
achieve marketing goals and generate a reasonably good ROI.
• Data Mining helps bring down operational costs by discovering and defining
the potential areas of investment.
Conclusion:
In conclusion, all of the techniques discussed help in discovering new and
useful patterns in data. From this discussion of data mining techniques, one
can understand their features, elements, purpose, characteristics, and
benefits, along with their limitations.
With the information presented above, one can better judge the credibility and
feasibility of data mining techniques.
References:
1. Hussain Ahmad Madni; Zahid Anwar; Munam Ali Shah, “Data Mining
Techniques and Applications – A Decade Review”, 2017 23rd
International Conference on Automation and Computing (ICAC), 26
October 2017.
2. Anoopkumar M; Dr. A. M. J. Md. Zubair Rahman, “A Review on Data
Mining Techniques and Factors Used in Educational Data Mining to
Predict Student Amelioration”, 2016 International Conference on Data
Mining and Advanced Computing (SAPIENCE), 12 December 2016.
3. Ubon Thongsatapornwatana, “A Survey of Data Mining Techniques for
Analyzing Crime Patterns”, 2016 Second Asian Conference on Defence
Technology (ACDT), 24 March 2016.
4. Rashi Bansal; Nishant Gaur; Dr. Shailendra Narayan Singh, “Outlier
Detection: Applications and Techniques in Data Mining”, 2016 6th
International Conference - Cloud System and Big Data Engineering
(Confluence), 11 July 2016.
5. Yogesh Gandge; Sandhya, “A Study on Various Data Mining Techniques
for Crop Yield Prediction”, 2017 International Conference on Electrical,
Electronics, Communication, Computer, and Optimization Techniques
(ICEECCOT), 08 February 2018.
6. Chitra Jalota; Rashmi Agrawal, “Analysis of Educational Data Mining
using Classification”, 2019 International Conference on Machine
Learning, Big Data, Cloud and Parallel Computing (COMITCon), 10
October 2019.
7. Virender Kumar; Cherry Khosla, “Data Cleaning – A thorough analysis
and survey on Unstructured data”, 2018 8th International Conference on
Cloud Computing, Data Science & Engineering (Confluence), 23 August
2018.
8. Priyanka Rajagouda Pradhan; Mr. R. B. Kulkarni, “Secure E-Learning
Using Data Mining Techniques and Concepts”, 2016 International
Conference on Electrical, Electronics, and Optimization Techniques
(ICEEOT), 24 November 2016.
9. Prashant Y. Niranjan; Harish H. Kenchannavar; Ramesh Medar, “Data
Mining Techniques in Telecom Sector”, 2017 International Conference on
Energy, Communication, Data Analytics and Soft Computing (ICECDS),
21 June 2018.
10. Irina Sitova; Jelena Pecerska, “Data Mining Techniques in Simulation
Results Analysis”, 2018 59th International Scientific Conference on
Information Technology and Management Science of Riga Technical
University (ITMS), 03 December 2018.
