Вы находитесь на странице: 1из 3

APPLICATIONS & TRENDS

IN DATA MINING
Gaurav Gupta1, Geetika Hans2, Tamanna Sehgal3
1
Sr. Lecturer in Deptt of Computer Sc. & Engg., RIMT – IET. Mandi Gobindgarh.
gaurav_shakti@yahoo.com
2
Technical Associate, Tech Mahindra.
geetika@techmahindra.com
3
Lecturer in Computer Sc. & Engg., CIET, Rajpura.
tamannapuri@gmail.com

ABSTRACT with data warehousing and database systems, the


standardization of data mining languages, and data privacy
The advent of computing technology has significantly protection and security.
influenced our lives and two major impacts of this effect are
Business data Processing and Scientific Computing. During INTRODUCTION
the early years of the development of computer techniques
for business, computer professionals were concerned with Data mining is the process of extraction of interesting
designing files to store the data so that information could be (nontrivial, implicit, previously unknown and potentially
efficiently retrieved. There were restrictions on storage size useful) patterns or knowledge from huge amount of data.
for storing data and on the speed of accessing the data.
Needless to say, the activity was restricted to a very few, It is the set of activities used to find new, hidden or
highly qualified professionals. Then came an era when unexpected patterns in data or unusual patterns in data.
Database Management System simplified the task. The Using information contained within data warehouse, data
responsibility of intricate tasks, such as declarative aspects mining can often provide answers to questions about an
of the programs was passed on to the database administrator organization that a decision maker has previously not
and the user could pose his query in simpler languages such thought to ask.
as query languages. Thus almost any business-small,
medium or large scale began using computers for day-to- • Which products should be promoted to a particular
day activities. customer?
Now what is the use of all this data? Up to the early 1990’s • What is the probability that a certain customer will
the answer to this was “NOT much”. No one was really respond to a planned promotion?
interested in utilizing data, which was accumulated during • Which securities will be most profitable to buy or
the process of daily activities. As a result a new discipline in sell during the next trading session?
computer science, Data Mining gradually evolved. • What is the likelihood that a certain customer will
default or pay back a schedule?
DATA MINING • What is the appropriate medical diagnosis for this
patient?
Data Mining is the exploration and analysis of large sets, in
order to discover meaningful patterns and rules. The key These types of questions can be answered surprisingly
idea is to find effective ways to combine computers power easily if the information hidden among the petabytes of data
to process data with the human eye’s ability to detect in your databases can be located and utilized. In the
patterns. The techniques of data mining are designed for following paragraphs we will discuss about the applications
work best with large data sets. and trends in the fields of data mining.

Since Data Mining is a young discipline with wide and APPLICATIONS


diverse applications, there is still a nontrivial gap between
general principals of Data Mining and domain specific, As data mining matures, new and increasingly innovative
effective Data Mining tools for particular applications. In applications for it emerge. Although a wide variety of data
this paper we will illustrate a few application domains of mining scenarios can be described. For the purpose of this
Data Mining (such as finance, the retail industry and paper the applications of data mining are divided in the
telecommunication) and Trends in Data Mining which following categories:
include further efforts towards the exploration of new
application areas and new methods for handling complex • Healthcare
data types, algorithms scalability, constraint based mining • Finance
and visualization methods, the integration of data mining • Retail industry
• Telecommunication business involved, identify telecommunication patterns,
• Text Mining & Web Mining catch fraudulent activities, make better use of resources, and
• Higher Education improve the quality of service

Healthcare Text Mining and Web Mining


The past decade has seen an explosive growth in biomedical Text mining is the process of searching large volumes of
research, ranging from the development of new documents from certain keywords or key phrases. By
pharmaceuticals and in cancer therapies to the identification searching literally thousands of documents various
and study of human genome by discovering large scale relationships between the documents can be established.
sequencing patterns and gene functions. Recent research in Using text mining however, we can easily derive certain
DNA analysis has led to the discovery of genetic causes for patterns in the comments that may help identify a common
many diseases and disabilities as well as approaches for set of customer perceptions not captured by the other syrvey
disease diagnosis, prevention and treatment. questions.
An extension of text mining is web mining. Web mining is
Finance an exciting new field that integrates data and text mining
Most banks and financial institutions offer a wide variety of within a website. It enhances the web site with intelligent
banking services (such as checking, saving, and business behavior, such as suggesting related links or recommending
and individual customer transactions), credit (such as new products to the consumer. Web mining is especially
business, mortgage, and automobile loans), and investment exciting because it enables tasks that were previously
services (such as mutual funds). Some also offer insurance difficult to implement. They can be configured to monitor
services and stock services. Financial data collected in the and gather data from a wide variety of locations and can
banking and financial industry is often relatively complete, analyze the data across one or multiple sites. For example
reliable and high quality, which facilitates systematic data the search engines work on the principle of data mining.
analysis and data mining. For example it can also help in
fraud detection by detecting a group of people who stage Higher Education
accidents to collect on insurance money. An important challenge that higher education faces today is
predicting paths of students and alumni. Which student will
Retail Industry enroll in particular course programs? Who will need
Retail industry collects huge amount of data on sales, additional assistance in order to graduate? Meanwhile
customer shopping history, goods transportation and additional issues, enrollment management and time-to-
consumption and service records and so on. The quantity of degree, continue to exert pressure on colleges to search for
data collected continues to expand rapidly, especially due to new and faster solutions. Institutions can better address
the increasing ease, availability and popularity of the these students and alumni through the analysis and
business conducted on web, or e-commerce. Retail industry presentation of data. Data mining has quickly emerged as a
provides a rich source for data mining. Retail data mining highly desirable tool for using current reporting capabilities
can help identify customer behavior, discover customer to uncover and understand hidden patterns in vast databases.
shopping patterns and trends, improve the quality of
customer service, achieve better customer retention and TRENDS
satisfaction, enhance goods consumption ratios design more As different types of data are available for data mining
effective goods transportation and distribution policies and tasks, so data mining approaches poses many challenging
reduce the cost of business. research issues in data mining. The design of a standard data
mining languages, the development of effective and
Telecommunication efficient data mining methods and systems, the construction
The telecommunication industry has quickly evolved from of interactive and integrated data mining environments, and
offering local and long distance telephone services to the applications of data mining to solve large applications
provide many other comprehensive communication services large application problems are important tasks for data
including voice, fax, pager, cellular phone, images, e-mail, mining researches and data mining system and application
computer and web data transmission and other data traffic. developers. Here we will discuss some of the trends in data
The integration of telecommunication, computer network, mining that reflect the pursuit of these challenges:
Internet and numerous other means of communication and
computing are underway. Moreover, with the deregulation Application Exploration
of the telecommunication industry in many countries and Earlier data mining was mainly used for business purpose,
the development of new computer and communication to overcome the competitors. But as data mining is
technologies, the telecommunication market is rapidly becoming more popular it is gaining wide acceptance in
expanding and highly competitive. This creates a great other fields also such as biomedicine, stock market, fraud
demand from data mining in order to help understand detection, telecommunication and many more. And many
new explorations are being done for this purpose. In between the needs for these applications and the available
addition for data mining for business continues to expand as technology.
e-commerce and marketing becomes mainstream elements
of the retail industry. Web mining
The World Wide Web is huge collection of globally
Scalable data mining methods distributed collection of news, advertisements, consumer
The current data mining methods capable of handling only a records, financial, education, government, e-commerce and
particular type of data and limited amount of data, but as many other services. The WWW also contains huge and
data is expanding at a massive rate, there is a need to dynamic collection hyper linked information, providing a
develop new data mining methods which are scalable and huge source for data mining. Based on the above facts, the
can handle different types of data and large volume of data. web also poses great challenges for efficient resource and
The data mining methods should be more interactive and knowledge discovery.
user friendly. One important direction towards improving
the repair efficiency of the timing process while increasing NEED OF DATA MINING
user interaction is constraint-based mining. This provide
user with more control by allowing the specification and use The massive growth of data from terabytes to perabytes is
of constraints to guide data mining systems in their search due to the wide availability of data in automated form from
for interesting patterns. various sources as WWW, Business, science, Society and
many more. But we are drowning in data but deficient of
Combination of data mining with database systems, data knowledge Data is useless, if it cannot deliver knowledge.
warehouse systems, and web database systems That is why data mining is gaining wide acceptance in
Database systems, data warehouse systems, and WWW are today’s world. A lot has been done in this field and lot more
loaded with huge amounts of data and have thus become the need to be done.
major information processing systems. It is important to
make sure that data mining serves as essential data analysis
component that can be easily included in to such an CONCLUSION
information-processing environment. The desired
architecture for data mining system is the tight coupling Since data mining is a young discipline with wide and
with database and data warehouse systems. Transaction diverse applications, there is still a nontrivial gap between
management query processing, online analytical processing general principles of data mining and domain specific,
and online analytical mining should be integrated into one effective data mining tools for particular applications. A few
unified framework. application domains of Data Mining (such as finance, the
retail industry and telecommunication) and Trends in Data
Standardization of data mining language: Mining which include further efforts towards the
Today few data mioning languages are commercially exploration of new application areas and new methods for
available in the market like Microsoft’s SQL server 2005, handling complex data types, algorithms scalability,
IBM Intelligent Miner, SAS Enterprise Miner, SGI Mineset, constraint based mining and visualization methods, the
Clementine , DBMiner and many more but a standard data integration of data mining with data warehousing and
mining language or other standardization efforts will database systems, the standardization of data mining
provide the orderly development of data mining solutions, languages, and data privacy protection and security.
improved interpretability among multiple data mining
systems and functions. REFERENCES

Visual data mining 1. Data Mining Concepts and Techniques – Jiawei


It is rightly said a picture is worth a thousand words. So if Han & Micheline Kamber
the result of the mined data can be shown in the visual form 2. Modern Data Warehousing, Mining and
it will further enhance the worth of the mined data. Visual Visualization Core Concepts by George M.
data mining is an effective way to discover knowledge from Marakas.
huge amounts of data. The systematic study and
development of visual data mining techniques will promote
the use for data mining analysis.

New methods for mining complex types of data


The complex types of data like geospatial, multimedia, time
series, sequence and text data poses an important research
area in field of data mining. There is still a huge gap

Вам также может понравиться