Running head: DATA MINING/TEXT MINING FOR KNOWLEDGE DISCOVERY

TERM PAPER
ON
DATA MINING/TEXT MINING FOR KNOWLEDGE DISCOVERY
Suraj Dahal
Sagar Shrestha
Knowledge Management
Submitted to: Mr. Sanjay Pudasaini
June 21, 2015

Introduction
Data mining is one of the most important steps of the knowledge discovery in databases
process and is considered a significant subfield of knowledge management. Research in data
mining is expected to keep growing in business and in learning organizations over the coming
decades.
In the information era, knowledge is becoming a crucial organizational resource that provides
competitive advantage, which has given rise to knowledge management (KM) initiatives. Many
organizations have collected and stored vast amounts of data. However, they are often unable to
discover the valuable information hidden in that data and transform it into useful knowledge.
Data mining is the process of discovering meaningful new correlations, patterns and
trends by sifting through large amounts of data stored in repositories, using pattern recognition
technologies as well as statistical and mathematical techniques. Data mining is sometimes called
data or knowledge discovery. It is an essential step in the knowledge discovery in databases
(KDD) process that produces useful patterns or models from data. The terms KDD and data
mining are not interchangeable. KDD refers to the overall process of discovering useful knowledge
from data. Data mining refers to discovering new patterns in a wealth of data in databases by
focusing on the algorithms that extract useful knowledge. The overall goal of the data mining
process is to extract information from a data set and transform it into an understandable
structure for further use. Aside from the raw analysis step, it involves database and data
management aspects, data pre-processing, model and inference considerations, interestingness
metrics, complexity considerations, post-processing of discovered structures, visualization, and
online updating. (Maimon & Rokach, 2010)
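The idea of an interestingness metric can be illustrated with support and confidence, two common measures for association rules. The following is only a minimal sketch under stated assumptions, not the method of any cited author: the transactions are invented toy data, and the function name pair_rules and the min_support threshold are hypothetical choices made for illustration.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy transactions, invented purely for illustration.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]


def pair_rules(transactions, min_support=0.5):
    """Return (antecedent, consequent, support, confidence) for frequent item pairs."""
    n = len(transactions)
    item_counts = Counter(item for t in transactions for item in t)
    pair_counts = Counter(
        pair for t in transactions for pair in combinations(sorted(t), 2)
    )
    rules = []
    for (a, b), count in pair_counts.items():
        support = count / n  # share of transactions containing both items
        if support < min_support:
            continue
        # Each frequent pair yields two candidate rules: a -> b and b -> a.
        # Confidence is the share of antecedent transactions that also hold the consequent.
        rules.append((a, b, support, count / item_counts[a]))
        rules.append((b, a, support, count / item_counts[b]))
    return sorted(rules)


rules = pair_rules(transactions)
for antecedent, consequent, support, confidence in rules:
    print(f"{antecedent} -> {consequent}: support={support:.2f}, confidence={confidence:.2f}")
```

In this toy data, rules that pass the support threshold are candidate patterns; whether they count as useful knowledge still depends on interpretation by someone who understands the data.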

Text mining has become an important research area. Text mining is the discovery by
computer of new, previously unknown information, by automatically extracting information from
different written resources. A key element is the linking together of the extracted information
to form new facts or new hypotheses to be explored further by more conventional means
of experimentation. Text mining is different from what we are familiar with in web search. In
search, the user is typically looking for something that is already known and has been written by
someone else. The problem is pushing aside all the material that is not currently relevant to the
user's needs in order to find the relevant information. In text mining, the goal is to discover
unknown information, something that no one yet knows and so could not yet have written down.
(Maimon & Rokach, 2010)
Text mining usually involves structuring the input text (usually parsing,
along with the addition of some derived linguistic features and the removal of others, and
subsequent insertion into a database), deriving patterns within the structured data, and finally
evaluating and interpreting the output.
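The three stages described above (structuring the text, deriving patterns, and evaluating the output) can be sketched in a few lines. This is a simplified illustration under stated assumptions rather than a real text mining system: the mini-corpus, the stop-word list, and the function names structure and cooccurrences are all invented, and term co-occurrence stands in for the richer patterns a real tool would derive.

```python
import re
from collections import Counter
from itertools import combinations

# Hypothetical mini-corpus and stop-word list, invented for illustration.
documents = [
    "The battery drains fast after the update.",
    "After the update the battery gets hot.",
    "The screen is bright and sharp.",
]
STOP_WORDS = {"the", "is", "and", "a", "after"}


def structure(text):
    """Stage 1: parse raw text into a set of lowercase terms, dropping stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return {t for t in tokens if t not in STOP_WORDS}


def cooccurrences(docs):
    """Stage 2: count how many documents each pair of terms appears in together."""
    counts = Counter()
    for doc in docs:
        counts.update(combinations(sorted(structure(doc)), 2))
    return counts


pair_counts = cooccurrences(documents)

# Stage 3: keep only pairs seen in more than one document as candidate patterns
# for a human analyst to evaluate and interpret.
patterns = {pair: n for pair, n in pair_counts.items() if n > 1}
print(patterns)
```

Here the only pair that recurs across documents links the battery to the update, which is the kind of connection an analyst might then investigate further.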

Literature Review
Data mining and knowledge discovery in databases have been attracting a significant
amount of research, industry, and media attention of late (Maimon & Rokach, 2010). There is an
urgent need for a new generation of computational theories and tools to assist researchers in
extracting useful information from the rapidly growing volumes of digital data.
Data mining techniques provide a popular and powerful tool set for generating various
data-driven classification systems. Leonid Churilov, Adyl Bagirov, Daniel Schwartz, Kate Smith and
Michael Dally studied the combined use of self-organizing maps and nonsmooth,
nonconvex optimization techniques in order to produce a working case of a data-driven risk
classification system. (Maimon & Rokach, 2010)
Anthony Danna and Oscar H. Gandy develop a more comprehensive understanding of
data mining by examining the application of this technology in the marketplace. As more firms
shift more of their business activities to the web, increasingly more information about consumers
and potential customers is being captured in web server logs. They also examine issues related
to social policy that arise as the result of convergent developments in e-business technology
and corporate marketing strategies.

Benefits to Organizations
Data mining is intended to provide support in complex situations that are data rich but
information poor. Organizations often transform raw data into functional
knowledge as part of their knowledge management initiatives. Data mining is then used to
translate structured data into knowledge, where it can make an essential contribution to a
knowledge management effort. (Maimon & Rokach, 2010)
Data mining for knowledge discovery can benefit organizations in many sectors. In the
medical and health care sector, for example, many healthcare leaders find themselves overwhelmed
with data but lacking the information needed for sound decision making. Data mining for
knowledge discovery can help organizations turn their data into information. Organizations that
take advantage of these techniques will find that healthcare costs can be lowered while
healthcare quality improves through faster and better clinical decision making.
Clinical decisions are often based on doctors' intuition and experience, i.e. tacit
knowledge, rather than on the knowledge-rich data hidden in the database. This practice leads to
unwanted biases, errors and excessive medical costs, which affect the quality of service provided
to patients. Integrating data mining for knowledge discovery tools with the Electronic Health
Record (EHR) could reduce medical errors, enhance patient safety, decrease unwanted practice
variation, and improve patient outcomes. The EHR is only a first step in capturing and utilizing
health-related data; the problem is turning that data into useful information. Models produced via
data mining and predictive analysis can form the backbone of Clinical Decision Support Systems
(CDSS). (Sharma, 2014)

Data mining techniques can be used for knowledge management in technology-enhanced
learning, which would lead to better performance and a better understanding of e-learning
participants' behavior. Knowledge discovery and data mining is an interdisciplinary area focusing
on methodologies for extracting useful knowledge from data. The ongoing rapid growth of online
data due to the Internet and the widespread use of databases has created an immense need for
data mining for knowledge discovery methodologies. (Sharma, 2014)
Applying data mining techniques for knowledge discovery in the university
domain, especially to log files, can help to understand visitors' needs. Examples include
modifying a web page to better fit the user, creating web pages that are unique to each user, and
using a user's preferences to determine which documents to retrieve. This knowledge could lead
to better placement of courses on the web site and to better education strategies.
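As a rough illustration of mining log files for visitor needs, the sketch below counts page requests overall and per visitor. It is only a sketch under stated assumptions: the log format ("visitor id, then requested path") is a hypothetical simplification of what real web servers record, and the sample lines, paths, and the function name mine_log are invented.

```python
from collections import Counter, defaultdict

# Hypothetical, simplified log lines: "<visitor_id> <requested_path>".
# Real server logs carry more fields; this format is an assumption.
log_lines = [
    "u1 /courses/km",
    "u2 /courses/km",
    "u1 /library",
    "u3 /courses/km",
    "u2 /admissions",
    "u1 /courses/km",
]


def mine_log(lines):
    """Count requests per page overall and per visitor."""
    page_hits = Counter()
    visitor_pages = defaultdict(Counter)
    for line in lines:
        visitor, path = line.split(maxsplit=1)
        page_hits[path] += 1
        visitor_pages[visitor][path] += 1
    return page_hits, visitor_pages


page_hits, visitor_pages = mine_log(log_lines)

# The most requested page overall is a candidate for more prominent placement.
top_page, top_count = page_hits.most_common(1)[0]

# Each visitor's most requested page hints at what a personalized page could show.
favourite = {v: pages.most_common(1)[0][0] for v, pages in visitor_pages.items()}

print(top_page, top_count)
print(favourite)
```

Counts like these are only the raw material; deciding where a course page should sit or what a personalized page should contain remains a judgment for the people running the site.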
Text mining for knowledge discovery can benefit any organization in situations
such as when a company seeks feedback on a particular product or service. For example,
hashtag trends on social media sites have made it easy for companies to learn their
customers' views of their products and services, which can drive improvements and help them
provide products and services that meet customers' needs.

Conclusion and Recommendations
Data mining offers great promise in helping organizations uncover patterns hidden in
their data that can be used to predict the behavior of customers, products and processes.
However, data mining tools need to be guided by users who understand the business, the data,
and the general nature of the analytical methods involved. (Introduction to Data Mining and
Knowledge Discovery, Third Edition, 1999)
The amount of data stored in the databases of organizations has increased rapidly.
It has become very important for organizations to find the relevant patterns and valuable
information in these large databases. By exploring this data, organizations can
manage their knowledge systems and gain competitive advantage over other
organizations. Managing the knowledge system facilitates the decision making process of the
organization. Given the importance of knowledge in the 21st century, data
mining can be seen as a very effective tool for exploring essential data to create competitive
advantage in a changing environment.

Bibliography
Introduction to Data Mining and Knowledge Discovery, Third Edition. (1999). Potomac: Two
Crows Corporation.
Maimon, O., & Rokach, L. (2010). Data Mining and Knowledge Discovery Handbook: Second
Edition. New York: Springer.
Sharma, M. (2014, February). Data Mining: A literature Survey. pp. 1-4.
