Академический Документы
Профессиональный Документы
Культура Документы
PRESENTED BY : SIDDHARTH KHARE TANMAY NAGAR AMAN SINGH VIPLAV VINOD TARUN KEWALRAMANI
DATA MINING..?
hold on new data with some certainty novel: non-obvious to the system useful: should be possible to act on the item understandable: humans should be able to interpret the pattern
Knowledge Discovery Concrete information gleaned from known data. Data you may not have known, but which is supported by recorded facts.
Knowledge Prediction Uses known data to forecast future trends, events, etc. (i.e: Stock market predictions) E.g. neural networks are inherently geared towards prediction and pattern recognition.
Databases
TYPES OF DATA
RAW DATA META DATA PREPOSITIONAL DATA RELATIONAL DATA
Predictive:
Regression Classification Collaborative
Filtering
Descriptive:
Clustering
Training Data: used to build the model Test data: used to validate the model (determine accuracy of the model) Given data is usually divided into training and test sets.
predicting
sales volumes of new product based on advertising expenditure Time series prediction of stock market indices.
Clustering : Given a set of data points, each having a set of attributes, and a similarity measure among them, find clusters such that
data
points in one cluster are more similar to one another data points in separate clusters are less similar to one another.
Association Rule : Given a set of records, each of which contain some number of items from a given collection
produce
dependency rules which will predict occurrence of an item based on occurences of other items
dimensionality
Data
reduction, transformations
mining
selecting selecting
Testing
Knowledge
Raw Data
__ __ __ __ __ __ __ __ __
Understanding
SOME APPLICATIONS
Search Engine : Google success is due to its algorithm which uses mainly links to the page Diagnostics : Helps to predict the rate of molecule generation within the body to note any abnormal symptoms.
Molecular
Direct
Marketing and CRM : Most major direct marketing companies are using modeling and data mining Most financial companies are using customer modeling Modeling is easier than changing customer behaviour
at a Retail Store :
Identify customer buying behaviors Discover customer shopping patterns and trends Improve the quality of customer service Achieve better customer retention and satisfaction Enhance goods consumption ratios Design more effective goods transportation and distribution policies
FAIS
Securities Fraud
NASDAQ
Phone fraud
AT&T,
SGI Mine Set IBM Intelligent Miner SAS Enterprise Miner Microsoft SQL Server 2000 DB Miner (DB Miner Technology Inc.)
Data mining is at Chasm!? Existing data mining systems are too generic Need business-specific data mining solutions and smooth integration of business logic with data mining functions
CONCLUSION
There is always some pitfalls in every technology..just depends on the intension in which it is used .it can no doubt gives a cutting edge to the organizations.