
DATA MINING

A class of database applications that look for hidden patterns in a group of data that can be used to predict future behavior. For example, data mining software can help retail companies find customers with common interests. The term is commonly misused to describe software that presents data in new ways. True data mining software doesn't just change the presentation; it discovers previously unknown relationships among the data. Data mining is popular in scientific and mathematical fields and is increasingly used by marketers trying to distill useful consumer data from Web sites.

Explore Further on the Internet

Introduction to Data Mining:
http://www-pcc.qub.ac.uk/tec/courses/datamining/stu_notes/dm_book_1.html

Information about data mining research, applications, and tools:
http://info.gte.com/kdd/
http://www.kdnuggets.com
http://www.ultragem.com/
http://www.cs.bham.ac.uk/~anp/TheDataMine.html
http://www.think.com/html/data_min/data_min.htm
http://direct.boulder.ibm.com/bi/
http://www.software.ibm.com/data/
http://coral.postech.ac.kr/~swkim/software.html
http://www.cs.uah.edu/~infotech/mineproj.html
http://info.gte.com/~kdd/index.html
http://info.gte.com/~kdd/siftware.html
http://iris.cs.uml.edu:8080/
http://www.datamining.com/datamine/welcome.htm

Data Sets to test data mining algorithms: http://www.scs.unr.edu/~cbmr/research/data.html

Data mining journal (Read Usama M. Fayyad's editorial.): http://www.research.microsoft.com/research/datamine/

Interesting application of data mining: http://www.nba.com/allstar97/asgame/beyond.html

Data mining papers:
http://www.satafe.edu/~kurt/index.shtml
http://www.cs.bham.ac.uk/~anp/papers.html
http://coral.postech.ac.kr/~swkim/old_papers.html

Data mining conferences:
http://www-aig.jpl.nasa.gov/kdd97
http://www.cs.bahm.ac.uk/~anp/conferences/html

Conference on very large databases: http://www.vldb.com/homepage.htm

Sites for data mining vendors and products:
American Heuristics (Profiler): http://www.heuristics.com
Angoss Software (Knowledge Seeker): http://www.angoss.com
Attar Software (XpertRule Profiler): http://www.attar.com
Business Objects (BusinessMiner): http://www.businessobjects.com
DataMind (DataMind Professional): http://www.datamind.com
HNC Software (DataMarksman, Falcon): http://www.hncs.com
HyperParallel (Discovery): http://www.hyperparallel.com
Information Discovery Inc. (Information Discovery System): http://www.datamining.com
Integral Solutions (Clementine): http://www.isl.co.uk/index.html
IBM (Intelligent Data Miner): http://www.ibm.com/Stories/1997/04/data1.html
Lucent Technologies (Interactive Data Visualization): http://www.lucent.com
NCR (Knowledge Discovery Benchmark): http://www.ncr.com
NeoVista Solutions (Decision Series): http://www.neovista.com
Nestor (Prism): http://www.nestor.com
Pilot Software (Pilot Discovery Server): http://www.pilotsw.com
Seagate Software Systems (Holos 5.0): http://www.holossys.com
SPSS (SPSS): http://www.spss.com
Thinking Machines (Darwin): http://www.think.com

What is Data Mining?


Data mining, or knowledge discovery, is the computer-assisted process of digging through and analyzing enormous sets of data and then extracting the meaning of the data. Data mining tools predict behaviors and future trends, allowing businesses to make proactive, knowledge-driven decisions. They can answer business questions that traditionally were too time-consuming to resolve. They scour databases for hidden patterns, finding predictive information that experts may miss because it lies outside their expectations.

Definition of 'Data Mining'


A process used by companies to turn raw data into useful information. By using software to look for patterns in large batches of data, businesses can learn more about their customers and develop more effective marketing strategies as well as increase sales and decrease costs. Data mining depends on effective data collection and warehousing as well as computer processing.

What is the current state?


Techniques in Data Mining

Association Rule
Association discovers interesting relationships between attributes contained in a database. This technique is also known as market basket analysis. Based on frequency counts of the items that occur in an event (i.e., a combination of items), an association rule states that if item X is part of the event, then item Y is also part of the event in a given percentage of cases.
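The percentage described above is usually called the confidence of the rule "X implies Y". A minimal sketch of that frequency count follows; the basket data and the function name `confidence` are made up for illustration and do not come from any particular tool.

```python
def confidence(transactions, x, y):
    """Fraction of the transactions containing x that also contain y."""
    with_x = [t for t in transactions if x in t]
    if not with_x:
        return 0.0  # the rule never fires, so report zero
    return sum(1 for t in with_x if y in t) / len(with_x)


# Hypothetical market baskets: each set is one event (a combination of items).
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]

# Bread appears in three baskets, and milk is in two of those three.
print(f"bread -> milk: {confidence(baskets, 'bread', 'milk'):.0%}")
```

Real market basket tools typically also filter rules by how often X and Y occur together at all, so that a rule seen in only a handful of events is not reported.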

Clustering
Clustering is often used to find appropriate groupings of elements in a set of data. Unlike decision trees (discussed below), clustering is a kind of undirected knowledge discovery, or unsupervised learning; that is, there is no target field, and the relationships among the data are identified by a bottom-up approach.
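As an illustration of grouping without a target field, here is a small sketch of k-means on one-dimensional data. The text does not name a specific clustering algorithm; k-means is assumed here only because it is short, and the points and starting centroids are invented.

```python
def kmeans_1d(points, centroids, iterations=10):
    """Assign each point to its nearest centroid, then move each centroid
    to the mean of its assigned points; repeat a fixed number of times."""
    clusters = [[] for _ in centroids]
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # An empty cluster keeps its previous centroid.
        centroids = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids, clusters


points = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]      # hypothetical data, no labels
centers, groups = kmeans_1d(points, [1.0, 12.0])  # two assumed starting centroids
print(centers, groups)
```

No field in the data says which group a point belongs to; the grouping emerges only from the distances between points.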

Decision Trees
A decision tree performs classification by constructing a tree from training instances, with leaves that carry class labels. The tree is traversed for each test instance until a leaf is reached, and the class of that leaf is the predicted class. This is directed knowledge discovery in the sense that there is a specific field whose value we want to predict.
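The traversal step can be sketched as follows. The tree here is written by hand rather than learned from training instances, and its attribute names and class labels are hypothetical; the sketch shows only how a test instance is routed to a leaf.

```python
# Internal nodes are dicts naming the attribute to test; leaves are class labels.
tree = {
    "attribute": "outlook",
    "branches": {
        "sunny": {
            "attribute": "humidity",
            "branches": {"high": "no", "normal": "yes"},
        },
        "overcast": "yes",
        "rainy": {
            "attribute": "windy",
            "branches": {True: "no", False: "yes"},
        },
    },
}


def classify(node, instance):
    """Follow the branch matching the instance's attribute value until a leaf."""
    while isinstance(node, dict):
        value = instance[node["attribute"]]
        node = node["branches"][value]
    return node  # the leaf's class label is the prediction


print(classify(tree, {"outlook": "sunny", "humidity": "normal"}))
```

The class label ("yes" or "no") plays the role of the target field that makes this a directed technique.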

Neural Network
A neural network is often represented as a layered set of interconnected processors. These processor nodes are frequently referred to as neurodes, to indicate a relationship with the neurons of the brain. Each node has a weighted connection to several other nodes in adjacent layers. Individual nodes take the input received from connected nodes and combine it with the weights to compute output values. For more details, refer to the following sites:

http://www-3.ibm.com/software/data/iminer/db2.html
http://www.sas.com/technologies/analytics/datamining/miner/
http://otn.oracle.com/products/bi/odm/odmining.html
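To make the node computation in a neural network concrete, the sketch below computes the outputs of one layer: each node forms a weighted sum of its inputs and passes it through an activation function. The sigmoid activation, the bias terms, and all the numbers are assumptions chosen for illustration; the text does not specify them.

```python
import math


def sigmoid(x):
    """Squash a weighted sum into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def layer_output(inputs, weights, biases):
    """Compute one output per node: sigmoid(weighted sum of inputs + bias).

    weights holds one row per node, with one weight per incoming connection.
    """
    return [
        sigmoid(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]


# Hypothetical network: 2 inputs -> 2 hidden nodes -> 1 output node.
hidden = layer_output([0.5, -1.0], [[0.8, 0.2], [-0.4, 0.9]], [0.0, 0.1])
output = layer_output(hidden, [[1.5, -2.0]], [0.0])
print(hidden, output)
```

Training a network consists of adjusting the weights so that the outputs match known examples; this sketch covers only the forward computation.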
