Академический Документы
Профессиональный Документы
Культура Документы
A class of database applications that look for hidden patterns in a group of data that can be used to predict future behavior. For example, data mining software can help retail companies find customers with common interests. The term is commonly misused to describe software that presents data in new ways. True data mining software doesn't just change the presentation, but actually discovers previously unknown relationships among the data. Data mining is popular in the science and mathematical fields but also is utilized increasingly by marketers trying to distill useful consumer data from Web sites.
Introduction to Data Mining http://www-pcc.qub.ac.uk/tec/courses/datamining/stu_notes/dm_book_1.html Information about data mining research, applications, and tools: http://info.gte.com/kdd/ http://www.kdnuggets.com http://www.ultragem.com/ http://www.cs.bham.ac.uk/~anp/TheDataMine.html http://www.think.com/html/data_min/data_min.htm
Sites for datamining vendors and products: American Heuristics (Profiler) http://www.heuristics.com Angoss software (Knowledge Seeker) http://www.angoss.com Attar Software (XpertRule Profiler) http://www.attar.com Business Objects (BusinessMiner) http://www.businessobjects.com DataMind (DataMind Professional) http://www.datamind.com HNC Software (DataMarksman, Falcon) http://www.hncs.com HyperParallel (Discovery)
http://www.hyperparallel.com Information Discovery Inc. (Information Discovery System) http://www.datamining.com Integral Solutions (Clementine) http://www.isl.co.uk/index.html IBM (Intelligent Data Miner) http://www.ibm.com/Stories/1997/04/data1.html Lucent Technologies (Interactive Data Visualization) http://www.lucent.com NCR (Knowledge Discovery Benchmark) http://www.ncr.com NeoVista Sloutions (Decision Series) http://www.neovista.com Nestor (Prism) http://www.nestor.com Pilot Software (Pilot Discovery Server) http://www.pilotsw.com Seagate Software Systems (Holos 5.0) http://www.holossys.com SPSS (SPSS) http://www.spss.com Thinking Machines (Darwin)
http://www.think.com
event (i.e. a combination of items), association rule tells if item X is a part of the event, then what is the percentage of item Y is also part of the event.
Clustering
Clustering is often used to find appropriate groupings of elements for a set of data. Unlike decision trees we discuss above, clustering is a kind of undirected knowledge discovery or unsupervised learning; that is, there is no target field, and the relationship among the data is identified by bottom-up approach.
Decision Trees
Decision Trees performs classification by constructing a tree based on training instances with leaves having class labels. The tree is traversed for each test instance to find a leaf, and the class of the leaf is the predicted class. This is a directed knowledge discovery in the sense that there is a specific field whose value we want to predict.
Neural Network
Neural network is often represented as a layered set of interconnected processors. These processor nodes are frequently referred as neurodes so as to indicate a relationship with the neurons of the brain. Each node has a weighted connection to several other nodes in adjacent layers. Individual nodes take the input received from connected nodes and use the weights together to compute output values. More details please refer following sites: