Академический Документы
Профессиональный Документы
Культура Документы
BI applications include the activities of decision support systems, query and reporting,
online analytical processing (OLAP), statistical analysis, forecasting, and data mining.
3rd Slide:
To be more comprehensive:
Advantage of BI:
Most companies collect a large amount of data from their business operations. To keep
track of this information, a business would need to use a wide range of software
programs such as Excel, Access different database applications for various departments
throughout their organization. Using multiple software programs makes it difficult to
retrieve information in a timely manner and to perform analysis of the data.
The term Business Intelligence (BI) represents the tools and systems that play a key role
in the strategic planning process of the corporation. These systems allow a company to
gather, store, access and analyze corporate data to aid in decision-making.
BI to a common man:
Business intelligence is not business as usual. It's about making better decisions easier
and making them more quickly. Businesses collect enormous amounts of data every
day--information about orders, inventory, accounts payable, point-of-sale transactions,
and of course, customers. Businesses also acquire data, such as demographics and
mailing lists, from outside sources. Unfortunately, based on a recent survey, over 93% of
corporate data is not usable in the business decision-making process today.
Consolidating and organizing data for better business decisions can lead to a competitive
advantage, and learning to uncover and leverage those advantages is what business
intelligence is all about.
6th slide:
Inevitably the first question that arises when describing the objectives of a business
intelligence system is, "Doesn't a data warehouse have the same objectives and provide
the same capabilities as a business intelligence system?" A similar question arose when
data warehouses were first introduced, "Isn't a data warehouse similar to the corporate
information systems and information centers we built in the past?"
Although a quick and simple answer to both questions is yes, closer examination shows
that in the same way that there are important differences between a warehouse and
early corporate information systems and information centers, there are also important
differences between a business intelligence system and a data warehouse.
7th slide:
Early ways:
o Early business information systems employed batch applications to provide
business users with the information they needed. The output from these
applications typically involved huge volumes of paper that users had to wade
through to get the answers they needed to business questions. The advent of
terminal-driven time-sharing applications provided more rapid access to
information, but these systems were still cumbersome to use, and required
access to complex operational databases.
Users of first generation Business Information System:
o This first generation of business information systems could only be used by
information providers, such as business analysts, who had an intimate
knowledge of the data and extensive computer experience.
HERE POINT TO BE NOTED IS THAT
o Information consumers, like business executives and business managers, could
rarely use these early systems, and instead had to rely on information providers
to answer their questions and supply them with the information they needed
8th slide:
Definition:
o A data warehouse is a repository of an organization's electronically stored data.
Data warehouses are designed to facilitate reporting and analysis.
Essential components:
o means to retrieve and analyze data, to extract, transform and load data, and to
manage the data dictionary are also considered essential components of a data
warehousing system.
History:
o The concept of data warehousing dates back to the late 1980s [2] when IBM
researchers Barry Devlin and Paul Murphy developed the "business data
warehouse". In essence, the data warehousing concept was intended to provide
an architectural model for the flow of data from operational systems to decision
support environments. The concept attempted to address the various problems
associated with this flow, mainly the high costs associated with it.
Key developments in early years of data warehousing were:
1) Warehouses are designed to satisfy the needs of business users and not day-to-day
operational applications.
2) Data Warehouse information is clean and consistent, and is stored in a form business
users can understand.
3) Unlike operational systems, which contain only detailed current data, warehouses can
supply both historical and summarized information.
4) The use of client/server computing provides data warehouse users with improved user
interfaces and more powerful decision support tools.
10th slide:
Data Integration
Definition:
o The ability to transform corporate data into meaningful and actionable
information is called data integration.
Harnessing the data explosion to better understand the past and get direction for the future has
turned out to be one of the most challenging ventures for enterprise Information Technology
departments in global organization.
There are three broad categories of issues associated with data integration:
o -Technology challenge
Multiple sources with different formats.
Structured, Semi-Structured, and Unstructured data.
Data feeds on source system arriving at different times.
Huge Data volumes.
o -Organizational issues:
There are two Broad Issues with data integration in a large organisation.
They are:
Power Challenges:
Data is power and is usually very hard to make people think of
data as real valuable shared assets of a company. Lack of
cooperation from the relevant party is one f the major reason
for the failure of data integration projects.
Comfort Zone Challenge:
About 60% of data Integration is solved by hand coding. The
technology used to solve similar problems can range frm
replication, ETL, SQL, to EAI. People gravitate towards the
technology they are familiar with.
o -Economic challenges:
Getting the data out in the format that is necessary for data integration
end up being a slow and torturous process fraught with organizational
power games.
Cleansing the data and mapping the data from multiple sources into one
coherent and meaningful format is extraordinarily difficult.
More often then not, standard data integration tools don’t offer enough
functionality or extensibility to satisfy the data transformation
requirements for the project. This can result in expenditure of large sum
of money in consulting cost to develop special ETL code to get the job
done.
Different parts in a organization focus on data integration problem in
silos.
13th slide:
Data Analysis:
Definition:
o Analysis of data is a process of inspecting, cleaning, transforming, and
modelling data with the goal of highlighting useful information, suggesting
conclusions, and supporting decision making.
General Description:
o Microsoft SQL Server 2005 Analysis Services delivers online analytical processing
(OLAP) and data mining functionality for business intelligence applications.
Analysis Services supports OLAP by letting you design, create, and manage
multidimensional structures that contain data aggregated from other data
sources, such as relational databases. For data mining applications, Analysis
Services lets you design, create, and visualize data mining models that are
constructed from other data sources by using a wide variety of industry-
standard data mining algorithms.
Process of data analyses:
o Data cleaning:
The most important distinction between the initial data analysis phase
and the main analysis phase, is that during initial data analysis one
refrains from any analysis that are aimed at answering the original
research question that are quality of data, quality of measurement, initial
transformation, did the implementation of the study fulfil the intention of
the research design?
o Main data analysis (answer the original research question):
Additional analyses are made and a report of the entire process od data
analyses is prepared in this step.
15th slide:
Data Reporting:
16th slide:
Data Mining:
Definition:
o The process of extracting valid, authentic, and actionable information from large
databases.
o Explanation: In other words, data mining derives patterns and trends that exist
in data. These patterns and trends can be collected together and defined as a
mining model.
Tasks in data mining:
o Classification - Arranges the data into predefined groups. For example an email
program might attempt to classify an email as legitimate or spam. Common
algorithms include Decision Tree Learning, Nearest neighbor, naive Bayesian
classification and Neural network.
o Clustering - Is like classification but the groups are not predefined, so the
algorithm will try to group similar items together.
o Regression - Attempts to find a function which models the data with the least
error.
o Association rule learning - Searches for relationships between variables. For
example a supermarket might gather data on customer purchasing habits. Using
association rule learning, the supermarket can determine which products are
frequently bought together and use this information for marketing purposes. This
is sometimes referred to as market basket analysis.
Notable Usage of data mining:
o Games:
Since the early 1960s, with the availability of oracles for
certain combinatorial games, also called tablebases (e.g. for 3x3-chess)
with any beginning configuration, small-board dots-and-boxes, small-
board-hex, and certain endgames in chess, dots-and-boxes, and hex; a
new area for data mining has been opened up. This is the extraction of
human-usable strategies from these oracles,
o Business:
Data mining in customer relationship management applications can
contribute significantly to the bottom line.[citation needed] Rather than randomly
contacting a prospect or customer through a call center or sending mail,
a company can concentrate its efforts on prospects that are predicted to
have a high likelihood of responding to an offer. More sophisticated
methods may be used to optimise resources across campaigns so that
one may predict which channel and which offer an individual is most
likely to respond to — across all potential offers
Additionally, sophisticated applications could be used to automate the
mailing. Once the results from data mining (potential prospect/customer
and channel/offer) are determined, this "sophisticated application" can
either automatically send an e-mail or regular mail.
17th slide:
Data Mining can be defined by using the following six basic steps:
2. Preparing Data.
3. Exploring Data.
4. Building Models.
18th slide:
Diagrammatic view of the above steps….explain the above points using the diagram.
19th slide:
Conclusion:
CONCLUSION:
o Microsoft SQL Server 2005 provides the tools that developers need to build new
classes of database applications. By removing the barriers to code execution and
storage location, and by integrating standards such as XML, SQL Server 2005
opens up a world of possibilities to the database developer. This is only an brief
and relevant to project introduction to what you will be able to do with SQL
Server 2005.
20th slide:
Bibliography:
o www.technet.microsoft.com
o www.wikipedia.org
o www.google.co.in