Академический Документы
Профессиональный Документы
Культура Документы
Abstract
Data warehouse modeling is a complex task, which involves knowledge of business processes of the domain of discourse,
understanding the structural and behavioral system’s conceptual model, and familiarity with data warehouse technologies.
The suitability of current data warehouse modeling methods for large-scale systems is questionable, as they require
multiple manual actions to discover measures and relevant dimensional entities and they tend to disregard the system’s
dynamic aspects. We present an Object-process-based Data Warehouse Construction (ODWC) method that overcomes
these limitations of existing methods by utilizing the operational system conceptual model to construct a corresponding
data warehouse schema. We specify the ODWC method, apply it on a case study, evaluate it, and compare it to existing
methods.
r 2008 Elsevier B.V. All rights reserved.
Keywords: Data warehousing; Data models; Database design; Conceptual modeling; Object-Process Methodology
0306-4379/$ - see front matter r 2008 Elsevier B.V. All rights reserved.
doi:10.1016/j.is.2008.02.002
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
2 D. Dori et al. / Information Systems ] (]]]]) ]]]–]]]
physical data models and on-line analytical proces- serves as the basis for the data warehouse design.
sing (OLAP) foundation. Yet, conceptual and 2. Incorrect data warehouse designs might be
logical design of the data warehouse remains a generated if the designer does not understand
complicated human-intensive task, which is as- the underlying relationships among the various
signed to systems engineers and analysts of the data items.
organization that own and run these data ware- 3. Premature aggregation often causes loss of
houses. information that limits the options to analyze
Data warehouse (DW) design usually follows the the data in future desired fine-grained resolutions.
Star schema diagram [2]. A Star schema consists of
a fact table and a single de-normalized dimension These drawbacks have motivated studies that
table for each dimension of the corresponding data focus on using operational system conceptual
model. The dimension tables can be normalized to models as a basis for data warehouse modeling,
create a Snowflake schema, which can support design, and construction. There are two major
attribute hierarchies. Both the Star and the Snow- approaches for utilizing an operational system
flake schemata present data as a single cube, which model for the construction of data warehouse
is the basic data warehouse structure that allows models: the structure-based approach and the
users to perform multidimensional data analyses. process-based approach.
Several conceptual data warehouse models have The structure-based approach is motivated by the
been developed to support the design of data understanding that since data warehouses depend
warehouses [5–7]. While these approaches assist in on the respective underlying operational systems as
data warehouse modeling, they address implemen- their data supply sources, the latter become
tation issues only partially by creating or manip- important for the conceptual and logical design of
ulating the Star or Snowflake architectures. For a data warehouse. The techniques that follow this
example, the Object Relational View (ORV) approach (for example, [1,5,14–19]) use mainly the
approach [8], which is an object-oriented data entity-relationship concept [20] as the basis for the
warehouse design framework, is obtained by a proposed solution. These techniques suffer from
transformation of the Snowflake schema. Arguably, the following limitations: (1) the data warehouse
ORV can lead to better system performance designer is assumed to be familiar with the
through object links instead of the more expensive organization’s business processes, (2) the behavioral
joins. After the transformation, the object-oriented aspects of the system are ignored, and (3) multiple
schema has to be partitioned in order to reduce the manual transformations are needed to obtain a data
number of joins and disk accesses. UML [9] warehouse model.
extensions for data warehouse conceptual modeling The process-based approach stems from the
have also been proposed [10,11]. OMG suggested understanding that the fundamental role of data
the Common Warehouse Metamodel (CWM) warehousing is to provide business performance
[12,13], an approach to metadata interchange, measurement and redesign support [21,22]. With
which is capable of expressing major multidimen- this in mind, several techniques were offered to
sional properties [10]. create data warehouse models from business process
Most data warehouse modeling techniques refer models. Only one model, discussed in [23], creates a
mainly to the user requirements [2] and advocate data warehouse from relatively generic business
building a new multidimensional model regardless process models [24], and it can be applied to various
of existing models of the underlying operational content areas. However, since this technique is
systems. This approach raises several problems, as manual and does not relate to the operational
stated by [14]: system structure, the Extract, Transform, and Load
(ETL) process is complicated. Another transforma-
1. User requirements are unpredictable, often not tion technique [25], which is content-specific, is
validated, and subject to change over time, so applied to rule-based Customer Relationship Man-
they produce an unstable basis for design. One agement (CRM) systems. List et al. [22] offered a
may have better chances of foreseeing future specific data warehouse model aimed at analyzing
analysis requirements, such as the need for business workflow performance that uses a work-
additional dimensions, in case the operational flow management system as the data source. This
system schema, rather than user requirements, approach might yield infeasible data warehouse
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
D. Dori et al. / Information Systems ] (]]]]) ]]]–]]] 3
models that cannot be attained by loading data one needs to be acquainted with the business
from operational sources, because it does not processes in order to select relevant transactional
account for the data structure in the operational entities, make decisions in dimensional design, and
systems. define summarizability constraints. Nonetheless, the
A survey of the state-of-the-art methods for methods do not utilize information about the
transforming operational system conceptual models functionality of the underlying operational system.
to data warehouse models along with their evalua- With respects to handling business process analysis,
tion is presented in [26]. Based on this survey, a most methods do not adequately analyze the
comparison between the various operational sys- process perspective. They use the data of the system,
tem-based data warehouse generation techniques is but not its operational semantics or the system’s
presented in Table 1 along the various criteria and dynamic aspect, which are expressed through
their importance. Each technique is rated as good processes. Disregarding the process perspective
(G), average (A), low (L), very low (VL), or diminishes the usefulness of the model during the
irrelevant (I). While the method ratings are sub- conceptual data warehouse design, as the modeling
jective, we believe they represent the state of affairs technique can express the ultimately obtained actual
with respect to techniques of data warehouse structure of the metadata in the data warehouse, but
generation from operational system. not the information needs that might have been
The process automation criterion was divided into discovered along the process path. The methods
five sub-criteria, and with respect to this criterion, leave mapping relevant dimensional attributes to the
most of the examined methods provide some means designer. As for support for data transformation
for semi-automatic transformation from an opera- from the operational system to the data warehouse,
tional conceptual model to a data warehouse none of the methods considers Extract, Transform,
specification, but they comprise several manual and Load (ETL) issues directly. However, most
activities that the data warehouse designer needs methods derive the data warehouse entities from the
to perform. Most methods’ applicability to large- operational sources ER diagrams, implying that the
scale systems is low. The criterion required business resulting structures are reachable by transforma-
process knowledge is rated low for almost all the tions of the operational sources. While SOM utilizes
surveyed methods, since they utilize mostly struc- business process models, it is not based on actual
tural models of the operational systems. As noted, information offered by the operational sources.
Table 1
Comparison between data warehouse generation techniques
Original Operational Model E/R XML DTD SERM E/R E/R E/R SOM
([32]) ([24])
Derived DW Model Dimensional Dimensional Star Fact Various MER Star
Fact Model Fact Model Schema Schema ([6]) Schema
Process automation High
a Facts definition L L L L L H L
b. Initial model creation H H L A I H L
c. Dimensions (Hierarchies) definition L L L A A H L
d. Refinement L L L L L VL L
e. Summarizability constraints L L L A L L L
Applicability to large-scale systems High L L A L L VL A
Required business process knowledge Medium- L L L L L L A
High
Handling business process analysis High L L L L L L H
Mapping relevant dimensional attributes Low- A A A A A A A
Medium
Support for data transformation from an High H A H H H H VL
operational system to the data warehouse
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
4 D. Dori et al. / Information Systems ] (]]]]) ]]]–]]]
To address the problems of existing methods for point in time, each object is at some state, and
data warehouse generation from operational system objects are transformed (generated, consumed, or
specification, we suggest considering both the affected, i.e., their state is changed) through the
structural and behavioral aspects of the system. occurrence of a process. Things—objects or pro-
We have chosen Object-Process Methodology cesses—can be either systemic (internal to the
(OPM) as the system specification language, pri- system) or environmental (external to the system
marily because it unifies the major system aspects— and interacting with it). In an orthogonal manner,
function, structure, and behavior—within a single a thing can be either physical or informatical
view. Consequently, we propose a new DW (logical).
construction method, called ODWC—Object-pro- Unlike the object-oriented approach, behavior in
cess-based Data Warehouse Construction, which OPM is not necessarily encapsulated as an opera-
creates a multidimensional conceptual data ware- tion or method within a particular object class
house model. We automate and simplify the data construct. A process class can involve any number
warehouse design process by utilizing knowledge of object classes rather than be an operation of
about the business processes within the OPM model exactly one object class, as imposed by the object-
in order to analyze their performance. oriented encapsulation principle. By allowing for
The rest of the paper is organized as follows. In the existence of stand-alone processes, one can
Section 2 we present the Object-Process Methodol- model behavior that involves several object classes
ogy, discuss its suitability for data warehouse intuitively and in a straightforward manner rather
construction, and introduce a case study of a than having to twist reality to conform to the
trading catalog system. In Section 3 we present the encapsulation principle, as is often the case with
Object-process-based Data Warehouse Construc- OO-based modeling. Moreover, OPM enables the
tion (ODWC) method, formalize its rules, and modeler to specify that some of the involved objects
exemplify its use. We evaluate the method in Section are transformed, while others enable the process
4 and conclude in Section 5. without being transformed. The modeled behavior
is integrated into the system’s structure in a single
2. Object-Process Methodology bimodal (graphical and textual) model, so system
dynamics is represented in a balanced way beside its
Object-Process Methodology (OPM) is an inte- static composition. Processes are connected to the
grated approach to the study, development, and involved object classes through procedural links,
lifecycle support of systems in general and informa- which are classified into enabling, transformation,
tion systems in particular [27]. The basic premise of and event links.
the holistic OPM paradigm is that objects and OPM’s built-in scaling (refinement/abstraction)
processes are two types of equally important classes mechanisms, which are unfolding/folding, in-zoom-
of things, which together describe the function, ing/out-zooming, and state expression/suppression,
structure, and behavior of complex, multidisciplin- help manage the system’s complexity by controlling
ary systems in a single, domain-independent model, the visibility of details in any OPD and providing
expressed in both formal yet intuitive graphics and for creating new, interconnected OPDs at higher or
an equivalent subset of natural language. OPM lower levels of details. Unfolding/folding is applied
unifies the major system lifecycle stages—initiation, by default to objects for exposing/hiding their
development, and deployment—within one frame of structural components (parts, specializations, fea-
reference, using a single diagramming tool, a set of tures, or instances) in a hierarchical manner. In-
Object-Process Diagrams (OPDs), and a corre- zooming/out-zooming is applied by default to
sponding subset of English, called Object-Process processes for exposing/hiding their sub-process
Language (OPL). The OPM elements are presented components and details of the process execution.
in Appendix A. In OPM, existing things, their Time within an in-zoomed process flows down-
attributes, physical devices, human users, and wards, providing for the specification of the order in
environmental interfaces, are modeled as object which subprocesses are executed, so processes
classes. An object can be stateful, i.e., it can be at depicted at the same height are concurrent. The
one of a certain number of possible states. Processes third scaling mechanism, state expression/suppres-
are things that transform objects by generating or sion, enables showing or hiding all or some of the
consuming them, or by changing their state. At any states of an object class.
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
D. Dori et al. / Information Systems ] (]]]]) ]]]–]]] 5
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
6 D. Dori et al. / Information Systems ] (]]]]) ]]]–]]]
Fig. 1. The System diagram (SD) – the top-level OPD describing a Goods Purchasing process.
CASE Tool [28], the software environment that 3. The object-process-based data warehouse
supports OPM modeling. Regular Arial font (as in construction method
‘‘consists of’’) denotes reserved OPL phrases,
while bold font (as in ‘‘Purchasing Organi- Having introduced object-process, in this section
zation’’) is used for non-reserved phrases. Text in we introduce the OPM-based Data Warehouse
brackets explains the OPL terms and is not part of Construction (ODWC) method, which transforms
the automatically generated text. Note the slight the OPM operational system conceptual model into
grammatical mistakes resulting from the automatic a data warehouse schema. We present the basic
OPL sentence generation. ODWC steps, the ODWC transformation rules and
Material Catalog consists of many their rationale, the formalization of these rules, and
Materials. Purchaser, which is envir- a demonstration of their application on the case
onmental and physical, exhibits [has the study.
attribute of] Buyer [Buyer is the system represen-
tation of the physical and environmental Purcha- 3.1. The basic ODWC steps
ser]. Purchasing Organization consists
of many Buyers. Purchasing Decision The ODWC approach follows an iterative user-
exhibits Material. Many Purchase Orders guided search process. Each iteration consists of the
issued to Vendor. Many Materials are following steps:
supplied by optionally many [zero or more]
Vendors. Vendor is a default supplier of 1. The data modeler selects a business process.
optionally many Materials. Purchaser 2. The ODWC system proposes possible Snowflake
handles [is agent of] Goods Purchasing. schemata that might describe the selected busi-
Goods Purchasing requires Material, ness process. This is done in two stages: creating
Buyer, and Vendor. Goods Purchasing a basic Snowflake schema and populating facts
affects Purchasing Decision. Goods Pur- and dimensions. Each stage is carried out subject
chasing yields Stocked Goods, which to several action rules. The rules for each stage
is physical, Purchase Order, and Goods are specified in the next subsection, along with
Receipt. the motivation behind each one of them.
The other parts of the system specification appear 3. The data modeler selects the Snowflake schema
in Appendix B as OPDs along with their corre- that is most appropriate for the organization’s
spondence OPL paragraphs. data mining needs.
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
D. Dori et al. / Information Systems ] (]]]]) ]]]–]]] 7
4. Optionally, the data modeler can hierarchically enhances its ability to measure the business process
navigate to more detailed OPDs that zoom into in terms of its inputs and outputs.
lower level processes and repeat the procedure
from step 1 as many times as needed.
Stage 2: Populating facts and dimensions
Action Rule 2.1: Create a fact for each quantitative
The transformation executed by the ODWC
attribute of a transformed object.
method constructs Snowflake schemata. A Snow-
flake schema is more informative than a Star
schema, as it includes information also about Motivation: Facts in the fact table have to be
possible navigation through the cube for obtaining quantitative to enable performing statistical opera-
information in response to specific queries. A tions such as summation or means analysis on them.
corresponding Star schema can easily be obtained We use attributes of the fact entity to ensure that the
from the Snowflake schema by collapsing the granularity of all the facts in the fact table is the
dimensional hierarchies [14]. same, as required by Kimball [22]. In the framework
of an automatic schema creation, it is impossible to
3.2. ODWC transformation rules and their rationale estimate the relevance of a given quantitative
attribute as a fact. Therefore, we define all the
Stage 1: Creating a basic Snowflake schema quantitative attributes in the fact entity as candidate
Action Rule 1.1: Select a process and create an facts. Several other techniques [14,19] use a similar
empty Snowflake schema for each informatical (non- approach, while further manual assistance is re-
physical) object which is transformed (i.e., created, quired in others [15,16,18]. However, since all these
consumed, or affected) by the selected process. techniques require at least one fact in order to create
a multidimensional schema, they cannot be used to
create a factless fact table [22]. In the proposed
Motivation: One of the goals of the method is to ODWC method, creating a factless fact table is
enable business process measurement. The center of possible, since a multidimensional schema creation
any multidimensional schema, its fact table, in- is initiated (in Action Rules 1.1 and 1.2) regardless
cludes measurements (facts). A dimensional mode- of the existence of quantitative attributes in the fact
ler is supposed to ‘‘strive to store the measurement entity.
data resulting from a business process in a single Action Rule 2.2: Add all attributes of dimensional
data mart’’ [2]. In the ODWC method, a business objects into the Snowflake schema.
process is measured by its influence on the
enterprise, which is identified in OPM by transfor-
mations that these objects undergo as a result of the Motivation: We strive to create as wide dimension
process execution. tables as possible, since dimension attributes ‘‘serve
Action Rule 1.2: Create a dimension for each as the primary source of query constraints, group-
object that the process transforms and for each object ings and report labels’’ [22]. Once a dimensional
that enables the process, either as an instrument (a entity has been chosen (in Action Rule 1.2), each
non-human enabler) or an agent (a human enabler). of its attributes may serve as a meaningful
query constraint. Therefore, we include all the
attributes of the dimensional entities as dimensional
Motivation: Dimensions are supposed to include attributes.
‘‘virtually all interesting constraints and report
Action Rule 2.3: Define foreign keys in dimensional
labels’’ [20]. Methods that construct data ware-
objects as navigational attributes.
houses from data models of operational systems use
the structural relations between the fact entity and
its neighboring entities to discover dimensions Motivation: To create dimensional hierarchies that
[1,14–16,18,19]. In OPM, this detection is refined expand our schema from Star model to Snowflake
by identifying objects that contribute to the business model, we make a single pass through the structural
process and introducing procedural links between relations of the dimensional entity. We balance
the dimension entities and the chosen business between two objectives of dimensional modeling:
processes. This helps bring relevant dimensions creating deep and eminent dimensions, which are
into the constructed multidimensional scheme and vital if we are to make the data warehouse usable
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
8 D. Dori et al. / Information Systems ] (]]]]) ]]]–]]]
and understandable on one hand, and exercising as Proc is a finite set of (class) processes. Each
little snowflaking as possible on the other hand. The process is defined by the tuple (name, essence,
reason for minimizing snowflaking is that it is affiliation).
believed to create slower applications that rely on fli : Li-{effect, result, consumption, instrument,
more join operations, thereby hampering users’ agent, invocation} is a function which maps each
ability to browse within dimensions [2]. Our link from Li to its type.
compromise allows the generation of one snowflak- fre : Rel-{exhibition, generalization, aggrega-
ing level, as implemented by some commercial tion, general} is a function which maps each
products [29]. The user can still perform deeper relationship from Rel to its type
search for dimensional attributes and create more Li D(Obj Proc)[(Proc Obj)[(Proc Proc) is
snowflaked schemata by repeating this action rule. a set of procedural links.
This is possible since during the physical database RelD(Proc Obj)[(Obj Proc)[(Obj Obj)[
design, Snowflake schemata can be reduced, or even (Proc Proc) is a set of structural relationships.
transformed into de-normalized Star schemata, by In case of exhibition relationship RelD(Proc
the application of the ‘‘collapse’’ operator, as shown Obj)[(Obj Proc)[(Obj Obj)
by Moody and Kortink [14]. In case of generalization, aggregation, and
Action Rule 2.4: Add basic dimensions. general relationship RelD(Obj Obj)[(Proc
Proc).
Motivation: Basic dimensions include Time, Currency,
Li is a finite set of procedural links.
and Measurement Unit. The Currency and Measure-
In case of effect link elALiD(Obj Proc)[
(Proc Obj)
ment Unit dimensions are needed to describe cur-
rency-dependent and unit-dependent facts [29], and
In case of result link rlALiD(Proc Obj)
must therefore be added to the schema if such facts
In case of consumption link clALiD(Obj Proc)
that use them are present. The Time dimension, on the
In case of instrument link ilALiD(Obj Proc)
other hand, is needed in virtually every multidimen-
In case of agent link alALiD(Obj Proc)
sional schema [2] and is considered a key dimension in In case of invocation link inlALiD(Proc
Proc)
a data warehouse [17]. One of the Time dimension
roles is to add the historical perspective to the
analyzed data, which is vital to decision-making 3.3.2. A formal representation of a snowflake scheme
processes. Indeed, several data warehouse modeling
Definition 2:. A multidimensional data warehouse
techniques advocate addition of a time dimension
model (DW) may consist of one or more Snowflake
regardless of the presence of time attributes in the
schemata. A Snowflake schema Snowflake is speci-
chosen operational entities [15,16,19,23].
fied by a nine-tuple (Fact, Dim, Nav, Ass, Att, Key,
getKey, getQAtt, getAtt), where:
3.3. Formalizing the ODWC rules
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
D. Dori et al. / Information Systems ] (]]]]) ]]]–]]] 9
Function Definitions Action Rule 2.2: Add all the attributes of dimen-
sional objects into the Snowflake schema.
getsKey is a function that return the keys of an
OPM object or a Snowflake entity. 8Oik 2 Dimk ; 8Oj 2 System; 9ðOj ; Oik Þ
getQAtt is a function that return the quantitative 2 Rel ^ ðOj ; Oi Þ ! generalÞ ! Ojk
attributes of an OPM object or a Snowflake 2 Navk ^ Ojk Oj
entity.
getAtt is a function that return the attributes of
an OPM object or a Snowflake entity. 8Oik 2 Navk ! ðOik ; Navk Þ 2 Assk
8Oi 2 System; 8A 2 getAttðOi Þ; 8Oik
2 Navk ; 9Oi Oik ! A 2 getAttðOik Þ
3.3.3. Transformation rules
Stage 1: Creating a basic Snowflake schema Action Rule 2.3: Define foreign keys in dimen-
Action Rule 1.1: Create an empty Snowflake sional objects as navigational attributes.
schema for each informatical object transformed
(created, consumed or affected) by the selected 8Oik 2 Dimk ; 8Oj 2 System; 9ðOj ; Oik Þ
process. 2 Rel ^ ðOj ; Oi Þ ! generalÞ !
8Oi 2 OPDcurrent ; ðð9ðOi ; Pselected Þ 2 Li ^ ðOi ; Pselected Þ Ojk 2 Navk ^ Ojk Oj
! consumptionÞ_ 8Oik 2 Navk ! ðOik ; Navk Þ 2 Assk
ð9ðOi ; Pselected Þ 2 Li ^ ðOi ; Pselected Þ 8Oi 2 System; 8A 2 getAttðOi Þ; 8Oik
! effectÞ _ ð9ðPselected ; Oi Þ 2 Li ^ ðPselected ; Oi Þ 2 Navk ; 9Oi Oik ! A 2 getAttðOik Þ
! resultÞÞ ! Oik Oi ^ Oik 2 Factk ^ Factk 2 Snowflakek
Action Rule 2.4: Add basic dimensions.
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
10 D. Dori et al. / Information Systems ] (]]]]) ]]]–]]]
Table 2
Intermediate conversion stages for Purchasing Decision
ARTICLE IN PRESS
D. Dori et al. / Information Systems ] (]]]]) ]]]–]]] 11
process. A transformed object is identified via the at the same time also contributes to this object. This
OPM link semantics specified in Appendix A and exemplifies how ODWC supports the line item
in [27]. dimension [2] of a data warehouse, which is a
Snowflake schema where a dimension table is like
the fact table.
Since a data warehouse is designed to analyze the
performance of a business process, and since Stage 2: Populating facts and dimensions
processes in OPM transform objects, we place the Action Rule 2.1: Create a fact for each quantitative
transformed objects in the middle of our cube as attribute of a transformed object.
holders of the facts.
In our case study, empty Snowflakes are created
for the object Goods Receipt and for the object Quantitative (numerical or Boolean) attributes of
the objects created or modified by the analyzed
Purchase Order, both created by the Goods
process (or objects characterizing it), which were
Purchasing process, and for the object Pur-
placed in the middle of the Snowflake schema, are
chasing Decision, which is affected by that
proposed automatically as facts. The quantitative
process. Since the object Stocked Goods is
attribute for the Purchasing Decision schema-
physical, it does not trigger the creation of a
ta is Quantity (see Table 2). Quantitative attri-
corresponding Snowflake. In the sequel, we refer
butes for the Purchase Order schemata are
to the Purchase Order and the related Pur-
chasing Decision schemata. DeliveryInterval, Quantity, PayToVendor, Delivery-
TermsCode, and ActuallyPaid.
Action Rule 1.2: Create a dimension for each object
Action Rule 2.2: Add all attributes of dimensional
that the process transforms and for each object that
objects into the Snowflake schema.
enables the process (as an instrument or an agent).
Only objects linked to the process at the current level
of detail are to be taken in account. Objects that were chosen to be dimensions in
the previous stage inherit their entire attribute
In our case study, referring to the Purchase set as dimensional attributes. In both schemata,
the dimensional objects Vendor, Material,
Order and Purchasing Decision schemata,
Purchasing Decision, and Buyer transfer
Vendor, Material, Purchasing Decision
their attribute sets into the created scheme.
and Buyer are added as dimensions. Only informa-
Action Rule 2.3: Define foreign keys in dimensional
tical objects should be proposed as attributes
objects as navigational attributes.
(dimensions) of the cubes. Inventory Worker
also participates in the Goods Purchasing
process, but being a physical object, it is not Attributes of dimensional objects, which participate
included in the schema. in a structural link with other objects in the
If a physical object is an agent (a human) for the operational system conceptual model, are proposed
selected process, a corresponding informatical as navigational attributes of those objects in the
object exhibited by the physical agent (and which cube. This way, a dimensional hierarchy is created.
represents it) is proposed as a dimension. In our The dimensional object Vendor has no foreign keys
case study, Purchaser is a physical agent repre- among its attributes. Purchasing Decision, on
sented by the informatical object Buyer, which is the other hand, has Purchasing Decision
an attribute of Purchaser, as expressed in the Buyer and Purchasing Decision Material
OPD in Fig. 1 and by the corresponding OPL as foreign keys, since many-to-one relationships are
sentence ‘‘Purchaser exhibits Buyer.’’ present between Decision and Buyer and be-
At this stage of the Snowflake schema construc- tween Decision and Material. Therefore, we
tion, the schema of Purchasing Decision define Material and Buyer as navigational
deserves special attention. Since Purchasing attributes of the Purchasing Decision dimen-
Decision is affected by Goods Purchasing, it sion and snowflake it into the Purchasing
was recognized both as a fact entity by Rule 1.1 Decision Material and Buyer objects. To
and as a dimension entity by Rule 1.2. This reflects facilitate the identification of navigational objects,
the fact that the Goods Purchasing process we use the concatenation of the dimension name
uses this object as a source of information and and the attribute name. Similarly, we snowflake it
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
12 D. Dori et al. / Information Systems ] (]]]]) ]]]–]]]
into Default Vendor in the Purchasing As can be seen without having to analyze the
Decision Material object. Note that we do requirements for the DW, the cubes obtained by the
not continue this process indefinitely. For example, ODWC method can be used for the following needs:
we do not search for foreign keys in the Purchas-
ing Decision Buyer object. Infinite snowflaking 1. Analysis of workload over time in terms of the
is not recommended due to performance reasons [2] amount of Purchasing Decisions assigned
and may not be supported by industrial OLAP to a Buyer (Decision Dates) can be per-
products. formed using the Purchasing Decision cube.
Action Rule 2.4: Add basic dimensions. Further development will allow what-if analysis
that simulates reassignment of Materials to
Buyers or Purchasing Organizations in
The following dimensions are added automatically,
the enterprise.
regardless of the operational system conceptual
2. Relevance of Default Vendors can be
model: time dimension (always), unit dimension (if a
determined by comparing selected vendors to
fact is characterized by the Measurement Unit
Material Default Vendors, either in the
reserved object [27] containing a unit, like a gallon
Purchase Order cube or the Purchasing
or a kg.) currency dimension (if a fact is character-
Decision cube.
ized by a currency object), and data packet (at the
designer’s discretion). Automatically added dimen- 3. Updating Default Vendors can be done for
certain Materials based on the actual Orders
sions need to be populated with the relevant data
issued.
(transaction time, unit of measurement, currency, or
4. The influence of the urgency of purchasing
data upload packet) from the selected transactional
decisions on the Delivery Interval and
data object. Filling the time dimension is done using
prices Paid to Vendor in the Purchase
the attributes of the transformed object that were
Order can be evaluated using the Purchase
modeled as date or date-time. If this data does not
Order cube. The expectation is to get shorter
exist, the time dimension can be populated with the
data loading date. Similarly, unit and currency Delivery Intervals while paying slightly
higher prices.
dimensions should be established whenever the
5. Identification of preferred business partners
chosen facts are of quantitative type and are
(Vendors) in terms of overall purchasing
characterized by a Measurement Unit object or
volume, or purchasing volume for certain
a Currency object. In these cases, a key attribute
Materials, in order to introduce and negotiate
like TimeKey, CurrencyKey, or UnitKey, pointing to
scheduling agreements with them, allowing lower
the relevant entry in the time/unit dimension table is
cost and more time-efficient purchasing.
created in the fact table.
We add time dimensions to our Snowflake 6. Pareto analysis of quantity vs. volume of
schemata based on the Decision Date attribute Purchasing Decisions, in order to identify
of Purchasing Decision and Order Date of low-cost commonly used materials and change
Purchase Order. In both Snowflake schemata, a their purchasing policies to purchasing to stock
Unit of Measurement dimension is added since the instead of make-to-order, thereby allowing
Quantity attributes of the Purchasing Deci- buyers to devote more time to search alternative
sion and Purchase Order objects are character- suppliers for the more expensive Materials.
ized by Measurement Unit objects.
Applying the proposed method to the Goods Moving to the next iteration, we perform the
Purchasing process, we obtain three similar transformation operations on the Order Issuing
Snowflake schemata, in which the dimensions are process, shown in Fig. 3, in which the Buyer issues
Vendor, Material, Purchasing Decision and Buyer, as an Order based on the Selected Proposal and
well as Unit of Measurement and Date. The facts Purchasing Decision. Fig. C2 in Appendix C
come from Purchase Order, Purchasing shows the Snowflake scheme that was generated this
Decision or Goods Receipt. Fig. 2 presents time, where Purchase Order is the fact entity,
the results of the transformation process for Selected Proposal is transferred into a dimen-
Purchasing Decision snowflake while Fig. C1 sion, and Decision, Buyer and Material serve
in Appendix C presents the resultant Purchase as additional dimensions. The time and unit
Order snowflake. dimensions were added, while the Purchasing
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
D. Dori et al. / Information Systems ] (]]]]) ]]]–]]] 13
Fig. 2. A Snowflake schema of the Purchasing Decision, drawn from the Goods Purchasing process.
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
14 D. Dori et al. / Information Systems ] (]]]]) ]]]–]]]
Decision, Buyer, and Selected Proposal Dimensions definition: Dimensions are defined
dimensions snowflaked into their classifying entities. automatically by searching the attribute sets of
Using this scheme, additional analysis can be the dimensional objects.
performed, including the following. Schema refinement: No automatic assistance
is provided for schema refinement (as is
1. Comparing Quantity, Price, and Delivery the case with the other DW construction
in the Selected Proposal and the actual methods).
Order: This is needed to ensure that while Summarizability constraints: Summarizability
allowing flexibility to negotiate a quantity raise constraints need to be added manually (as is
and a commensurable price adjustment, the the case with the other DW construction
Buyer indeed represents the enterprise’s goal methods).
of increasing its profits. The cube helps dis- 2. Applicability to large-scale systems: The ODWC
cover unusual differences in quantities or prices method inherits the complexity management
between Orders and Proposals and/or strategy of OPM, enabling the architect to
Decisions. control the detail level of the things described
2. Comparing between Buyers, Vendors, and in the OPM model. The transformation is
Materials: The comparison can be done in performed on a selected process, which can be
terms of the result of negotiation after a proposal at any level of abstraction. While browsing
has been selected. It can help locating Buyers through a hierarchical set of Object-Process
and Vendors with whom negotiation has Diagrams, the designer can perform top-down,
achieved better results. It can also help pointing bottom-up, or middle-out search to find pro-
out adequate Materials, whose price was cesses of interest at their appropriate abstrac-
successfully negotiated. This indicates whether tion level, and then translate them into the
more time should be spent in order to find other corresponding data warehouse structures.
new vendors that would offer an even lower price The search can be performed using OPCAT,
for these Materials. which provides such navigational features.
Therefore, the size and complexity of the opera-
Other interesting analyses include the match tional system are much less of a challenge to the
between Request for Proposals created by DW designer compared with non-hierarchical
the RFP Issuing process and the resulting representations of the operational system pro-
Selected Proposal and the relation to the posed by other data warehouse construction
Proposal Selecting process (Fig. 4). methods using operational system conceptual
models.
4. Method evaluation 3. Required business processes knowledge: Following
our approach, the data warehouse modeling is a
Having presented and demonstrated the ODWC natural extension of the business and operational
method, in this section we evaluate it according to system modeling. As such, the data warehouse
the criteria discussed in Section 1, compare it to a modeler is assisted by the model itself in under-
representative method of data warehouse genera- standing the business processes.
tion, and discuss its limitations. 4. Analyzing business processes: The OPM-based
operational conceptual system model is a
4.1. Criteria-based evaluation straightforward representation of the enterprise’s
business processes.
1. Process automation: 5. Mapping relevant dimensional attributes: Dimen-
Facts definition: Selecting a process automati- sional attributes are mapped by their relation to
cally leads to definition of facts, so the ODWC the process being analyzed.
method provides great assistance in carrying 6. Support for data transformation from the opera-
out this task. tional system to the data warehouse: We derive
Initial model creation: Action rule 1.2 of the the data warehouse entities from the opera-
ODWC method creates dimensions automati- tional source entities, thereby greatly simplify-
cally from the objects used by the selected ing the Extract, Transform, and Load (ETL)
process. processes.
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
D. Dori et al. / Information Systems ] (]]]]) ]]]–]]] 15
4.2. Comparison to a structure-based technique To overcome this problem, Golfarelli et al. advise
that ‘‘a portion of interest of an ER scheme’’, rather
To gain better understanding of the features, than the whole diagram, should be used for the task
strengths and weaknesses of the ODWC method, [16]. Following the stages they suggested, we select
we apply a structure-based method to our case study Purchasing Decision as our fact entity. The attribute
for the sake of comparison. We then analyze the tree created appears in Appendix D. We then
differences between the structure method and manually define the dimension and the facts. The
ODWC method. We choose the Golfarelli-Maio- resulting Snowflake scheme appears in Fig. D3 in
Rizzi Dimensional Fact technique suggested by Appendix D. Examination of the construction
Golfarelli et al. [16], since it is a well known method, process performed by the Golfarelli-Maio-Rizzi
which is also used as the framework for other method and its outcomes, compared to the out-
methods for generating DW schemata. Following the comes achieved by ODWC method leads to the
principles of the chosen method, we construct an following insights:
alternative multidimensional representation of the
system shown in the Purchasing System case study. 1. The Golfarelli-Maio-Rizzi method does not aim
Using the structural relationships between objects at automating the initial multidimensional struc-
in our Object Process Diagram set, we developed tures design, nor is it capable of achieving this
the Entity-Relationship diagram (ERD) of our task. Numerous intelligent decisions must be
Purchasing Process case study. The resulting dia- made by the designer during the initial model
gram is presented in Fig. D1 in Appendix D. creation, the most crucial of which is the
Informational objects are transferred into entities identification of the fact entity. Other decisions
along with their attributes, while structural links are relate to pruning and grafting the attribute tree,
translated to relationships. According to the ER where the designer should decide which tree
principles, the diagram is a single-level visualization, vertices are ‘‘interesting’’ and omit definitions of
neglecting the OPM hierarchical diagram structure. some dimension, hierarchies, and fact attributes.
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
16 D. Dori et al. / Information Systems ] (]]]]) ]]]–]]]
2. While the ODWC method allows for creating 6. The diagram achieved by the Golfarelli-Maio-
cubes with similar facts at different levels of Rizzi method includes the Request for Proposals
summation, depending on the context of the dimension, which is not included in the cube
selected business process, the Golfarelli-Maio- proposed by our method. Since Request for
Rizzi method can reach variability of multi- Proposals has a 1-1 relationship with Purchasing
dimensional representations only by repeating Decisions, this difference does not alter the cube’s
the design process while manually making granularity (data resolution) while adding less
various design decisions. The basic structure of analytic capabilities than what the ODWC based
cubes based on the same fact entity will remain Purchasing Decision snowflake offers. Moreover,
unchanged, except for ‘‘pruning’’ and ‘‘grafting’’ this ‘‘disadvantage’’ could easily be reverted by
different dimensional entities. either (1) following the Golfarelli-Maio-Rizzi
3. Fig. D2 presents urgency and status as direct guideline to graft a 1-1 related vertex in an
descendants of the root vertex (decision number). attribute tree or (2) using our method’s scal-
Following the manual modeling stages, the child ability in order to produce an additional cube,
vertices of the root entity may either become such as the Purchasing Decision cube from the
dimensions or fact attributes. In our case, we Proposals Selecting process, where Request for
chose neither to mark urgency and status as Proposals will automatically be included as a
dimensions nor to use them in fact calculations. dimension.
By doing so, we compromised our cube’s ability
to perform data analysis, since we cannot use our The most eminent disadvantage of the structure-
cube to discover materials for which purchasing based technique is its lack of ability to utilize
is badly planned (and therefore are regularly existent many-to-one structural relations for tasks
purchased in an urgent fashion), nor can we other than expanding the multidimensional struc-
answer simple operational control queries, such ture. Since a valid Entity-Relationship Diagram
as regarding the volume of decisions that are yet includes a sufficient set of relations (as opposed to
to be ordered. In the general decisions cube every imaginable relation), certain paths may
proposed by our method (see Fig. 2), both these become too short to construct a beneficial multi-
scenarios are met by the presence of a line item dimensional structure. For example, considering
dimension, a multidimensional design alternative Purchase Order as the fact entity, the only entity in
that cannot be exploited using the Golfarelli- the ERD of our case study related to Order by a
Maio-Rizzi method. many-to-one relationship is Vendor. As a result,
4. Structure-based methods use only structural Vendor is the only possible dimension in the
relations between entities to construct a multi- diagram apart from attributes of Order, making
dimensional structure. This approach prevents the resulting cube almost useless for analysis. On the
the incorporation of context-relevant entities. In other hand, our method, which uses the procedural
our case, the chosen vendor has no direct rather than structural links to identify possible
structural relation with the decision, and there- dimensions, can achieve several multidimensional
fore, the Golfarelli-Maio-Rizzi method does not structures based on the Purchase Order as a
include it as a dimension. Vendor is only included fact entity, including those presented in Fig. C1
as the default vendor of a material. Hence, if one and Fig. C2.
or more other vendors are involved in a decision-
making process, the multidimensional represen- 4.3. Limitations of the ODWC method
tation does not reflect it.
5. Both cubes include Purchasing Decision Quantity The limitations of the ODWC method relate to
as a fact. Since a quantity is categorized by a features which cannot be automatically specified.
Measurement Unit, our method adds it as These include:
another dimension. This solution complies with
commercial products standards on this issue [29]. 1. Addition of facts which do not exist in the
The Golfarelli-Maio-Rizzi method disregards operational system schema or need to be
this issue, and in fact creates a scheme in which calculated during data load. Our example de-
a single fact is non-additive on any of its monstrated the use of such calculated facts,
dimensions. which are defined manually in some of the
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
D. Dori et al. / Information Systems ] (]]]]) ]]]–]]] 17
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
18 D. Dori et al. / Information Systems ] (]]]]) ]]]–]]]
Appendix A. A Quick Guide to the Syntax and Semantics of Object-Process Methodology (OPM)
A synopsis of the Syntax and Semantics of Object-Process Methodology is presented. More information is
found in [27].
ENTITIES
B is physical.
(shaded rectangle)
An object is a thing that exists.
Object C is physical and
environmental. A process is a thing that transforms
Things
A is s1.
A state is situation an object can be at
or a value it can assume.
B can be s1 or s2.
State
States are always within an object.
C can be s1, s2, or s3.
States can be initial or final.
s1 is initial.
s3 is final.
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
D. Dori et al. / Information Systems ] (]]]]) ]]]–]]] 19
MANAGEMENT
Name Symbol OPL Semantics
A consists of B
and C.
Aggregation-
A is the whole, B and C are parts.
Participation
A consists of B
and C.
A exhibits B, as
Fundamental Structural Relations
B is an A.
C is an A.
A specializes into B and C.
Generalization-
Specialization A, B, and C can be either all objects or
all processes.
B is A.
C is A.
B is an instance
Object A is the class, for which B and
Classification- of A.
C are instances.
Instantiation C is an instance
Applicable to processes too.
of A.
A relates to B.
(for
Unidirectional &
unidirectional)
bidirectional A user-defined textual tag describes
tagged structural any structural relation between two
A and C are
links objects or between two processes.
related.
(for
bidirectional)
A exhibits C.
A consists of B.
Zooming into process A, B is its part
In-zooming A zooms into B,
and C is its attribute.
as well as C.
A exhibits C.
A consists of B. Zooming into object A, B is its part and
A zooms into B, C is its operation.
as well as C.
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
20 D. Dori et al. / Information Systems ] (]]]]) ]]]–]]]
State-
Specified Process B creates Object A at State
B yields s1 A.
Result Link s1.
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
D. Dori et al. / Information Systems ] (]]]]) ]]]–]]] 21
PROCEDURAL LINKS
State-
A triggers B. Entering state s1 will attempt to trigger
Specified
when it enters s1. the process once. Execution will proceed
Instrument
B requires s1 A. if the triggering failed.
Event Link
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
22 D. Dori et al. / Information Systems ] (]]]]) ]]]–]]]
The Goods Purchasing process is the main function of the system. In the OPD of Fig. 3, the Goods
Purchasing process is in-zoomed. This OPD, like its ancestor, is next explained via its corresponding OPL
paragraph below.
Purchaser, which is physical and environmental, exhibits Buyer. Purchasing
Organization consists of many Buyers. Purchasing Decision exhibits Status and
Urgency. Status can be approved, vendor selected, ordered, goods received, or
completed. Urgency can be urgent or regular. Purchaser handles Goods Purchasing.
Goods Purchasing requires Buyer. Goods Purchasing affects Purchasing Decision.
Goods Purchasing zooms into Full Vendor Selection, Automatic Vendor Assignment,
Order Issuing, Goods Receiving, and Paying, as well as Selected Proposal. Full Vendor
Selection requires regular Urgency and Vendor. Full Vendor Selection yields Selected
Proposal and Request For Proposals. Full Vendor Selection changes Status from
approved to vendor selected. Automatic Vendor Assignment requires urgent Urgency.
Automatic Vendor Assignment yields Selected Proposal. Automatic Vendor Assignment
changes Status from approved to vendor selected. Order Issuing consumes Selected
Proposal. Order Issuing yields Purchase Order. Order Issuing changes Status from
vendor selected to ordered. Inventory Worker, which is environmental and physical,
handles Goods Receiving. Goods Receiving requires Purchase Order. Goods Receiving
yields Goods Receipt and Stocked Goods. Goods Receiving changes Status from ordered
to goods received. Paying requires Purchase Order and Goods Receipt. Paying changes
Status from goods received to completed.
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
D. Dori et al. / Information Systems ] (]]]]) ]]]–]]] 23
The Full Vendor Selection process is in-zoomed in the OPD of Fig. 4. Purchaser, which is
physical, exhibits Buyer. Purchasing Organization consists of many Buyers. Purchas-
ing Decision exhibits Status and Urgency. Status can be approved, RFP issued,
ordered, and vendor selected. Urgency can be urgent or regular. Purchasing Decision
consists of an optional Request For Proposals and optionally many Proposals.
Purchaser handles Full Vendor Selection. Selected Proposal is a Proposal. Full
Vendor Selection requires Buyer and regular Urgency. Full Vendor Selection zooms into
RFP Issuing, Proposals Receiving, Negotiating, and Proposal Selecting. RFP Issuing
changes Status from approved to RFP issued. RFP Issuing yields Request For Proposals.
Proposals Receiving requires Vendor. Proposals Receiving yields Proposal. Negotiat-
ing requires Vendor. Negotiating affects Proposal. Proposal Selecting requires
Request For Proposals. Proposal Selecting changes Status from RFP issued to vendor
selected. Proposal Selecting yields Selected Proposal.
Fig. B1 is an OPD in which the Purchasing Decision object is unfolded. The OPL paragraph of this
OPD follows. Purchasing Decision exhibits Status, Urgency, Decision Date, Quantity,
and Decision Number. Status can be approved, RFP issued, vendor Selected, ordered,
goods received, or completed. Urgency can be urgent or regular. Quantity exhibits
Measurement Unit. Purchasing Decision consists of an optional Request For Proposals
and optionally many Proposals. Request For Proposals exhibits RFP Number, Closing
Date, and Quantity. Proposal exhibits Proposal Number, Delivery Interval, Pay To
Vendor, Delivery Terms Code, and Proposal Date. Pay To Vendor exhibits Currency.
Proposal refers to Request For Proposals. Proposal is submitted by Vendor. Buyer
handles Purchasing Decision. Purchasing Decision features Material. Selected
Proposal is a Proposal. Selected Proposal exhibits Agreement Date.
Figs. B2 and B3 unfold other important objects of our case study.
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
24 D. Dori et al. / Information Systems ] (]]]]) ]]]–]]]
Figs. C1 and C2
Fig. C1. A Snowflake schema of the Purchase Orders, drawn from the Goods Purchasing process.
Fig. C2. A Snowflake schema of the Purchase Orders, drawn from the Order Issuing sub process.
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
D. Dori et al. / Information Systems ] (]]]]) ]]]–]]] 25
Fig. D1. An Entity-Relationship Diagram featuring the structural relationships of the Purchasing case study.
Fig. D2. An Attribute tree corresponding for the Purchasing case study.
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
26 D. Dori et al. / Information Systems ] (]]]]) ]]]–]]]
Fig. D3. Multidimensional representation of the Purchasing case study, by the Golfarelli, Maio and Rizzi method.
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002
ARTICLE IN PRESS
D. Dori et al. / Information Systems ] (]]]]) ]]]–]]] 27
[20] P. Chen, The Entity-relationship model—toward a unified [26] D. Dori, R. Feldman, A. Sturm, Transforming an opera-
view of data, QACM Trans. Database Syst. 1 (1) (1976) 9–36. tional system model to a data warehouse model: a survey of
[21] C. Kaldeich, J. Oliviera e Sa, Data warehouse methodology: techniques, in: Proceedings of the IEEE International
a process driven approach, in: Proceedings of CAiSE 2004, Conference on Software—Science, Technology & Engineer-
2004, pp. 536–549. ing (SwSTE), Hertzlia, Israel, 2005, pp. 47–56.
[22] B. List, J. Schiefer, A.M. Tjoa, G. Quirchmayr, Multi- [27] D. Dori, Object-Process Methodology: a Holistic Systems
dimensional business process analysis with the process Paradigm, Springer Press, 2002.
warehouse, in: W. Abramowicz, J. Zurada (Eds.), Knowl- [28] D. Dori, I. Reinhartz-Berger, A. Sturm, OPCAT—
edge Discovery for Business Information Systems, Kluwer a bimodal CASE tool for object-process based system
Academic Publishing, 2000, pp. 211–227. development, Proceedings of ICEIS (3) (2003) Angers,
[23] M. Boehnlein, A.U. Ende, Business process oriented France.
development of data warehouse structures, in: Proceedings [29] SAP, SAP BW Business Warehouse—Introduction,
of Data Warehouse 2000, Physica Verlag, Friedrichshafen, retrieved from: /http://www.thespot4sap.com/Articles/SAP_
Germany, 2000, pp. 3–21. BW_Introduction.aspS, 2006.
[24] O.K. Ferstl, E.J. Sinz, SOM modeling of business systems, in: [30] OMG, XML Metadata Interchange (XMI) Specification,
P. Bernus, K. Mertins, G. Schmidt (Eds.), Handbook on retreived from: /http://www.omg.org/technology/documents/
Architectures of Information Systems–International Handbook formal/xmi.htmS, 2005.
on Information Systems, Springer, Berlin, 1998, pp. 339–358. [31] M. Golfarelli, S. Rizzi, I. Cella, Beyond data warehousing:
[25] H. Kim, T.H. Lee, S. Lee, J. Chun, Automated data what’s next in business intelligence?, in: Proceedings of
warehousing for Rule-based CRM Systems, in: Proceedings DOLAP 2004, Washington DC, USA, 2004, pp. 1–6.
of Australasian Database Conference (ADC) 2003, Ade- [32] E.J. Sinz, Das Strukturierte Entity-Relationship-Modell
laide, Australia, 2003, pp. 67–73. (SER-Modell), Ang. Inform. 30 (5) (1988) 191–202.
Please cite this article as: D. Dori, et al., From conceptual models to schemata: An object-process-based data warehouse construction
method, Informat. Systems (2008), doi:10.1016/j.is.2008.02.002