Академический Документы
Профессиональный Документы
Культура Документы
CV. DEUTSCH
Department of Petroieum Engineering
Stanford University, Stanford, CA 95305-2220
1. Introduction
The basic paradigm of geostatistics is to model the uncertainty about any
unknown value e as a random variable [RV] Z characterised by a speci c
probability distribution. The unknown value z could be a global parameter
such as the economic ultimate recovery of an oil eld or a local attribute
115
Benji and NA. Schojieid {eds}, Georronlnics Woiiongong '96, Voiione I, 115-125.
-1
Kinwer Academic Pubiishers. Printed in the N:-rirerionds.
@i"1 - 1" 9'97
1 16
Ch. DEUTSCH
ne-tuning the
For clarity and in the context of evaluating the goodness of a probabilistic model, we propose speci c de nitions of accuracy and precision.
For a probability distribution, accuracy and precision are based on the actual fraction of true values falling within symmetric probability intervals of
varying width p:
11'?
[1]
where n[u,-] is the set of n data minus the data at location u,-. These local
ccdf models may be [1] derived from a set of L realizations, [2] calculated
directly from indicator-kriging, or [3] de ned by a Gaussian mean, variance,
and transformation.
The probabilities associated to the true values .z[u,-],i = 1,... ,n are
calculated from the previous ccdf as:
F[u";; a[u,]|n[u,]], -i = I, . . .,n
For example, if the true value at location u.,- is at the median of the simulated
values then .F'[u,-; :[u,]|n[u;]] would be 0.5.
Consider a range of syrmnetric p-probability intervals [Pls], say, the
centiles 0.01 to 0.99 in increments of 0.01. The symmetric p-PI is de ned
by corresponding lower and upper probability values:
__ l1Pl
plow "
3-I1
C,
_ u+p1
Pupj!;r T
H __ ]__{ 1: ii'F(11i~=(111'}|'-'1(11="l]E(aia..,P..,a]
ullp _
(.2)
otherwise
Z:G=
;3
ll
.2Ilsa)
..M=
<3)
is the proportion of locations where the true value falls within the symmetric p-Pl.
According to our earlier de nition of accuracy, the simulation algorithm
used to generate the ccdfs [1] is accurate when m E; p, lip. A graphical
-_
l 13
C.'*v'. DEUTSCH
41.?-4-1.
way to check this assessment of accuracy is to cross plot E [p] versus p and
see that all of the points fall above or on the 45" line. This plot is referred
to as an occurncy plot, see Figure 1 and example hereafter.
An equivalent way to calculate m is to de ne the indicator
u; p] =
for z 0
F[i,-,s|[n[i]]] ={ z,
for 0 < z <1
.
for s __
10,
>1
The probability of each true value s,;,i = 1,. . . , n is that value itself, i.e.,
F[i;s;|{n[i]]]=.e,-, i=1,...,n
[4]
The average indicator function [p] was calculated for each centile p =
I-,%,j = 1, . . . ,90 and the resulting accuracy plot is shown o11 the top of
Figure 1. Note that the plot is very close to a 45" line indicating that this
ccdf model is both accurate and precise.
The middle row graphs of Figure 1 illustrate the accuracy plot if the
ccdfs were expected to be uniform between -0.5 and 1.5. The simulated
distributions are still accurate, because the probability intervals are sufficiently wide, but not precise. The 0.5 PI of this ccdf model contains all of
the true values, therefore, -f[0.5] = 1.0.
The lower graphs on Figure 1 show the results when the ccdfs are mod-
-1-.
'1
1 151
1.0_
0.0..
0.5..
tivi I
0.-ii
;1
0 1
i||ii||i|||iL||||i|
0.0
0-2
0.
0.5
0.0
1.0
probability interval i- p
1.1:'
_
.-
0.3
55*.
0.E__
an
LLu.Ii1|JI1
1-H
vi,-ii
vs -1
.-"
yr
0.2
ii _n
L.-_I
i.
i,
I
I'\J
'5,"
0'
0.0
0.2
0.4
0.0
0-0
1.0
probability interval - p
1-0
0.0
__p-E
tivi
.- - - - - . . ..
0.4
as
an
n Cl
-1
, H ,
n
,,,,l
GT
h Q
F-C
mi
PI
m _
I
0
I
I
I
I
*1a
probability interval - p
Figure 1. Ai:cura.cy plots for n = 1000 inrlependeiit iuiiforrnly rlistriliiiteil in [11, 1]
ralidom mnuhers a,-,i = 1.. . . ,n. The plots curresponrl tn [1] a correct uni orni
distribution between 0 and 1, [2] a uniform ilistrihution between -0.5 and 1.5. and
'1
1-.
12:]
c.v. DEUTSCH
[0, 1]:
as-{ 5;
-I1l'I'liI-I
is
A = [1 voids
<6)
where A = 1.0 for maximum accuracy [air] Z p lip E [0,1]] and A = 0.0
in the worst case, i.e., no true values are contained in any of the probability
P= 1~2ju1 (a)[@~al==ia
1?)
gi-ii.
As seen on the top of Figure 1, there are some instances where {;'[p] '-1.
p by some small amount. The previous de nitions oficcuracy [5] and
precision [T] fail to account for the inaccurate cases
-=1. p. We de ne
G = 1 [fslallia-alda+?[1 ~sia)llP-Z010]-'ia[
{8}
= 1~[J1[3s(a)~?l [@-alda
wbere G = 1 for maximum goodness
= p, 'v'p E [0,1]] and G = 0 in
the worst game, i.e., no true values are contained in any of the probability
intervals
= 0, VP E [0,
Table 1 gives the accuracy, precision, and goodness statistics for the
results on Figure 1. Note that the triangular distribution is not as good as
the too-wide uniform distribution because of the more severe penalty for
H
F?
ll-.I
121
Accuracy A
I"re-cisiou P
Gooilness G
u1.LifIJ1'I.L'i E [0, 1]
0.40-*1
0.001
0.005
uniform E [0.5,1.5]
1.000
0.000
0.'i'00
0.000
0.000
0.640
I"
"'1
112
CH. DEUTSCH
Gnu D1
0.1125
_. J. I.-
[_'||_
II
It
__
-3
_.,_|
ms
__
'
1
|
_L F
|
Frequency DP E
.~';
an-us
II
|__
I;
1|:
-l_
"
1,'
@..'I.' "'i
H "i'-.|ir__
0.0
l_r|__JI|':|
u.2
n.4
as
n.a
quanti e
1.0
Cni ?
U115
"'1
_ Fl '
EH34
"LI
.."-*
"L-Fl
"-r'
-I
P8
.1".
.-an
':--
Frequency EH12
.'!
'-'
_.|__
I101
DIE}
EI.l.']
D2
:l
U.-1
11-:-|
'15
{LB
1 .0
quanti a
a-w -I
:1:-.10.}-I
'.'|
' |
-1-!.
EI.lJB_'|
Frequency
n.|:=ej
1:
l.'J.U-4..
*
|j_[}2 _ |I.|I
ml0.0 ||||m|l||||||||||.|||l|||||||||ln||||||||||
0.2
0.4
as
ma
1.u
quantila
Figum 2.
Figure 1.
-1
1|
123
If we systematically tool-: the global histogram as the model of uncertainty at each location we would nd that this conservative probabilistic
model is both accurate and precise. The uncertainty, however, is large. Uncertainty increases as the spread of the probability distribution increases
U1N
2 -}
_Efg'5"luI
where there are N locations of interest u,-,i = 1,...,N with each variance
ima-
124
C."v". DEUTSCH
|c4cc|su|ny1c1-unisex
B
|II'lIlf it
1--'.U"-I.-I
.53,-|.
oeb
CI
I.
...':,.
col-coo.
E ', _.
.illii-2:
|.'..:
' B 'e
I.
'
I c
-_
.,,__*
= -
' i
..
El
C:
If
I- E
FiI1IT 3.
'
Li
'
ill
tit
'|
,.
'
ii in
it
II
IIBIIIIIII.Ii
FI
ls!
iiit
"-
'
ll
ill
TL!
v-nuns-;:.um,||urus|ny.ai
LII
_ _
_ _ _ .
_ .
. . - - _ _.-
ll
In
ml
'
'
u
llll
'
Ill-1|
11.:
In
I1
T "
F1
III}.
'
'
IND.
Ii.
anus:
-Ii.
I.
FIN.
Q I '|.Dl]'iil
In
U.-U
F =: U15?
G = 'l].BTH
|-F-I'I-f
Ill-I
-|-
Ill
-IiI
|-
I-LI
1-
-Ii
H-.I
I-
-i
-Ii
LEI
Dmhabll y interval - p
Figure 4'. Horisontal normal scores seluivariogrutn from the 3-D porosity data
[calculated using stratigraphic coordinates} and the corresponding accuracy plot
considering cross validation with the normal scores of the T4 well data on Figure 3.
The page limitation prevents full presentation of the case study. Un
the basis of work not shown, considering seismic data as soft information
for simulation of porosity with a carefully t linear model of coregionalisation allows further reduction of uncertainty while maintaining accurate
I25
1.0-
13%
0.1:]
.'
0.0-
'*
tin) _
04;
,1
'4'-I
1
'
_
_
*3
'
A = aces
P = U15?
e = 0.952
-4-
1-
lmu.
N.
Dianne:
Ii.
MO.
GI
DJ
Df
I.-I
ll
Ill
Ll
ibilily iI"li'IIl"li'El - II
probabilistic model.
The main uses for the diagnostic tools presented here are: [1] detecting
implementation errors, (2) quantifying uncertainty, (3) comparing different
simulation algorithms (e.g., Gaussian-based algorithms versus indicatorbased algorithms versus simulated annealing-based algorithms), and (4)
ne-tuning the parameters of any particular probabilistic model (e.g., the
variogram model used).
These tools provide basic checks, i.e., necessary but not sufficient tests
that any reasonable stochastic simulation algorithm should pass. They do
not assess the multivariate properties of the simulation. Care is needed
to ensure that features that impact the ultimate prediction and decision
making, such as continuity of extreme values, are adequately represented
in the model.
References
Dcutscli. C. iii. Joiirncl. A. (1992). GSLIB: Ger.r.rlriti.vtir:ril Sfl iilflfrf Lil-r'riry rind .1".w:r".'i
Gnirie, Uxfiiril University Press, New York.
lsaaks, E. [199UlI. The Ajijilicritiori of Mriritr: C'rii"lo Mrithrirls to Hie r-lririiysis of Sgiiifiriil-y
C'or'rr.'l0ir:ri Brita. PhD thesis, Stanford University, Stiinforil. CA.