Академический Документы
Профессиональный Документы
Культура Документы
A SimplifiedStatisticalTreatmentof GeochemicalData
by GraphicalRepresentation
CLAUDE LEPELTIER
Abstract
Inthecourse
ofamineral
explo•-ation
sponsored
bytheUnited
Nations
Development
Programmein two selectedzonesof Guatemala,a streamsedimentreconnaissance was
carriedout, and graphicalmethodsof interpretationwere attemptedin the searchfor a
simplifiedstatisticaltreatmentof about 25,000 geochemical results. The data were
groupedby drainageandlithological units,andthe frequency distributions
of the abun-
danceof Cu, Pb, Zn andMo werestudiedin the form of cumulative
frequencycurves.
The four elementsappearto be approximatelylognormallydistributed.Background,
coefficients
of deviationandthreshold
levelsweregraphically
estimated.Examplesare
givenof simpleandcomplexpopulations.Mineral associations
were studiedby correla-
tion diagrams.
Value Value
Arithmetic scale Logarithmic
scale
Cumulated
frequency
•.
Cumulated
frequency
• Q 99.99
,,
5o
5
2.5
0,5
O.X
Figure3. Confidence
limits(Pl, P2}=t 0.05probobllity
level- againstthe lower classlimits. Using the classcenter
will entailan error of excesson the centraltendency
Cumulative parameters(backgroundand threshold) but not on
Frequency
inO/o Number
of the dispersionparameter(coefficientof deviation).
This error, or difference,varies with the type of
samples classesused and is easily calculated (6% for the
0.05 logarithmicclassinterval,12% for the 0.1 log.
int. and 26% for the 0.2 log. int.). If the class
limit is used,curvesconstructed from differentlog.
int. classescan be directly comparedwithout cor-
rection.
Let us take a concreteexample: the distribution
of Zn in the quaternaryalluvial depositsof Block I
(Fig. 2). There are 989 resultsrangingfrom 10
to 230 ppm.
230
population:N-- 989 range:R- - 23
10
log.
int. log
n
R 1.36
14
= 0.097
4OO
A 0.1 log. interval will give 14 intervals,which is
5OO
acceptable. Usually, the histogram-frequency curve
step is skippedand the cumulativefrequencydia-
gram directlyconstructed.
2 000
In Figure 2, the points fit fairly well along a
5OOO straight line, suggestinga lognormaldistributionof
10 000
zinc in the alluvial deposits. Actually, the points
never fit the line exactly,but this doesnot matter
provided they stay in a channeldelimited by the
Source:
A. t ie•zou,
Initiatiou
pratique
i laslatistiwe,
confidencelimits usually taken at the 5% prob-
Gauthief
Villars,Parrs,1961. ability level. This confidenceinterval has been
drawn on Figure 2 by usinga graph (Fig. 3), which
frequenciesfrom the lowest values toward the high- avoids fastidiouscalculationand givesa fairly good
est (Fig. 1) (Hubaux, 1961; Termant and White, precision for the cumulativefrequencyvalues be-
1959). However, one has to considera property tween 5% and 95%. The width of the confidence
of the probabilityscaleused as ordinates:the values channel is inversely proportionalto the importance
zero and 100% are rejectedat the infinite; it does of the population considered: the bigger the popula-
tion, the narrower the confidence interval. To
not matter for zero becausezero% never occurs,
but in each case the last cumulatedfrequencyis check that a distribution fits a lognormal pattern,
100%, and this value is impossibleto plot, lost one shoulduse the Pearson'stest (Rodionov,1965;
for the curve. Then consideringthe lack of pre- Vistelius, 1960), but this longer operation is gen-
cision in the low values and the importanceof the erally not warranted in this type of interpretation
high ones for the determination of the threshold and, for practical purposes,the graphical control
level, I considerit much better to cumu, late the fre- describedabove is satisfactory.
quenciesfrom the hi#hestto the lowestvalues; thus,
Comparison with Histograms
the 100% will correspondto the lowest classand be
eliminated. For comparisonpurposesthe cumulativefrequency
As for the secondpoint, the curve being an in- curve for Cu in the Motagua drainage (Fig. 2)
tegral one, the ordinates must be plotted at class was also constructed, then, in Figure 4, the cor-
limits and not at class center; then, since one respondinghistogramsand frequencycurvesfor Cu
cumulatesthe frequenciesfrom the highest values and Zn. Figures 2 and 4 presentthe samedata in
to the lowest,cumulatedfrequenciesare to be plotted two different ways. Before enumeratingand com-
_ '•.
ß
.- I I tz.
- I
.......
,,
.....
19l 1.• 1,• 1,t l,• l,• ],1l 4.1 i• LII 9.l 11.1 14,8 18.f l•,4 19,• •1.l 1.81 1.9 14,1•,•
ordinate2.28% to 2.5%. This showsthe importance b. a mixture of two populationsin a given set of
of the deviation in the estimationof the threshold; data; and
two populationsmay havethe samebackgroundbut, c. an excessof low valuesin the consideredpopu-
nevertheless,different thresholds if their coefficients lation.
of deviationare different. In Figure 2, the threshold
is five timesthe background
for Cu and only 2.7 These three casesare representedgraphicallyin
times for Zn. Figures 5. They correspondto real distributions
In all the foregoing, I have consideredthe sim- encounetredin the Guatemalandrainagesurvey and
plest case: a singlelognormalpopulation,the dia- appear as solid lines with slopebreaks on the dia-
grammatic expressionof which is a straight line. gram. Some indications are given below showing
However, when constructingcumulativefrequency how to interpret suchlines.
curves, a broken line is frequentlyobtainedsug- Copper Distribution (in a lithologicalunit). The
gesting that the set of data consideredconsistsof a cumulativefrequencyline (Fig. 5) shows a break
complexpopulationor of different ones. Whenever to a flatter slope at the 30% level. This is the
possiblein practice,the interpretationis made on case when there is an excessof high values in the
sets of data selected so as not to include more than population;the histogramwill give a frequencycurve
two different distributions; for instance, a litho- skewed to the right, in the direction of the high
logicalunit may includetwo typesof mineralization values (positive skewhess). If the populationwas
showingup in soil or sedimentsamples;one repre- lognormallydistributed,the main branch Oat should
sentativeof the normal or backgroundcontent of extend as a straight line in Oz whereas,in this case,
the materialsampled, and the other,a superimposedOx is lifted to Oy whichmeansthat insteadof having
mineralization related to ore. 2.5% of the values30 ppmor greater,thereare 17%
of them. The abscissaof the breaking point, O,
Examples (in this case 18 ppm) indicatesthe limit above
The three main casesof non-homogeneous dis- whichthereis a departurefrom the norm (i.e., from
tribution that are the most likely to occur are, in the lognormaldistribution),an excessof high values.
decreasing frequencyorder: In this case,backgroundand coefficientsof deviation
a. an excess of high values in the considered are calculated with the main branch Oat. The
population; abscissa of the breakingpoint may be conveniently
taken as threshold value if the break occurs above Advantages of Cumulative Frequency Curves
the normalthresholdlevel of 2.5%. If, however, Plotting the distributionof an elementin a selected
the breakoccursbelow2.5% level (at point p for unit as cumulativefrequencycurve on probability
instance) the thresholdshould be taken as usual
graph paper is the easiestand most preciseway to
(abscissa of point P). Positivelybrokendistribu- presenta great amount of data (for instance,pre-
tion linesare the moreinteresting because they in- sentingFigure 5 as histogramsand frequencycurves
dicatean excessover the background mineralization. will result in an overloadedand illegiblediagram).
Molybdenum Distribution(in a lithological
unit). All the characteristicparametersof the distribution
The cumulative distribution line shows two breaks: can be estimated without cumbersome calculations.
first a positive,then a negativeone. Such a graph Comparisonbetweenvarious populationsare easy
is the expressionof a dual distribution,suggesting andcomplexdistributions are clearlyidentified.Fur-
the existenceof two distinctpopulationsin the set thermore,the adjustmentto a lognormaldistribution
of data considered.It givesa double-peaked histo- can be checkedgraphically.
gram. We shall considerhere only the most fre- Comparingthe geochemical featuresof the various
quentcaseof a main "background" populationmixed units of a surveyarea is importantin assessing their
with a smallerone of higher averagevalue, the two mineral potential. This is convenientlydone by
of them beinglognormallydistributed. On the dia- plotting the correspondingdistributionson the same
gram (Fig. 5), branchA corresponds to the main diagram for instance Cu distribution in three or
or normal population, branch B to the anomalous four different drainagesin the case of a stream
population and the central branch A q- B to a mix- sedimentreconnaissance.Distribution heterogenei-
ture of the two. By splitting the data at a value ties will be spotted and the correspondingunits
taken aroundthe middleof A q- B (at 4 ppm for selectedfor further investigations. On a broader
instance),it is possibleto separatethe total popula- scale, the geochemicalbehavior of trace elementsin
tion into two elementaryones appearingas a and a given geologicalenvironmentfrom different coun-
b on the diagram. The generalbackgroundwill be tries or metallogenicprovincescan be readily com-
taken with branch A and the threshold as the abscissa
pared. This is an approachto a betterunderstanding
of the middleof branchA q- B, thoughthe threshold of the distributionlaws of trace elementsin naturally
of populationa may alsobe considered, but we have occurringmaterials.
not enoughexamplesof suchcomplexdistributions
to make definite recommendations,and we lacked The Coefficients of Deviation
computingfacilities to calculatetheoreticaldistribu- A lognormaldistributionis completelydetermined
tions. The coefficients of deviation must be cal-
by two parameters:the geometricmean (b) and
culatedseparatelyfor distributionsa and b. the coefficientof deviation (s). It has been seen
Zinc Distribution (in a drainage unit). The that the absolutedeviation can be expressedas a
negativelybrokenline on Figure 5 is the expression geometricfactor s' or, more commonly,as a logarith-
of an excessof low valuesin an essentially
lognormal mic coefficients. The term "deviation"is preferred
distribution;in this case, the histogramis skewed to "dispersion"which might be more expressive,
to the left, toward theselow values (negativeskew- becausethere is no geneticimplicationin the concept
ness). Provided their proportionis not too high of statisticaldispersionwhereasthere is one in the
(20% or less or instance),they do not interfere notion of geochemicaldispersion; however, many
in the interpretation,which is done on the main peopleuse the term "dispersion"in statisticalinter-
branchof the distributionline in the usual way. pretationof geochemical data.
This excessof low valuesmay be due to the inclu- The coefficientof deviationis a dispersionindex
sionin the populationof a low-background lithologi-
specificfor the distribution of a given element in a
given environment and expresses the degree of
cal unit or, more often, to poor sampling(for in-
homogeneityof this distribution. When rocks are
stance,collectingan important set of sedimentsam-
plesthat are too coarse). considered, a similarityin the coefficientof deviation,
together with similar average values, may indicate
When the resultsdo not fit a lognormaldistribu-
similar geochemicalprocessesin their formation.
tion, an explanationmay generallybe found among
It is possiblethat a given value of s corresponds
these three factors: (1) lack of homogeneityin
to each type of mineralizationin a lithologicalunit.
sampling,(2) complexgeology(imprecisionin the Confirmingthis assumptionwould require very ex-
lithologicalboundaries),and (3) analyticalerrors.
tensivegeological-statisticalstudiesencompassing all
It shouldalsobe kept in mind that someelements
metallogeniccases.
in somesurroundings
may not be lognormallydis- There is also a relationshipbetween the back-
tributed. ground (b) and the coefficientof deviations(s)
All lilholo•icllunits
,=o.
lsJ'1
2o.2
Figure 7. CorrelationdiagramCu/Zn
cu(pp,.) •
III
III • \ ß
Ill / ' x ß
., Ill
Ill , / X , Ill
Ill I " • Ill
Cu :
III ,
Ill
III
,,i
ß.
....
'.A......• •
/'
- Il
•
:-_ .
•
,,,
I II
tll•
Ill
III Jill
III ...............
', III I II
III ; •11
Ill , III
III ill
N1= nI + n3 = 168
Practically, this presentationof the data is very (1) eitherp is equalor near to zero: the elliptical
convenientbecauseit gives a geometric image of cloud has its axes parallel to the coordinate axes
the distribution laws. The axes passing by the and the two variablesare independent,
gravity center (b•, b)•), that is to say by the point (2) or p is clearly different from zero and the
whosecoordinatesare the backgroundvaluesfor the cloud is an ellipsewhoseaxes are inclined relative
two consideredelements,are then drawn. In Figure to the coordinates. The slopeof the main axis has
7, the axes will passthrough the point (bc, = 5.3 the samesign as p (if p > 0 the two elementsvary
ppm, bz, = 75 ppm). The points falling in each in the same direction; if p < 0 the two elements
quadrantare summedup and countedas follows: vary inversely).
N• = numberof pointsin first and third quadrants The correlation cloud is in fact a two dimensional
N•. = numberof pointsin secondand fourth quad- histogram; it is the bestand simplestway to estab-
rants. lish whethera populationis homogeneous or hetero-
geneous:in the first case,the points tend to group
Then Ois givenby the formula: in a singleellipticalcloud; in the second,they split
into 2 or several attraction centers and form several
o= sin[•'N•+Ns
•rN•--N•1 elliptical clouds more or less overlapping. G.
Matheron pointsout that the relation expressedby
p is an expressionof the MassAction Law if p = --+1
Practically,p is never equal to --1 (which would (or of the orderof +0.95) (Matheron,1962); then
be the caseif all the pointswere on a straightline) it is likely that a geologicallybasedchemicalequi-
and the points form an ellipticalcloud. Two cases librium exists between the two elements considered.
may happen: In geochemical prospecting,correlationcoefficients
Figure8. CorreletiondiegramPb/Zn
Pb (i)Pm)
1000•
6
© - I I IIII I II
............... III
'f' i ii
• iI[ I•111.
ilii-: / /
/, ,,,
Iii
I I
,,,
Ill
III
* IXI [ll../ / I ii ii
• /. ; / ' /I
, / • ,,,,,. 111
/
/
- '"'"
I IIIII
Illl•
III "III
III