Академический Документы
Профессиональный Документы
Культура Документы
Spatial Distributions
2. Inferential statistics
Concerned with making inferences from samples about a
populations
1. mean (average)
2. median (middle value)
--50% larger and 50% smaller
--rank order data and select middle number
3. mode (most frequently occurring)
These may be obtained in ArcGIS by:
--opening a table, right clicking on column heading, and selecting Statistics
--going to ArcToolbox>Analysis>Statistics>Summary Statistics
ADMIN_NAME Illiteracy-Prcnt Rank order
Beijing
Liaoning
Tianjin
3.11
3.48
3.52
1
2
3
Calculation of
Taiwan 3.9 4
Shanghai
Guangdong
Heilongjiang
3.97
4.02
4.16
5
6
7
mean and median
Shanxi 4.42 8
Jilin 4.44 9
Xinjiang 4.64 10
Hebei
Guangxi
4.83
5.61
11
12
Mean
Hunan 5.87 13
Jiangxi
Hong Kong
6.49
6.5
14
15
296.15 / 34 = 8.71
Henan 7.36 16
Hubei 7.69 17
Chongqing 7.8 18
Shandong 7.96 19
Jiangsu
Nei Mongol
8.05
8.14
20
21
Median
Shaanxi 8.19 22
Hainan
Macao
8.65
8.7
23
24
(7.69 + 7.8)/2 = 7.75
Zhejiang
Ningxia
Sichuan
9.36
10.09
10.24
25
26
27
(there are 2 “middle values”)
Fujian 10.38 28
Yunnan 13.29 29
Anhui 14.49 30
Guizhou 14.58 31
Qinghai 16.68 32
Gansu 17.77 33
Xizang 37.77 34 Note: data for Taiwan is included
Sum 296.15 7
n
2 n 2
( Xi - X ) X i - [( X ) 2 / N ]
i =1 = i =1
N
N
Definition Formula Computation Formula
10000
0 In ArcGIS, you may obtain frequency counts
under 15 to 30 to 45 to 60 to 75 and
15 29 44 59 74 older on a categorical variable via:
years years years years years
--ArcToolbox>Analysis>Statistics>Frequency
Frequency Distributions for China Province Data
Symetric Distribution
Height of bar shows frequency
There are 16 provinces with
percent urban between 38.4% and
50.8% (mode)
Mode = (38.1+50.8)/2 =44.5
Mean = 48.97
Median = 44.0
Symetric distribution:
mean = median = mode
n
wixi
X = i =1
wi =population of each
n
i =1
wi province
• Very common error in GIS because we use
aggregated data frequently
13
20
21
22
i=1 wixi
n
n
X = Y= i =1
wiyi
i=1 wi
n
n
i =1
wi
23
2 4 7
3 7 7
4 7 3 n n
2,3
7,3 5 6 2
Xi Y i
sum 26 22
X= i =1
,Y = i =1
6,2 n n
Centroid/MC 5.2 4.4
0
0 5 10
10
7,3
2
3
4
7
7
7
500
400
2,000
2,800
3,500
2,800
wX i wY i i i
X= i =1
,Y = i =1
w w
2,3 4 7 3 100 700 300
i i
5 6 2 300 1,800 600
6,2
sum 26 22 4,300 13,300 16,200
0
w MC 3.09 3.77
0 5 10
24
Median Center:
Intersection of a north/south and an
east/west line drawn so half of
population lives above and half
below the e/w line, and half lives to
the left and half to the right of the n/s
line
Mean Center:
Balancing point of a weightless map,
if equal weights placed on it at the
residence of every person on census
day.
i =1 i=1
n n
( Xi - Xc ) 2
(Yi - Yc ) 2
i=1 wi( Xi - Xc)2 i=1 wi(Yi - Yc)2
n n
i=1 wi
n
N
which by Pythagoras
i =1
n 2
d iC
reduces to:
N
---essentially the average distance of points from the center
Provides a single unit measure of the spread or dispersion of a
distribution.
We can also calculate a weighted standard distance analogous to the 27
weighted mean center. Briggs Henan University 2010
Standard Distance Deviation Example
10
Circle with radii=SDD=2.9
4,7
7,7
5
7,3
2,3
i X Y (X - Xc)2 (Y - Yc)2
6,2
1 2 3 10.2 2.0
0
2 4 7 1.4 6.8
3 7 7 3.2 6.8
0 5 10
4 7 3 3.2 2.0
i X Y (X - Xc)2 (Y - Yc)2
5 6 2 0.6 5.8
1 2 3 10.2 2.0
sum 26 22 18.8 23.2 2 4 7 1.4 6.8
Centroid 5.2 4.4 3 7 7 3.2 6.8
sum 42.00 4 7 3 3.2 2.0
divide N 8.40 5 6 2 0.6 5.8
sq rt 2.90
sum 26 22 18.8 23.2
Centroid 5.2 4.4
sum of sums 42
divide N 8.4
sq rt 2.90
( Xi - Xc ) 2 i =1 (Yi - Yc ) 2
n n
sdd = i =1
N 28
There appears to be no
major difference
between the location of
the software and the
telecommunications
industry in North
Texas.
31
Kernel=10,000 Kernel=5,000
33
high high
36
37