Академический Документы
Профессиональный Документы
Культура Документы
Exchange. This raw data represents the traded shares (X variable) and its turnover (Y variable)
in February and March of the present year.
1
I. For each of the two variables:
a) Calculate and interpret the average, standard deviation and the coefficient of variation
for row data. Interpret the results. Is the data series homogenous?
AVERAGE:
n
x shares (1)
x i 1
9, 611, 913.50
n
STANDARD DEVIATION:
2
xi
n
x0
shares (2)
s i 1
5,323,400.96
n 1
COEFFICIENT OF VARIATION:
s 5,323,400.96
cv 100 100 55.38% (3)
x 9,611,913.50
The coefficient of variation is higher than 35% so we concluded that the data is homogenous,
meaning that the average is representative.
2. Turnover (Y)
AVERAGE:
n
x RON (4)
x i 1
3,529,767.89
n
STANDARD DEVIATION:
2
n
xi x0 RON (5)
s i 1
2,257,093.08
n 1
2
COEFFICIENT OF VARIATION:
s 2,257,093.08
cv 100 100 63.94% (6)
x 3,529,767.89
The coefficient of variation is higher than 35% so we concluded that the data is
homogenous, meaning that the average is representative.
We have chosen the size of the intervals of 5,000,000 traded shares. The classes begin
at 1,500,000 and end at 31,500,000.
2. Turnover (Y)
We have chosen the size of the intervals of 2,050,000 RON. The classes begin at
700,000 and end at 13,000,000.
3
c) Calculate and interpret for the frequency distribution the average, standard deviation
and coefficient of variance. Compare with the results from point a). Explain the
differences.
2. Turnover (Y)
In both cases we can observe that the values obtained now are lower than the ones at
point a). The explanation consists in the fact that here we have considered the middle point of
the interval as data.
4
d) Construct a histogram and describe the shape of the distribution based on the
histogram.
The shape of the histogram reveals us that the distribution is not equal. It presents a
maximum of 18 values between 6,500,000 and 11,500,000 traded shares. Between 21,500,000
and 26,500,000 there were no traded shares.
2. Turnover (Y)
The shape of the histogram reveals us that the distribution is not equal. It presents a
maximum of 19 values between 700,000 RON and 2,750,000 RON. Between 8,900,000 and
10,950,000 there is no registered turnover.
5
e) In which interval is expected that about 95% of the data will fall? Is this assumption
true for this data?
The empirical rule tells us how the data is spread based on the standard deviation, as
represented in the graphic above.
Based on the empirical rule 95% of the data will fall in the following interval:
x 2s , x 2s
1. Traded Shares (X)
For the X variable we have the interval below:
-1, 034, 888.414, 20, 258,715.41
The traded shares in the analyzed period of time are lower than 20,258,715.41 with a
probability of more than 95,45%.
2. Turnover (Y)
For the Y variable we have the interval below:
-984, 418.2629, 8,043, 954.049
The turnover in the analyzed period of time are lower than 8,043,954.049 with a
probability of more than 95,45%.
II. Using the “Pivot Table Wizard” in EXCEL, build a pivot table on your spreadsheet
(using also the second variable).
6
02.02.2009 5,403,514.00 1,879,119.48 Low Activity
03.02.2009 3,245,718.00 1,222,817.83 Low Activity
04.02.2009 7,500,332.00 2,579,067.62 Low Activity
05.02.2009 5,737,005.00 2,444,467.01 Low Activity
06.02.2009 5,049,353.00 1,730,092.10 Low Activity
09.02.2009 2,993,196.00 1,101,750.64 Low Activity
10.02.2009 1,808,274.00 705,179.22 Low Activity
11.02.2009 6,186,518.00 2,355,701.65 Low Activity
12.02.2009 13,946,398.00 6,209,479.06 Moderate Activity
13.02.2009 12,423,715.00 4,923,604.66 Moderate Activity
16.02.2009 8,907,782.00 3,206,025.83 Low Activity
17.02.2009 10,773,795.00 3,542,856.33 Moderate Activity
18.02.2009 11,417,417.00 3,347,756.87 Moderate Activity
19.02.2009 7,833,768.00 2,557,032.70 Low Activity
20.02.2009 10,467,840.00 3,462,192.83 Moderate Activity
23.02.2009 4,006,705.00 1,236,808.67 Low Activity
24.02.2009 8,779,805.00 2,470,934.48 Low Activity
25.02.2009 11,856,792.00 3,570,525.56 Moderate Activity
26.02.2009 6,777,797.00 1,945,496.40 Low Activity
27.02.2009 7,328,769.00 2,177,383.64 Low Activity
02.03.2009 2,703,902.00 778,406.38 Low Activity
03.03.2009 4,898,424.00 1,503,212.69 Low Activity
04.03.2009 6,230,026.00 2,103,037.85 Low Activity
05.03.2009 8,815,584.00 3,086,765.53 Low Activity
06.03.2009 7,134,930.00 2,205,016.17 Low Activity
09.03.2009 10,504,816.00 3,328,595.49 Moderate Activity
10.03.2009 11,674,273.00 3,765,877.67 Moderate Activity
11.03.2009 17,707,743.00 5,884,537.67 Moderate Activity
12.03.2009 7,216,209.00 2,624,394.96 Low Activity
13.03.2009 15,072,826.00 4,988,690.30 Moderate Activity
16.03.2009 9,970,215.00 3,620,283.31 Low Activity
17.03.2009 14,376,964.00 5,274,961.37 Moderate Activity
18.03.2009 9,386,785.00 3,427,668.08 Low Activity
19.03.2009 17,438,934.00 6,756,517.98 Moderate Activity
20.03.2009 5,196,603.00 2,177,679.34 Low Activity
23.03.2009 14,941,163.00 6,715,704.59 Moderate Activity
24.03.2009 31,224,878.00 12,851,219.79 High Activity
25.03.2009 10,001,218.00 4,310,862.02 Moderate Activity
26.03.2009 16,064,354.00 7,212,439.67 Moderate Activity
27.03.2009 14,206,280.00 6,410,988.44 Moderate Activity
30.03.2009 8,850,338.00 3,526,870.13 Low Activity
31.03.2009 7,639,409.00 3,028,229.49 Low Activity
Table 4 Categorization of the traded shares based on the market activity
The Market Activity column of the above table was built using the vlook-up function in
Microsoft Excel. This function was applied on Table 5. We have chosen three forms of market
activity which categorize the volume of traded shares.
7
LL Market Activity Interval
1,000,000 Low Activity 1,000,000-10,000,000
10,000,000 Moderate Activity 10,000,000-20,000,000
20,000,000 High Activity more than 20,000,000
Table 5 Table for building the vlook-up function
In order to characterize the market we have built a pivot table showing the average
transaction unit price and the market activity structure, based on the number of traded shares.
The average transaction value is automatically calculated by the pivot table, by dividing the
turnover to the number of traded shares.
III. Calculate the regression line, the coefficient of determination and coefficient of
correlation. Interpret the results.
14,000,000.00
12,000,000.00
10,000,000.00
Turnover
8,000,000.00
6,000,000.00
4,000,000.00
2,000,000.00
0.00
0.00 5,000,0 10,000, 15,000, 20,000, 25,000, 30,000, 35,000,
00.00 000.00 000.00 000.00 000.00 000.00 000.00
Traded shares
Figure 4 Scatter Diagram of the two variables
Figure 4 shows the relationship between the traded shares and the turnover. It can be
observed that the points are very close to or directly on the trend line. They are not very
scattered on the plot area, fact which reveals the linear relations between the two variables.
8
where b0 is the point where the line will intersect the y-axis and b1 is the slope of the line. The
values where calculated with the intercept and slope functions in Microsoft Excel.
Coefficient of determination:
R 2 0.973495291
The fact that the coefficient of determination is very close to its maximum value of 1,
means that there is a close relationship between the values.
Coefficient of correlation:
r R 2 = 0.947693081
The coefficient of correlation is also close to the maximum value, which means that there is a
close relationship between the selected variables.
P-values:
The P-values are less than α, 0.0000< 0.013, that means that the coefficients are
statistically coherent and we have a linear relationship between the 2 variables, X and Y.
9
References:
10