Вы находитесь на странице: 1из 4

Statistics

ASSIGNMENT NO. 2

Sampling and Sampling distribution.


Problem 1. The time spent by 40 students, randomly selected, studying for a Statistics test (hours) was
recorded, and the processing results are shown below:

Time (hours)
Mean … a. How would you describe the central tendency of the
Standard Error … data?
b. Is the sample average time representative? Why?
Median 4,55
c. Analyze the distribution shape, using appropriate
Mode 3,6 statistical indicators.
Standard Deviation …. d. Estimate – using a 95% confidence interval – the average
Sample Variance 1,90 time spent on study by a student in the total population
Kurtosis -0,86 (z=1,96).
e. How many students should we include in the sample, in
Skewness 0,22
order to obtain a maximum error 15% lower than the
Range … previous one?
Minimum 2,4
Maximum 7,3
Sum 185,4
Count …
Confidence Level (95%)

Problem 2. A veterinarian recorded the number of pets owned by 80 families, randomly selected
from a total population. The summarized data are presented in the following table:

Number of pets 0 1 2 3 4 5
% of families 2,5 13,75 27,5 26,25 18,75 11,25

a. Identify the absolute frequency distribution and represent it in a graph. Comment on the graph.
b. What is the weighted mean number of pets owned by a family in the sample? Is it
representative? Why?
c. Find the median and the modal number of pets and interpret the results obtained.
d. Analyze the distribution shape, using an appropriate statistical indicator.
e. Find and interpret the quartiles of the data set.
f. Estimate the average number of pets owned by a family in the total population, using a 90%
confidence interval (z=1,65).
g. How many families should be included in the sample, in order to obtain a length of the
confidence interval of 0,2?
h. Find a confidence interval for the proportion of families in the total population which own at
most 2 pets.
i. Based on the results previously obtained, fill in the Descriptive Statistics table below:

Number of pets
Mean …
Standard Error …
Median …
Mode …
Standard Deviation …
Sample Variance …
Kurtosis -0,70
Skewness 0,04
Range …
Minimum …
Maximum …
Sum …
Count …
Confidence Level(90,0%) …

Problem 3. For a sample of 70 IT enterprises the following data are given:

Number of employees (pers.) Less than 10 10-20 20-30 30-40 40-50 More than 50
Number of enterprises 5 15 25 15 8 2

a) Analyze the shape of the enterprises distribution by the number of employees, using a graph;
b) Describe the central tendency of the distribution, using appropriate indicators.
c) Is the data-set homogeneous? Explain.
d) Analyze the distribution shape, using an appropriate statistical indicator.
e) Estimate the average number of employees of an enterprise in the total population, using
a 95% confidence interval (z=2).
f) Find the new sample size, in order to obtain a marginal error 10% lower than the
previous one.
g) Estimate the % of enterprises in the total population which have more than 30
employees.
h) Based on the results previously obtained, fill in the Descriptive Statistics table below:

Number of employees (persons)


Mean …
Standard Error …
Median …
Mode …
Standard Deviation …
Sample Variance …
Kurtosis 0,23
Skewness 0,17
Range …
Minimum …
Maximum …
Sum …
Count …
Confidence Level(95%) …

Problem 4. A representative of the City Hall made a survey of public opinion on the existing parking spaces.
200 citizens were randomly sampled and it was found out that 134 of them agreed that parking is indeed a
problem. Provide a 90% confidence interval for the proportion of citizens in the entire population who think
parking is a problem. (z=1,65).
Regression and Correlation Method
Problem 5. A researcher from the Ministry of Culture has recorded - for 10 museums – the number of
advertising contracts signed and the number of visitors in 2015 (thousand persons):

Number of advertising
7 5 9 8 10 2 6 7 9 10
contracts signed
Number of visitors
42 32 50 40 61 8 35 34 54 65
(thousand persons)

a. Name the two variables and identify the independent variable and the dependent variable.
b. Graph the data and interpret the graph. (you may use Excel scatter plot)
c. Identify the linear regression equation in the sample, and interpret the values of b0 and b1 (use the Excel
functions: Intercept and Slope).
d. Predict the number of visitors if the museum has 3 advertising contracts signed.
e. Analyze the direction and the strength of the relationship between the two variables using appropriate
indicators (covariance, Pearson-s linear correlation coefficient). Use the Excel functions: Covariance
and Correl and the menu options: Data-Data Analysis-Correlation in order to find the Correlation
Matrix).

Problem 6. A sales manager is interested in determining the relationship between the amount spent on
advertising (thousand $) and total sales (thousand $). The manager collects data for the past 17 months and
runs a regression of sales on advertising expenditures. The results are presented below:

How would you interpret the values of the


model coefficients?
c. Predict the sales value if next month the company
spends 15 thousand dollars on advertising. Under
what conditions the projected value will comply
with reality?
d. Compute and interpret the linear correlation
coefficient, considering that:

Cov(x,y) =56,65
a. Analyze the relationship between the two
variables, using the chart displayed.
b. What is the most adequate linear regression Fill in the Correlation matrix:
equation: Amount spent Total sales
i) on advertising
ii) Amount spent …… …..
iii) on advertising
iv) Total sales …… …..

Time series

Problem 7. An airplane company recorded the airplane ticket prices to a specific destination, from
2011 to 2015 ($):

Year 2011 2012 2013 2014 2015


Airplane ticket price ($) 311 320 348 335 384
It is required to:
a. Graph the data and identify the type of the time-series (flow or stock);
b. Determine the absolute indicators and interpret the results for 2014.
c. Determine the relative indicators and interpret the results for 2014.
d. Determine the average indicators and interpret the results;
e. Identify the trend component of the time series, using:
i. The “average absolute change” method;
ii. The “average time index” method;
iii. The “linear trend” method.
f. Predict the airplane ticket price for the next 2 years, using the three above mentioned
methods.

Problem 8. A travel agency examines the evolution of tourist packages sold between 2010 and 2015:
Year 2010 2011 2012 2013 2014 2015

Chain-base absolute changes of the number of tourist 0 12 -4 12 3 5


packages (hundreds packages)

a. Knowing that in 2015 there were 68 hundreds tourist packages sold, identify the elements of the time
series and represent them in a graph.
b. Identify the type of the time series; characterize the series using relative indicators. Interpret the values
for 2012.
c. Fill in the following statement: „The number of tourist packages increased, on average, by ......... hundred
packages per year, or by ...... % on average per year.
d. Identify the trend component, using the linear trend method and predict the number of tourist packages
sales for 2017 and 2018.

2. Proiectul se realizează în:


 document EXCEL
 document word - care va fi predat listat
3. Ambele documente vor alcătui un folder cu denumirea temei.
4. eful/ efa grupei va prelua folderele i le va transfera pe un DVD care vaȘ ș ș
fi predat odată cu proiectele, în ziua testulu

Вам также может понравиться