Академический Документы
Профессиональный Документы
Культура Документы
MULTIVARIATE ANALYSIS OF
RELEVANCE OF CONTAGIOUS
DISEASES IN HOSPITALS
(CASE STUDY OF THREE INFECTIOUS
DISEASES)
1
ABSTRACT
This paper concentrated on the prevalence of three contagious diseases between the years 2003 2012. It was observed that the rate
of infections in Osun State and Oyo State is alarming rampant, which implies that there is variation between the various diseases.
Discriminant analysis and Multivariate Analysis of Variance (MANOVA) were also employed to see if there is any variation in the
rate of infections over the years between the hospital and among the diseases. The results showed that the diseases are on the
alarming rate in Osun state and Oyo State. However it is good to note that the data collected is from two teaching Hospital one in
Osun state the other in Oyo state. But the truth is that the diseases rate of infections in any country is directly connected to the level
of infections of the people in the country.
Keywords: Contagious Diseases, Discriminant analysis, Multivariate Analysis of Variance (MANOVA)
1.INTRODUCTION
The occurrence of outbreaks of cholera in Africa in 1970 and in Latin America in 1991, mostly in seaboard groups, and the
look of the new serotype Vibrio cholera are 139 in India and subsequently in Bangladesh have stimulated efforts to realize
ecological components leveraging the development and geographic circulation of outbreak Vibrio cholerae serotypes.
Because of the severity of latest outbreaks, cholera is now being advised by some contagious infection examiners as a
reemerging infection, prompting new work on the ecology of vibrios.
Epidemiological and environmental surveillance for cholera has been under way in four countries, geographically divided
positions in Bangladesh for the past 4 years, throughout which both clinical and ecological trials were assembled at
biweekly intervals. The clinical epidemiology piece of the study has been released (Sack et al., J. Infect. Dis. 187:96-101,
2003). The outcomes of ecological trying and investigation of the ecological and clinical facts and numbers have disclosed
important associations of water warmth, water deepness, rainfall, conductivity, and copepod enumerations with the incident
of cholera toxin-producing pathogens (presumably V. cholerae). The lag time span between rises or declines in flats of
components, for example warmth and salinity, and incident of cholera correlate with biological parameters, for
demonstration, plankton community blooms. The new data on the ecology of cholera is verifying helpful in evolving
ecological forms for the proposition of cholera epidemics.
Tuberculosis; Tuberculosis is a common disease which is caused by bacteria. It is usually affect the respiratory organ
system which involves gastro intestinal and genitor-urinary tract, bones and joints, nervous system, skin and lymphatic.
Extra- pulmonary tuberculosis is more common than pulmonary. Tuberculosis may enter body by inhalation, injection or
direct inoculation.
Diarrhoea: Is another serious infectious diseases characterized by the formation of fibrins pseudo membranes. Mucus
membrane is a thin layer of skin that cover the inside of the nose and outside of other organs in the body, producing mucus
to prevent these parts from becoming dry and it is usually transmitted by droplets.
Multivariate analysis in statistics is devoted to the summarization, representation, and interpretation of data when more
than one characteristics of each sample unit is measured[3],[6],[7]. Almost all data-collection processes yield multivariate
data. The medical diagnostician examines pulse rate, blood pressure, hemoglobin, temperature and so forth, the educator
observes for individuals such quantities as intelligence scores, quantitative aptitudes and class grade; the economist may
consider at points in time indexes and measures such as per-capital personal income, the gross national product,
employment and the Dow- Jones average. Problem using these data are multivariate because inevitable the measure are
Page 7
interrelated and because investigations involve inquiry into the nature of such interrelationships and their uses in prediction,
estimation, and methods of classification[1],[2].[5]. Thus, multivariate analysis deals with samples in which for each unit
examined these are observation on two or more stochastically related measurement. Most of multivariate analysis deals with
estimation, confidence sets, and hypothesis testing for mean, variance, covariance, correlation coefficients and related, more
complex population characteristics.
The objective of this study is to determine if there is discriminant relationship among the named infectious diseases, to
examine the number of patient that are being infected with infectious disease and to know if there is significant difference
among the three contagious diseases
2.SOURCE OF DATA
The data use for this study were collected from University College Hospital Ibadan, Oyo State and LAUTECH teaching
hospital, Ogomosho respectively on Tuberculosis, Diarrhoea and Viral hypetisis between 2003-2012
yijk eijk
yijk is a vector of observation
W
B W
~ U( p,1, N1)
U p,1, N 1,
Page 8
two groups, using the Mahalanobi,s metric of generalized distances. It also gives the same linear separating decision
surfaces as Bayesian maximum likelihood discrimination in the case of equal class covariance matrices.
Purpose of Using L.D.A
LDA is used to find linear combination in groups of objects. The linear combinations classify the object into separate groups
used in analyzing information, and are used for dimensionally reduction and classification of data.
The maid purpose of LDA is to find commonalities in data.
LDA is statistically given as
n1 n 2 n1 n 2 1 k
2
D
n1 n 2 ( n1 n 2 2 ) k
where D 2 is mahalonbis square dis tan ce
Fcal
which is given as D 2 ( X 1 X 2 ) S 1 ( X 1 X 2 )
and is the linear discri min ant given as
y ( X 1 X 2 ) S 1 X
1
S 1 S 2
n1 n 2 2
YEAR
2003
2004
2005
2006
2007
2008
2009
2010
2011
2012
TB`
66
73
41
38
40
27
47
34
8
44
DISEASES
DIARRHOEA VIT
64
9
64
4
16
4
24
1
12
4
15
3
26
6
18
3
13
1
4
Page 9
Discriminant Analysis
Ho: There is no significant difference in prevalence among the contagious diseases
HI: There is significant difference in prevalence among the contagious diseases
Level of significance =0.05
Discriminant Analysis: hospital versus TUBERCULOSIS(TB)`, DIARRHEA, VIRALHYPETISIS(VIT)
Linear Method for Response: hospital
Predictors: TB, DIARRHOEA, VIT
Group
1
2
Count
10
9
19 cases used, 1 cases contain missing values
Squared Distance Between Groups
1
2
1 0.00000 4.96779
2 4.96779 0.00000
Linear Discriminant Function for Groups
1
2
Constant -2.0324 -4.0634
TB`
0.0423 0.2368
DIARRHOEA 0.0306 -0.0499
VIT
0.2562 -0.0815
Summary of Misclassified Observations
Decision rule: Reject Ho if Fcal. Ftab., otherwise do not reject Ho
Test statistic
Computation; Fcal=
=4.7368(0.2941) (24.6789)
=34.3799
Conclusion;Since Fcal. Ftab. (i.e 34.37994.63) we Reject Ho and conclude that there is significant difference in the rate
of infectious disease
MANOVA (Using Hotelling
- Test)
HYPOTHESIS TESTING
Ho: There is no significant difference in the prevalence of tuberculosis between the two hospitals considered.
H1: There is significant different in the prvalience of the infection between the hospitals considered.
Level of significant: =0.05
Decision rule: reject H0 if P-value < 0.05 otherwise do not Ho
Let X1 represent TB
Let X2 represent Diarrhoea
Let X3 represent VIT
MANOVA for TUBERCULOSIS
s = 1 m = -0.5 n = 6.5
Page 10
Conclusion: There is significant different in the prevalence of Tuberculoses between the two hospital.
HYPOTHESIS TESTING
Ho: There is no significant difference in the prevalence of DIARRHOEA between the two hospitals considered.
H1: There is significant difference in the prevalence of DIARRHOEA between the two hospitals considered.
Level of significant: =0.05
Decision rule: reject H0 if Fcal Ftab otherwise do not reject Ho.
MANOVA for DIARRHOEA
s = 1 m = -0.5 n = 6.5
Table 5
Conclusion: Fcal Ftab(i.e1.255 7.856)we accept H0 and conclude There is no significant difference in the prevalence of
DIARRHOEA between the two hospital considered
HYPOTHESIS TESTING
Ho: There is no significant difference in the prevalence of VIRAL HYPETITIS
Hi: There is significant difference in the prevalence of VIRAL HYPETITIS
Level of significant: =0.05
MANOVA for VIRAL HYPETITIS
s = 1 m = -0.5 n = 6.5
Table 6
Conclusion: There is significant difference in the prevalence of VIRAL HYPETITIS between the two hospital considered
5.CONCLUSION OF RESULTS
From all the analysis and computation of the data carried out in this study, Discriminant analysis was used to analysis the
data while Hotelling T2 Test was carried out to see if there was a significant difference in the type of disease and the level
of infection in studied communities.
Based on the discussion of result we conclude that the prevalence of the diseases considered in the two hospitals is as
follows
Page 11
There is significant different in the prevalence of Tuberculoses between the two hospital.
There is no significant difference in the prevalence of DIARRHOEA between the two hospitals considered.
There is significant difference in the prevalence of VIRAL HYPETITIS between the two hospitals considered.
REFERENCE
[1]. Anderson T. W. (2003) An Introduction to Multivariate Statistical Analysis (3rd ed.) New York: John Wiley. Box,
G.E.P, W.G. Hunter and J.S. Hunter (2005). Statistics for Experimenters (2nd edition) New York: John Wiley
[2]. Gabriel, K. R (1968), Simultaneous Test Procedures Some theory of multiple comparisons, annals of mathematics
and statistics, 40, 224-250.
[3]. Hardin, J.W., and Hilbe, J.M. (2007), Generalized Linear Models and Extensions, 2nd ed., Stata Press, College Station,
TX.
[4]. Jobson, J. D (1992), Applied Multivariate Data Analysis, Volume II: categorical and multivariate methods, New York:
Springer-Verlag.
[5]. Johnson R.A. and G.K. Bhattachary (2005) Statistics: Principles and methods (5th edition) New York; John Wiley.
[6]. Johnson R.A. and R.L (2006) Multivariate statistical process control schemes for controlling a mean Springer
Handbook of engineering statistics H. Pham. Ed Springer, Berlin.
[7]. Kleinbaum, Kupper L.L, Muller KE and Nizam A(2008) Applied regression analysis and other multivariable methods
(4th edition) New York: NY:
[8]. Montgomery, D.C (2005) Design and Analysis of Experiments (6th edition) New York: John Wiley.
[9]. Morrison, D.F. (2005) Multivariate statistical methods (4th edition) Belmont C.A: Brooks/Cole Thomson .Learning.
Page 12