Академический Документы
Профессиональный Документы
Культура Документы
By submitting a brief SAS program, you can access a procedure from the library and use it to analyze a set of data.
Most statistical research involves gathering data and performing analyses to determine what data means.
_________________________________________________________________________________________________
The process of research often begins by developing a clear statement of the research question (or questions). The
research question is a statement of what you hope to have learned by the time the research has been completed. It is
good practice to revise and refine the research question several times to ensure that you are very clear about what it is
you really want to know. The reason behind refining and revising the research question is to make it concise w.r.t various
events and variables so that the research is correctly focused Ex: "What variables have a causal effect on an event", once
the research question has been more clearly defined, you are in a better position to develop a good hypothesis that
provides a possible answer to the question.
A hypothesis is a statement about the predicted relationships among events or variables. In developing the hypothesis,
you might be influenced by any number of sources: an existing theory, some related research, or even personal
experience. For ex. we can state a hypothesis "the amount of Y increases the amount of X" below figure illustrates this
relationship:
The variable being affected (X) appears on the left side of the
figure, the causal variable (Y) appears on the right and the
arrow reflects the prediction that Y is the causal variable, and
X is the variable being affected.
A statistical null hypothesis is typically a prediction that there
is no difference between groups in the population, or that
there is no relationship between variables in the population.
You will analyze the data from your sample, and if the observed difference is large enough, you will reject this null
hypothesis of no difference. Rejecting this statistical null hypothesis means that you have obtained some support for your
original research hypothesis (the hypothesis that there is a difference between the groups).
Statistical null hypotheses are often represented symbolically.
A statistical alternative hypothesis is typically a prediction that there is a difference between groups in the population,
or that there is relationship between variables in the population. The alternative hypothesis is the counterpart to the null
hypothesis; if you reject the null hypothesis, you tentatively accept the alternative hypothesis. There are different ways
that you can state alternative hypotheses. One way is simply to predict that there is a difference between the population
means, without predicting which population mean is higher.
A Nondirectional hypotheses (also known as a two-sided or two-tailed alternative hypothesis) simply predicts that one
population mean differs from the other population meanit does not predict which population mean will be higher. With
a nondirectional alternative hypothesis, you are predicting some type of difference, but you are not predicting the specific
nature, or direction, of the difference.
Directional hypotheses (also known as a one-sided or one-tailed alternative hypothesis) not only predicts that there will
be a difference, but also makes a specific prediction about which population will display the higher mean.
____________________________________________________________________________
Data is defined as a collection of scores that are obtained when subject characteristics and/or performance are observed
and recorded. Different types of instruments can be used to obtain different types of data.
With the null hypothesis stated, you can now test it by conducting a study in which you gather and analyze relevant data.
The study results would lend some support to your research hypothesis; if not, the results would fail to provide support.
In either case, you would be able to draw conclusions regarding the tenability of your hypotheses, and would have made
some progress toward answering your research question. For example, if you obtained support for your hypothesis with
a correlational study, you might choose to follow it up with a study using a different research method, perhaps an
experimental study.
A variable refers to some specific characteristic of a subject that can assume one or more different values. A value, refers
to either a particular subject's relative standing on a quantitative variable, or a subject's classification within a classification
variable. Quantitative variables represent the quantity, or amount, of the construct that is being assessed, numbers
typically serve as values. In Classification (qualitative/categorical) variables different values represent different groups
to which the subject might belong.
The independent variable is that variable whose values (or levels) are selected by the experimenter to determine what
effect the independent variable has on the dependent variable. The independent variable is the experimental counterpart
to a predictor variable. A dependent variable, on the other hand, is some aspect of the subject's behavior that is assessed
to determine whether it has been affected by the independent variable. The dependent variable is the experimental
counterpart to a criterion variable.
predictor variable and criterion variable can be used with almost any type of researchexperimental or
nonexperimental. However, the terms independent variable and dependent variable should be used only with
experimental researchresearch conducted under controlled conditions with a true manipulated independent variable.
Levels of the independent variable: experimental conditions or treatment conditions, corresponding to the different
groups to which a subject might be assigned. With respect to the independent variable, it is common to speak of the
experimental group versus the control group. Generally speaking, the experimental group is the group that receives the
experimental treatment of interest, while the control group is an equivalent group of subjects that does not receive this
treatment. The simplest type of experiment consists of one experimental group and one control group.
Type-of variable figure: graphically illustrates the number of values that are assumed by predictor and criterion variables.
place the symbol for the dependent variable on the left side of the equals sign (=),
place the symbol for the independent variable on the right side of the equals sign.
The preceding type-of-variable figure could be used to illustrate any experiment in which the dependent variable was a
multi-value variable and the independent variable was a dichotomous variable.
Three Types of SAS Files: one file will contain the SAS program, one will contain the SAS log, and one will contain the SAS
output.
SAS Program: consists of a set of statements written by the user which provide SAS System with the data to be analyzed,
tell SAS about the nature of the data, and indicate which statistical analyses should be performed on the data. These
statements are usually typed as data lines in a file in the computers memory. The DATA step versus the PROC step. There
is another, more fundamental way, to divide a SAS program into its constituent components. It is possible to think of each
SAS program as consisting of a DATA step and a PROC step. In the DATA step, programming statements create and/or
modify a SAS data set. the PROC step includes statements that request specific statistical analyses of the data.
After submitting the SAS program. Once the preceding program has been submitted for analysis, SAS will create two types
of files reporting the results of the analysis, SAS log file & SAS output file.
SAS log is generated by SAS after you submit your program. It is a summary of notes and messages generated by SAS as
your program executes. These notes and messages will help you verify that your SAS program ran correctly.
Specifically, the SAS log provides
a reprinting of the SAS program that was submitted (minus the data lines)
a listing of notes indicating how many variables and observations are contained in the data set
a listing of any notes, warnings, or error messages generated during the execution of the SAS program
The SAS output file contains the results of the statistical analyses requested in the SAS program. An output file is
sometimes called a listing file, because it contains a listing of the results of the analyses that were requested.