Академический Документы
Профессиональный Документы
Культура Документы
Session Content
1,2 Understanding R-Studio, Different Window Panes, Basic Variables and Structures:
integer, float, array vector, matrix, data-frame, list, Random Number generation and
understanding shape of distribution, Saving and Loading R Data files and Outputs.
3,4 Importing dataframes from other file formats, difference between load and import, viewing
data with head(), tail()
Understanding a Dataframe: column names, changing column names, dimensions, data
slicing, data merging, data sorting, sampling from dataset, sub-setting data, Frequency tables,
Manipulate Date functions, generate Derived variables.
5,6 Simple Graphs in R: box charts and pie charts, box plots, histograms, line diagrams, density
plots/dividing the plotting screen, saving a plot, using ggplot2, graphics grammar, complex
plots.
7,8 Statistical Data Processing and Hypothesis Testing: descriptive statistics, frequency tables,
sapply() features, t-test with R, chi-square test with R, ANOVA using R
9,10 Correlation and Regression Analysis with R: correlation coefficients, corplot() function,
regression procedure (multiple linear), model validation, stepwise regression, prediction from
regression.
Factor Analysis using R: concept of factors, eigenvalues of principal components,
commonality and variance lost, factoring functions in R, composition of factors/naming the
factors, factor rotation.
11,12 Clustering of Data with R: concept of clustering (unsupervised learning), concept of
distance, hierarchical clusters and their working logic, dendograms, R functions for
hierarchical clustering, concept of cluster profile and cluster means, understanding working of
k-means algorithm, performing k-means using R, determination of cluster profiles
The course outline will be elaborated further in session objective for each session. The instructor
will communicate the session objectives in the class.