Midterm exam information, STATS 401 W18

Summation exercises

S1. A basic exercise.

S2. An example involving the summation representation of matrix multiplication.

R exercises

R1. Using rep() and matrix().

R2. Manipulating vectors and matrices in R.

Fitting a linear model by least squares

[This category is similar, but slightly different, from the F1 and F2 questions in the quiz.]

F1. Write the sample version of a linear model in subscript form given the matrix form.

F2. Write the sample version of a linear model in matrix form given the subscript form.

F3. Write the sample version of a linear model in matrix form subscript form given a dataset and verbal description of the model OR writing the sample version of a linear model in matrix form given a dataset and verbal description of the model.

F4. Explain how to obtain the least square value of the coefficients and the fitted values.

Properties of variance and covariance

V1. A numerical calculation to find the variance of a linear combination using matrix techniques.

V2. An algebraic calculation using basic definitions of variance & covariance, together with the linearity of expectation.

Normal probability calculations

N1. A normal approximation to estimate a probability using the mean and variance.

N2. A normal approximation to find a region with a given probability using the mean and variance.

The population version (or probability version) of the linear model

P1. Describe a suitable probability model, in subscript form, to give a population version of a linear model.

P2. Describe a suitable probability model, in matrix form, to give a population version of a linear model.

P3. Explain how R produces standard errors for coefficients in a linear model. Interpret the standard errors using the probability model.

Example: patient satisfaction in a hospital

The following survey data on a collection of hospital patients measures self-reported satisfaction, age, a measure of case severity, and a measure of anxiety. The hospital managers want to see whether satisfaction can be explained by the other variables, and, if so, which variables are important.

patients <- read.table("patients.txt",header=T)
dim(patients)

## [1] 46  4

head(patients)

##   Satisfaction Age Severity Anxiety
## 1           48  50       51     2.3
## 2           57  36       46     2.3
## 3           66  40       48     2.2
## 4           70  41       44     1.8
## 5           89  28       43     1.8
## 6           36  49       54     2.9

(F1,F2,F3). Write the sample version of a linear model to address this question, in subscript form and matrix form.

(P1,P2). Write a probability model that can be used to assess the chance variation in the coefficients of the sample linear model. What is the source of this chance variation?

(P3) Explain how this probability model is used to obtain standard errors for the coefficient estimates.

Exercise related to HW6: Working with a generic \((i,j)\) component of a matrix

A matrix quantity such as \({\mathrm{Var}}({\mathbf{X}})\) is really a collection of quantities for each row \(i\) and column \(j\).
Sometimes, when working with matrices, it is helpful to write equations for a generic \((i,j)\) component. Something that you work out for a generic \((i,j)\) component is necessarily true for the whole matrix.
On questions of the form “Show that \(A=B\)” you can in principle start from \(B\) and show how to get \(A\) or vice versa. Usually, people work left-to-right so the question is likely suggesting that it is simpler to start from \(A\) and show how to get to \(B\).
Here, we are considering questions of the form “Show that \({\mathbb{A}}={\mathbb{B}}\)”. We can do this by showing that \([{\mathbb{A}}]_{ij}=[{\mathbb{B}}]_{ij}\) for an arbitrary \((i,j)\).

Example. Let \({\mathbf{X}}=(X_1,\dots,X_n)\) and \({\mathbf{Y}}=(Y_1,\dots,Y_n)\) be independent vector random variables with \(n\times n\) variance matrices \({\mathbb{U}}\) and \({\mathbb{V}}\) respectively. Show that \({\mathrm{Var}}({\mathbf{X}}+{\mathbf{Y}})={\mathbb{U}}+{\mathbb{V}}\). This is a version for vector random variables of the formula \({\mathrm{Var}}(X_i+Y_i)={\mathrm{Var}}(X_i)+{\mathrm{Var}}(Y_i)\), which we have already seen for the variance of a sum of two independent random variables. You can use the definitions of variance and covariance. You may also use the basic property of expectation of a product of independent random variables, that \({\mathrm{E}}[X_iY_j]={\mathrm{E}}[X_i]{\mathrm{E}}[Y_j]\).

License: This material is provided under an MIT license

Midterm exam information, STATS 401 W18

Scheduling

Instructions

Formulas

Question categories.