Variance and Covariance
Variance
The variance measures the spread of a distribution.
Definition
Let be a random variable with mean . The variance of , denoted by or or , is defined by
assuming this expectation exists. The standard deviation is , and is denoted by and .
We can’t use as a measure of spread, since . We can sometimes use as a measure of spread, but often we use the variance.
Theorem
Assuming the variance is well defined, it has the following properties:
- .
- If and are constants, then .
- If are independent and are constants, then
Sample Mean and Sample Variance
Definition
If are random variables, then we define the sample mean to be
and the sample variance to be
Theorem
Let be IID and let , . Then,
Covariance and Correlation
If and are random variables, then the covariance and correlation between and measure how strong the relationship is between and .
Definition
let and be random variables with means and , and standard deviations and . Define the covariance between and by
and the correlation by
Theorem
The covariance satisfies
The correlation satisfies
If for some constants and , then if , and if . If and are independent, then . The converse is not true in general.
Theorem
, and . More generally, for random variables ,
Definition
For a random vector , the covariance matrix or variance-covariance matrix is a square matrix giving the covariance between each pair of random variables of the vector . It is defined by
Intuitively, the covariance matrix generalizes the notion of variance to multiple dimensions.