It’s well-known that the average of i.i.d. r.v.’s with variance has variance .
Suppose that the r.v.’s ’s are only identically distributed with variance and pairwise correlation , then we have:
This equation suggests that as the size increases, though the second term disappears, the first term remains constant. So the averaging over more and more correlated r.v.’s will not keep improving the variance.
Wait a second! Our derivation of the above equation doesn’t depend on sign of . But the above equation seems to fail when since the variance would then be negative for a sufficiently large . What’s wrong with it?
Let’s consider the simple case when and . First of all, is it even possible to construct such 3 r.v.’s? We see that since ’s are identically distributed, if and , we have and . But this contradicts the fact that .
Actually, we cannot have arbitrary number of r.v.’s that are identically distributed with a negative pairwise correlation. We will show it by using the fact that the determinant of a correlation matrix must be positive semi-definite.
Let and the correlation matrices for the case of positive and negative pairwise correlations are denoted by:
respectively.
Then we have the following:
where we used the matrix determinant lemma: .
It shows that is always positive definite for . So there’s no problem with such identically distribution r.v.’s with a positive pairwise correlation .
Now, let’s compute determinant of :
To ensure , we need the condition that
Thus, we see that, for r.v.’s that are identically distributed with a negative pairwise correlation , actually cannot be smaller than . For example, 3 identically distributed r.v.’s cannot have a pairwise correlation smaller than and this bound becomes for 4 identically distributed r.v.’s.
Lastly, we make two more observations based on the above findings.
- For identically distributed r.v.’s ’s with pairwise correlation , what are the maximum and minimum values of ?
- How do we construct such ’s with pairwise correlation ?
Well, for a positive pairwise correlation, we’ve showed that is always positive definite. Thus, there exists a decomposition for some matrix . Then, for any i.i.d. r.v.’s ’s with unit variance, the linear transformation should have the desired variance-covariance matrix since
For a negative pairwise correlation , as long as , we know is also positive definite, thus the above construction also works in this case.