PCA: reduce high dimensional containing a high number of dimensions/features per observation

Principal components: a collection of points in real coordinatie space (real coordinate space of dimension is the set of te n-tuples of real numbers) are a sequence of p unit vectors

<aside> 💫 PCA is the process of computing the principal components and using them to perfrom a change of basis on the data, sometimes usimng only the first few principal components and ignore the rest.

</aside>

First principal components of a set of p variables: presumed to be jointly normally distributed, is the derived varaible formed as a linear combination of the orginal variables that explains the most variance.

Second principal components: explains the most variance is what’s left once the effect of first components is removed. (we may proceed thorugh p iterations until all variance is explained).