Gaussian Discrimnant Analysis#
Multivariate Gaussian#
Recall the multivariate gaussian can be represented as in terms of a quadratic function quadratic function \(q(x)\) which is called the quadratic form,
\(v\) : Deviation vector \(X_i-\mu\)
\(X_i\) : A row of data
\(\mu\) : A vector with each entry as the mean of the feature column.
The quadratic form \(q(x)\) describes the shape of the isocontours \(q(x) = c\) (for some real scalar \(c\)). For the Gaussian, the shape is a \(\dim(v)\) ellipsoid due to the mapping of a semi-circle function to parabolic function specified by,
Quadratic Discrimnant Analysis#
For each class in \(c : \{1,2,3,\ldots,C\}\) there exists a sample covariance matrix \(\hat\Sigma\),
The Bayes optimal rule maximizes \(P\) or rather \(\log P\) over the class \(c\) as its parameter. An equivalent description is to maximize the logistic function,
Linear Discrimnant Analysis#
Every class has the same sample covariance matrix which is calculated by the mean of each class’s covariance matrix. This is called the pooled within-class covariance matrix,
The Bayes optimal rule is equivalently a linear function,
Transformations#
Decorrelating: \(XU\)
Sphering: \(X\hat\Sigma^{-1/2}\)
Whitening: Decorrelating + Sphering