Given an input \(X_0\) with \(d\) features, a fully connected layer of \(n\) units:
Trains \(n(d+1)\) parameters: \(d\) weights and one bias per unit
Outputs an array of size \(n\)
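
The two counts above can be checked with a minimal NumPy sketch. The sizes `d = 4` and `n = 3`, the names `dense_forward`, `W`, and `b`, and the omission of an activation function are assumptions made for illustration only; they are not taken from the text.

```python
import numpy as np

def dense_forward(x, W, b):
    """Affine map of a fully connected layer (activation omitted)."""
    return W @ x + b

d, n = 4, 3                    # hypothetical sizes chosen for illustration
rng = np.random.default_rng(0)
x0 = rng.normal(size=d)        # input X_0 with d features
W = rng.normal(size=(n, d))    # one row of d weights per unit
b = np.zeros(n)                # one bias per unit

n_params = W.size + b.size     # n*d + n = n(d+1)
out = dense_forward(x0, W, b)
print(n_params, out.shape)     # 15 (3,)
```

With these sizes the layer holds \(3 \times (4+1) = 15\) trainable values and returns an array of size \(3\), matching the formulas above.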