Putting it all together
Normalizing Flows as Maximum Likelihood Estimation
At its most basic level, Normalizing Flows can be seen as a parametric density estimation method based on Maximum Likelihood. In our case, we took a simple density
Shifting a standard normal distribution
Since minimizing the KL divergence between the true data distribution and a model is equivalent to maximum likelihood estimation
and since the likelihood is tractable thanks to the change of variable formula, we can essentially fit
Normalizing Flows in the General Univariate Case
Now that we have build some sort of intuition for normalizing flows, let’s put everything together. Suppose that we have i.i.d. samples from a data distribution
The Normalizing Flows method starts with a simple density
Then, finds the parameter values that maximize the likelihood, for instance using gradient descent iterations