Control Variates for Variance Reduction

The full pdf of this post can be found here.

3 thoughts on “Control Variates for Variance Reduction

  1. There is a bug in the formula CG^2 ==> CG, yielding m = E(CG) / E(G^2). Also, the general solution (without assuming things are centered) is: cov(C, G) / var(G).

    1. Hi, thank you but I think this formula is correct.
      Var [(c (x) – m) G (x)] – Var [c (x) G (x)] = – 2 m E[c (x) G (x)^2] + m^2 E [G (x)^2]
      The minimum of that with respect to m is m = E[c(x) G(x)^2]/E[G(x)^2].
      Note that in the case of RL E[G(x)] = 0 by construction (policy gradient).

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Discover more from Invariance

Subscribe now to keep reading and get access to the full archive.

Continue reading