Control Variates for Variance Reduction

The full pdf of this post can be found here.

control_variates

4 thoughts on “Control Variates for Variance Reduction

  1. There is a bug in the formula CG^2 ==> CG, yielding m = E(CG) / E(G^2). Also, the general solution (without assuming things are centered) is: cov(C, G) / var(G).

    1. Hi, thank you but I think this formula is correct.
      Var [(c (x) – m) G (x)] – Var [c (x) G (x)] = – 2 m E[c (x) G (x)^2] + m^2 E [G (x)^2]
      The minimum of that with respect to m is m = E[c(x) G(x)^2]/E[G(x)^2].
      Note that in the case of RL E[G(x)] = 0 by construction (policy gradient).

  2. Thanks for the post! I am trying to read the references you listed in paragraph 3, but link1 and link3 are broken. I wonder if you have those files?

Leave a Reply to JonathanCancel reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Discover more from Invariance

Subscribe now to keep reading and get access to the full archive.

Continue reading