Control Variates for Variance Reduction

The full PDF of this post can be found here.

3 thoughts on “Control Variates for Variance Reduction”

dohmatob says:

April 24, 2018 at 4:28 pm

There is a bug in the formula: CG^2 should be CG, which yields m = E(CG) / E(G^2). Also, the general solution (without assuming the variables are centered) is m = cov(C, G) / var(G).
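For readers following this thread, here is a minimal sketch of the additive case, assuming the estimator has the form C - m G. That form is an assumption about the setting this comment has in mind, and the sample data and helper names are hypothetical. It estimates m = cov(C, G) / var(G) from samples and compares the variance with and without the control variate, using a G that is deliberately not centered.

```python
import random

def variance(samples):
    """Population-style sample variance."""
    n = len(samples)
    mean = sum(samples) / n
    return sum((s - mean) ** 2 for s in samples) / n

def optimal_additive_coefficient(c_samples, g_samples):
    """Return cov(C, G) / var(G), the m that minimizes Var(C - m * G)."""
    n = len(c_samples)
    mean_c = sum(c_samples) / n
    mean_g = sum(g_samples) / n
    cov_cg = sum((c - mean_c) * (g - mean_g)
                 for c, g in zip(c_samples, g_samples)) / n
    return cov_cg / variance(g_samples)

# Hypothetical toy data: G has mean 1 (not centered), C is correlated with G.
rng = random.Random(0)
g = [rng.gauss(1.0, 1.0) for _ in range(20000)]
c = [2.0 * gi + rng.gauss(0.0, 0.5) for gi in g]

m_star = optimal_additive_coefficient(c, g)
var_plain = variance(c)
var_cv = variance([ci - m_star * gi for ci, gi in zip(c, g)])
```

In this toy setup `m_star` should come out close to 2, the slope used to generate `c`, and `var_cv` should be well below `var_plain`. Since G is not centered here, using C - m G as an estimator of E[C] would also need the known mean of G added back, scaled by m.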

1. DJR says:
  
  April 24, 2018 at 10:00 pm
  
  Hi, thank you, but I think this formula is correct.
  Var[(c(x) - m) G(x)] - Var[c(x) G(x)] = -2 m E[c(x) G(x)^2] + m^2 E[G(x)^2]
  The minimum of the right-hand side with respect to m is at m = E[c(x) G(x)^2] / E[G(x)^2].
  Note that the identity above uses E[G(x)] = 0, which holds by construction in the RL case (policy gradient).
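The difference between the two formulas in this thread can be checked numerically. The sketch below is only an illustration under stated assumptions: x is standard normal, G(x) = x stands in for a score function with E[G] = 0, and c(x) = 3 + x^2 is a hypothetical cost. None of these choices come from the post. It evaluates the multiplicative estimator (c(x) - m) G(x) at m = E[c G^2] / E[G^2] and at m = E[c G] / E[G^2].

```python
import random

def variance(samples):
    """Population-style sample variance."""
    n = len(samples)
    mean = sum(samples) / n
    return sum((s - mean) ** 2 for s in samples) / n

def moments(c, g):
    """Sample estimates of E[c G^2], E[c G] and E[G^2]."""
    n = len(c)
    e_cg2 = sum(ci * gi * gi for ci, gi in zip(c, g)) / n
    e_cg = sum(ci * gi for ci, gi in zip(c, g)) / n
    e_g2 = sum(gi * gi for gi in g) / n
    return e_cg2, e_cg, e_g2

def variance_change(m, e_cg2, e_g2):
    """-2 m E[c G^2] + m^2 E[G^2]; relies on E[G] = 0."""
    return -2.0 * m * e_cg2 + m * m * e_g2

# Hypothetical toy setup (not from the post).
rng = random.Random(0)
x = [rng.gauss(0.0, 1.0) for _ in range(50000)]
g = x                              # toy score function, E[G] = 0
c = [3.0 + xi * xi for xi in x]    # toy cost c(x)

e_cg2, e_cg, e_g2 = moments(c, g)
m_star = e_cg2 / e_g2   # E[c G^2] / E[G^2]; equals 6 in expectation here
m_alt = e_cg / e_g2     # E[c G] / E[G^2]; equals 0 in expectation here

var_no_baseline = variance([ci * gi for ci, gi in zip(c, g)])
var_star = variance([(ci - m_star) * gi for ci, gi in zip(c, g)])
var_alt = variance([(ci - m_alt) * gi for ci, gi in zip(c, g)])
```

For this toy setup the exact values are Var[c G] = 42 and Var[(c - 6) G] = 6, so `var_star` should be far smaller than both `var_no_baseline` and `var_alt`. The formula E[c G] / E[G^2] gives roughly 0 here and leaves the variance essentially unchanged, which is consistent with it being the minimizer for the additive estimator c - m G and not for the multiplicative one.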
  
dohmatob says:

April 24, 2018 at 4:29 pm

Nice blogpost, BTW!
