Submitted by eugene129 t3_11lgo2d in deeplearning
rkstgr t1_jbia52h wrote
First of all, beta_t is just some predefined variance schedule (in literature often linear interpolated between 1e-2 and 1e-4) and it defines the variance of the noise that is added at step t. What you have in (1) is the variance of sample x_t which does not have to be beta_t.
What does hold for large t is var(x_t)=1 as our sample converges to ~ Normal Gaussian with mean 0 and var 1.
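Here is a minimal numerical sketch of that point, assuming the standard DDPM forward process x_t = sqrt(1 - beta_t) x_{t-1} + sqrt(beta_t) eps, T = 1000 steps, a linear beta schedule from 1e-4 to 1e-2, and data with variance 4.0 (all of these values are illustrative assumptions, not from the original post):

```python
import numpy as np

# Assumed setup: T = 1000 steps, linear beta schedule between 1e-4 and 1e-2.
T = 1000
betas = np.linspace(1e-4, 1e-2, T)   # beta_t: variance of the noise added at step t
alphas = 1.0 - betas
alpha_bar = np.cumprod(alphas)       # alpha_bar_t = prod_{s<=t} (1 - beta_s)

# Under the forward process, Var(x_t) = alpha_bar_t * Var(x_0) + (1 - alpha_bar_t).
var_x0 = 4.0                         # hypothetical un-normalized data variance
var_xt = alpha_bar * var_x0 + (1.0 - alpha_bar)

# Var(x_t) starts near Var(x_0) and approaches 1 for large t,
# regardless of the individual beta_t values.
print(var_xt[0], var_xt[T // 2], var_xt[-1])
```

The printed values show Var(x_t) drifting from the data variance toward 1 as t grows, which is the convergence to a standard normal described above; the per-step beta_t itself never has to equal Var(x_t).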