[D] Would diffusion language models make sense?
Submitted by hapliniste t3_yck1sx on October 24, 2022 at 7:20 PM in MachineLearning 7 comments 47
limpbizkit4prez t1_itnoeqq wrote on October 24, 2022 at 11:53 PM
I've always applied an annealing schedule like that to LMs. IMO, it works incredibly well and generalizes great.
Permalink 3