Submitted by Kaarssteun t3_10zxpzy in singularity
blueSGL t1_j876jmh wrote
Reply to comment by FarFuckingOut in Recursive self-improvement (intelligence explosion) cannot be far away by Kaarssteun
entirely depends on having a good discriminator, look at the work going on in stable diffusion where outputs of the model are fed back in for further fine tuning.
or some of the work on doing automated dataset creation for fine tunes by prompting the model in a certain ways so it 'self corrects' and then collect the output and use [correction + initial question] for fine tunes.
Viewing a single comment thread. View all comments