Submitted by questionasker577 t3_10nn3k3 in singularity
visarga t1_j6arwxp wrote
Reply to comment by genshiryoku in Why did 2003 to 2013 feel like more progress than 2013 to 2023? by questionasker577
Generating data through RL like AlphaGo or "Evolution through Large Models" (ELM) seems to show a way out. Not all data is equally useful for the model, for example problem and task solving is more important that raw organic text.
Basically use LLM to generate and another system to evaluate, in order to filter the useful data examples.
Viewing a single comment thread. View all comments