
I-am_Sleepy t1_j3qa3yq wrote

I am not really in this field (NLP), but you should check out Fast Model Editing at Scale from 2021 (use Google Scholar to follow its citation thread).


LetGoAndBeReal t1_j3r0p45 wrote

Thank you for this. It seems this paper could surely help answer my question, if only I could understand it!

A challenge I keep coming up against in my quest to quickly learn about ML/NN is that almost everything I read is either too high level to provide meaningful explanation or too technically dense for me to follow. I guess I will just take note of this paper for now and circle back to it when I'm a bit further along.


I-am_Sleepy t1_j3vok1m wrote

Hey, I’ve found another paper (Git Re-Basin) about merging the weights of models trained on disjoint datasets while retaining the performance of both. It's quite technical, but there is an implementation online. I think you should check it out.
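
The gist, as I understand it: hidden units in a network have no inherent order, so the paper searches for a permutation of one model's units that lines it up with the other before averaging the weights. Here's a rough single-hidden-layer sketch of that idea (just the weight-matching flavor, ignoring biases; not code from the paper, and the function name is my own):

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def align_and_merge(w1_a, w2_a, w1_b, w2_b, alpha=0.5):
    """Merge two 2-layer MLPs by permuting B's hidden units to match A's,
    then interpolating the aligned weights (alpha=0.5 is a plain average)."""
    # w1_*: (hidden, in) first-layer weights; w2_*: (out, hidden) second-layer weights.
    # Score each (A unit, B unit) pair by similarity of incoming weight vectors;
    # negate so the assignment solver, which minimizes cost, maximizes similarity.
    cost = -(w1_a @ w1_b.T)
    _, perm = linear_sum_assignment(cost)

    # Reorder B's hidden units so unit i of B lines up with unit i of A.
    w1_b_aligned = w1_b[perm, :]
    w2_b_aligned = w2_b[:, perm]

    # Interpolate the aligned weights.
    w1_merged = (1 - alpha) * w1_a + alpha * w1_b_aligned
    w2_merged = (1 - alpha) * w2_a + alpha * w2_b_aligned
    return w1_merged, w2_merged

# Toy usage with random weights, just to show the expected shapes:
rng = np.random.default_rng(0)
w1_a, w1_b = rng.normal(size=(64, 32)), rng.normal(size=(64, 32))
w2_a, w2_b = rng.normal(size=(10, 64)), rng.normal(size=(10, 64))
w1_m, w2_m = align_and_merge(w1_a, w2_a, w1_b, w2_b)
```

The real method handles deeper networks (the permutations have to stay consistent across layers) and has other matching variants, but this is the core trick.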
