ReasonablyBadass t1_jae7zhu wrote on February 28, 2023 at 8:16 PM

Can't read the paper right now. Can someone summarize: is it a new model, or "just" the standard transformer used on multimodal data? If it is new, what are the structural changes?