Submitted by super_deap t3_11tmpc5 in MachineLearning
kittenkrazy t1_jcjxr0b wrote
I wonder if this can effectively be used in LLaMA. 32K context would be a game changer
Nhabls t1_jck9a4c wrote
Yeah, just need enough training time and data to be able to train those 32K-context layers effectively...
fastinguy11 t1_jcle8cn wrote
GPT-4 32K API, when will it be available?
mrpogiface t1_jckmi7d wrote
Definitely, but you'd need to further fine-tune the model to "teach" it to make use of the additional context
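The memory-efficient attention the thread discusses is available in PyTorch 2.x as `torch.nn.functional.scaled_dot_product_attention`, which can dispatch to a fused kernel that avoids materializing the full L×L attention matrix, making long contexts like 32K far more tractable. A minimal sketch (shapes and values here are illustrative, not from the original post):

```python
import torch
import torch.nn.functional as F

# Illustrative shapes: batch 1, 8 heads, sequence length 1024, head dim 64.
# A real 32K-context run would use seq_len = 32768 instead.
q = torch.randn(1, 8, 1024, 64)
k = torch.randn(1, 8, 1024, 64)
v = torch.randn(1, 8, 1024, 64)

# Fused attention: computes softmax(QK^T / sqrt(d)) @ V. When a fused
# backend (e.g. FlashAttention or the memory-efficient kernel) is
# available, the full attention matrix is never materialized in memory.
out = F.scaled_dot_product_attention(q, k, v, is_causal=True)

print(out.shape)  # same shape as q: (1, 8, 1024, 64)
```

As the comment notes, swapping in this kernel only makes the longer context computationally feasible; the model would still need fine-tuning on long sequences before it actually learns to use positions beyond its original training length.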
super_deap OP t1_jcktps3 wrote
This
kreuzguy t1_jcliuwx wrote
Someone should definitely look into this!