Viewing a single comment thread. View all comments

leoreno t1_jdp6421 wrote

Meta llama model and paper aimed to answer this

Tldr no of you get crafty about model serving efficiency

1