Viewing a single comment thread. View all comments

leoreno t1_jdp6421 wrote on March 26, 2023 at 2:19 AM

Meta llama model and paper aimed to answer this

Tldr no of you get crafty about model serving efficiency