Submitted by minimaxir t3_11fbccz in MachineLearning
Thunderbird120 t1_jakbyew wrote
Reply to comment by lucidraisin in [D] OpenAI introduces ChatGPT and Whisper APIs (ChatGPT API is 1/10th the cost of GPT-3 API) by minimaxir
You're better qualified to know than nearly anyone who posts here, but is flash attention really all that's necessary to make that feasible?
lucidraisin t1_jakdtf7 wrote
yes
edit: it was also used to train Llama. there is no reason not to use it at this point, for both training and fine-tuning / inference
Viewing a single comment thread. View all comments