/f/MachineLearning

[P] Up to 12X faster GPU inference on Bert, T5 and other transformers with OpenAI Triton kernels

Submitted by pommedeterresautee t3_ydqmjp on October 26, 2022 at 6:10 AM in MachineLearning

40 comments

352


pm_me_your_ensembles t1_itw6sti wrote on October 26, 2022 at 7:21 PM

Bless you, I needed this :D



