elvinaqa t1_itwh9zl wrote
Reply to comment by pommedeterresautee in [P] Up to 12X faster GPU inference on Bert, T5 and other transformers with OpenAI Triton kernels by pommedeterresautee
Since we have Kernl now, name it "Infrnce" next time.