oathbreakerkeeper t1_jdgjte0 wrote on March 24, 2023 at 6:14 AM

How do you use pure fp16 out of curiosity? I've only ever trained with mixed precision, letting pytorch handle the fp16 stuff from there.

Do you have an example of a github repo that does it?

Dependent_Ad5120 t1_je5qfmp wrote on March 29, 2023 at 4:35 PM

I don't have a github repo for this, but it is pretty simple:

```

model = nn.Transformer().cuda().half

input = torch.rand(..).cuda().half

with sdp_kernel(...enable only flash attn):

output = model(input)

```

These 4 lines should be enough.