Viewing a single comment thread. View all comments

Virtual_Giraffe_5173 t1_j6o6g7e wrote

It is not surprising that the performance is as good as with 32 bit networks.

That they train faster is more surprising. What is the reason for this?

My next question is: which frame work supports 16bit networks? Or do you plan to implement everything from scratch?

1

AzureNostalgia t1_j6olwgf wrote

They train faster because less memory is being used and also 16bit arithmetic is faster than 32bit

3