Comments

You must log in or register to comment.

JealousAd8448 t1_je0evw8 wrote

Impressive work! Will check it out and try to contribute to the project 💪

6

culebra_risa t1_je0g8qt wrote

Wow, reducing the finetuning time from 20 hours to 20 minutes is amazing :O

5

OrionJr t1_je0iequ wrote

Can’t seem to install deep speed on windows or wsl

2

x_ml OP t1_je0mp14 wrote

Deepspeed doesn't work on Windows yet but we were able to install in WSL. My colleague installed deepspeed in conda and then installed our package and it seemed to work.

3

sebzim4500 t1_je10iu2 wrote

>Lower-precision fine-tuning (like INT8, INT4)

How would this work? Are the weight internally represented as f16 and then rounded stochastically whenever they are used?

1

MohamedRashad t1_je12hzd wrote

Where does the model save after finetuned in the example in the README ?

1