Comments

You must log in or register to comment.

t1_je0evw8 wrote

Impressive work! Will check it out and try to contribute to the project 💪

6

t1_je0g8qt wrote

Wow, reducing the finetuning time from 20 hours to 20 minutes is amazing :O

5

t1_je0iequ wrote

Can’t seem to install deep speed on windows or wsl

2

OP t1_je0mp14 wrote

Deepspeed doesn't work on Windows yet but we were able to install in WSL. My colleague installed deepspeed in conda and then installed our package and it seemed to work.

3

t1_je0mp1z wrote

Unfortunately deepspeed is not easy to install on windows. Just use wsl, it did not give any problem to me using conda with python 3.8

1

t1_je10iu2 wrote

>Lower-precision fine-tuning (like INT8, INT4)

How would this work? Are the weight internally represented as f16 and then rounded stochastically whenever they are used?

1

t1_je12hzd wrote

Where does the model save after finetuned in the example in the README ?

1