JealousAd8448 t1_je0evw8 wrote on March 28, 2023 at 2:52 PM

#2,418,038

Impressive work! Will check it out and try to contribute to the project 💪

culebra_risa t1_je0g8qt wrote on March 28, 2023 at 3:01 PM

#2,418,250

Wow, reducing the finetuning time from 20 hours to 20 minutes is amazing :O

OrionJr t1_je0iequ wrote on March 28, 2023 at 3:15 PM

#2,418,602

Can’t seem to install deep speed on windows or wsl

x_ml OP t1_je0mp14 wrote on March 28, 2023 at 3:43 PM

#2,419,318

Replying to OrionJr (#2,418,602)

Deepspeed doesn't work on Windows yet but we were able to install in WSL. My colleague installed deepspeed in conda and then installed our package and it seemed to work.

JealousAd8448 t1_je0mp1z wrote on March 28, 2023 at 3:43 PM

#2,419,319

Replying to OrionJr (#2,418,602)

Unfortunately deepspeed is not easy to install on windows. Just use wsl, it did not give any problem to me using conda with python 3.8

subhash165 t1_je0mvvw wrote on March 28, 2023 at 3:44 PM

#2,419,350

Replying to OrionJr (#2,418,602)

I was able to run it with WSL (with miniconda environment)

sebzim4500 t1_je10iu2 wrote on March 28, 2023 at 5:11 PM

#2,421,762

>Lower-precision fine-tuning (like INT8, INT4)

How would this work? Are the weight internally represented as f16 and then rounded stochastically whenever they are used?

MohamedRashad t1_je12hzd wrote on March 28, 2023 at 5:23 PM

#2,422,080

Where does the model save after finetuned in the example in the README ?

[R] Build and personalize LLMs on your own data - Take back control with xTuring!

Comments

JealousAd8448 t1_je0evw8 wrote on March 28, 2023 at 2:52 PM

culebra_risa t1_je0g8qt wrote on March 28, 2023 at 3:01 PM

OrionJr t1_je0iequ wrote on March 28, 2023 at 3:15 PM

x_ml OP t1_je0mp14 wrote on March 28, 2023 at 3:43 PM

JealousAd8448 t1_je0mp1z wrote on March 28, 2023 at 3:43 PM

subhash165 t1_je0mvvw wrote on March 28, 2023 at 3:44 PM

sebzim4500 t1_je10iu2 wrote on March 28, 2023 at 5:11 PM

MohamedRashad t1_je12hzd wrote on March 28, 2023 at 5:23 PM