
Background_Thanks604 t1_ixu3170 wrote

How do you mean - translate model weights by hand?

4

Deep-Station-1746 t1_ixu6ych wrote

Load the .pt model with torch and get all the weights with state_dict. Permute them all into channels-last format. Convert the arrays to numpy, then load them into TensorFlow. Write the network's forward logic in TensorFlow, plug in the weights, and run.
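
A minimal sketch of the export half of those steps, assuming the checkpoint "model.pt" contains a full pickled PyTorch module (all file and variable names here are illustrative, not from the thread):

```python
import torch

# Assumes the checkpoint is the whole pickled module, not just a state_dict.
model = torch.load("model.pt", map_location="cpu")
model.eval()

numpy_weights = {}
for name, tensor in model.state_dict().items():
    array = tensor.detach().cpu().numpy()
    if array.ndim == 4:
        # PyTorch Conv2d kernels are (out_ch, in_ch, kH, kW);
        # TensorFlow/Keras expects channels-last (kH, kW, in_ch, out_ch).
        array = array.transpose(2, 3, 1, 0)
    elif array.ndim == 2:
        # PyTorch Linear weights are (out_features, in_features);
        # Keras Dense expects (in_features, out_features).
        array = array.T
    numpy_weights[name] = array
```

The ndim-based permutes are a simplification for a plain conv/dense network; anything with embeddings, attention, or grouped convolutions needs layer-by-layer handling.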

31

Background_Thanks604 t1_ixuccby wrote

Thanks for the clarification! Do you know if there is a tutorial/blog post for this approach?

4

Deep-Station-1746 t1_ixugp6t wrote

Nope, I don't think so. If you need help and are willing to wait a bit (a bit busy right now), DM me and I'll take a look at your problem.

6

Background_Thanks604 t1_ixukvcg wrote

Thx - appreciate it! I don't have a problem, I just want to learn/try this approach because I've never heard of it.

7

jobeta t1_ixum5yx wrote

How complex is the model you want to translate?

1

Background_Thanks604 t1_ixun76x wrote

I don't have a model to translate - I read about this approach in the comments and I want to learn about it.

2

ApeForHire t1_ixwveuz wrote

I was actually able to do this once by relying heavily on GitHub's Copilot, i.e. just giving functions very specific names and writing comments like "this function converts a pytorch model into a vector of weights" etc. It worked pretty well and was simple for basic network architectures, though I imagine it could get more complicated.

1

CodaholicCorgi OP t1_iy1s4ws wrote

I did it once in one of my projects. It's basically hand-picking weights from the PyTorch model's layers and copying them into the TensorFlow model's layers, but it feels more reliable than relying on ONNX and its pile of warnings.

There aren't many tutorials or blog posts about this, so I will try creating a GitHub repo for it later (just examples with simple layers) so more people know that this technique exists.
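
To illustrate the "hand-picking" half, here is a hypothetical continuation of the earlier sketch: `build_tf_model`, the layer names `conv1`/`fc1`, and the `model`/`numpy_weights` variables are all assumed, not taken from anyone's actual project.

```python
import numpy as np
import tensorflow as tf
import torch

# Hypothetical hand-written Keras model whose layers mirror the PyTorch ones.
tf_model = build_tf_model()

# Keras layers take [kernel, bias]; the arrays are assumed to already be
# permuted/transposed as in the export sketch above.
tf_model.get_layer("conv1").set_weights(
    [numpy_weights["conv1.weight"], numpy_weights["conv1.bias"]]
)
tf_model.get_layer("fc1").set_weights(
    [numpy_weights["fc1.weight"], numpy_weights["fc1.bias"]]
)

# Sanity check: the same random input (NCHW for PyTorch, NHWC for TensorFlow)
# should give nearly identical outputs if the port is correct.
x = np.random.rand(1, 3, 32, 32).astype("float32")
torch_out = model(torch.from_numpy(x)).detach().numpy()
tf_out = tf_model(x.transpose(0, 2, 3, 1)).numpy()
print(np.allclose(torch_out, tf_out, atol=1e-5))
```

One real gotcha: if the network flattens conv feature maps before a fully connected layer, the flatten order differs between NCHW and NHWC, so a plain transpose of that weight is not enough and its rows have to be reordered to match.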

2