nins_
nins_ t1_j38fh4v wrote
I found the portable binaries of Real ESRGAN extremely easy to use (no setup involved). Just a command line tool. It does not use CUDA though. https://github.com/xinntao/Real-ESRGAN
They're using NCNN to package the model. Have a look. https://github.com/Tencent/NCNN
nins_ t1_iyl5j1t wrote
That's the entire business model for aws, Azure, gcp. There are several small players out there as well.
nins_ t1_iwlhcbk wrote
So it seems like you have 8 * 5 * 2 = 80 features per training sample if you want to use all 8 sides to predict your binary label. Is this representation causing problems?
nins_ t1_iuwwdrw wrote
Any chance this is your friend? :)
You can see my answer there for an alternative approach but in short, you can't just do a reverse pass to get the inputs.
nins_ t1_iuen1e3 wrote
Reply to comment by ChaosAdm in [D] How do I get certain number of frames before every point-of-impact in a tennis game video? by ChaosAdm
The manual annotation would involve you noting down the timestamps in a csv. Then you write a short script (I would do it with OpenCV) to read the video files, get 20 frames prior to each timestamp, save them as images into whatever directory structure you need.
Edit: MoviePy package will probably be easier than OpenCV for you.
nins_ t1_iu5na8f wrote
Reply to [D] Predicting input values from taget value by ARFGHA
Is the input space very large?
What if you train a model to predict the actual value using your dataset and then simply run an exhaustive prediction over your input space? Then lookup the closest model output value whenever you need it and check the corresponding input parameters.
nins_ t1_j5xjw35 wrote
Reply to [D] Pretraining for CNN by Dense-Smf-6032
Do you mean self-supervised learning for CNNs? SimCLR does work on CNNs. Also check out SOCO, SCRL, BYOL - there's a lot.