nmkd
nmkd t1_j7azisy wrote
Reply to comment by gravyvolcanoes in A Visual Search Engine: Same Energy is a visual search engine. You can use it to find beautiful art, photography, decoration ideas, or anything else. by GorgeousVillian
No, CLIP does not work on GIFs
nmkd t1_j6gaf5g wrote
Reply to comment by onnod in [R] InstructPix2Pix: Learning to Follow Image Editing Instructions by Illustrious_Row_9971
I think so, haven't tried though
nmkd t1_j6cipol wrote
In case someone is interested, I implemented this in my Stable Diffusion Windows GUI:
(Source Code: https://github.com/n00mkrad/text2image-gui/)
nmkd t1_j4vh7cl wrote
Reply to comment by kingdroopa in [D] Suggestion for approaching img-to-img? by kingdroopa
Okay, in that case, I'll try to be a bit more helpful lol.
I think you absolutely need to use something like YOLO for object identification/classification.
-
Humans and animals are warmer than the environment
-
Cars and other vehicles are warmer than the environment
-
Glass blocks IR but not visible light
You could get the overall "look" with just image-based networks, but to make it really convincing (more like COD's thermal vision) you need classification in order to make objects look hot that are supposed to be hot.
nmkd t1_j4vg72g wrote
Reply to [D] Suggestion for approaching img-to-img? by kingdroopa
You cannot just translate visible light to IR. No matter what machine learning you use, this is physically impossible.
nmkd t1_j45qfqn wrote
Reply to [D] Has ML become synonymous with AI? by Valachio
To the general public, absolutely, yes
nmkd t1_iv3miyj wrote
Reply to comment by Zer01123 in [D] NVIDIA RTX 4090 vs RTX 3090 Deep Learning Benchmarks by mippie_moe
4090 starts at 1950€ here in Germany
nmkd t1_ithk5u2 wrote
Reply to comment by iiitme in ADDICTION, me, 3D Sculpted, 2022 by Dylan_Kowalski_3D
Dark Lucy
nmkd t1_isy2xtz wrote
Reply to comment by 0x00groot in [D] Imagic Stable Diffusion training in 11 GB VRAM with diffusers and colab link. by 0x00groot
but not bitsandbytes as far as i know
nmkd t1_isxlyh2 wrote
Reply to comment by deep-yearning in [D] Imagic Stable Diffusion training in 11 GB VRAM with diffusers and colab link. by 0x00groot
This is not Windows compatible as far as I know.
nmkd t1_irx0805 wrote
Reply to comment by milleniumsentry in [D] Reversing Image-to-text models to get the prompt by MohamedRashad
Feeding the CLIP interrogator result back into Stable Diffusion results in completely different images though.
It's not good.
nmkd t1_irdjqtc wrote
Reply to comment by Smoke-away in An implementation of text-to-3D DreamFusion, powered by Stable Diffusion by Schneller-als-Licht
> I just tried v1.5 of their GUI
Elaborate?
nmkd t1_jdhmgpm wrote
Reply to comment by banmeyoucoward in [D] I just realised: GPT-4 with image input can interpret any computer screen, any userinterface and any combination of them. by Balance-
Nope, it's multimodal in terms of understanding language and images. It wasn't trained on mouse movement because that's neither language nor imagery.