Comments

advertisementeconomy t1_ist76ic wrote

Interesting link.

> This implementation requires a GPU with ~30GB of VRAM. I'd recommend an A100 from Lambda GPU Cloud, which will take a little over 5 minutes to process a single image.

> Make sure you have downloaded the appropriate checkpoint for Stable Diffusion from Hugging Face and set up your environment correctly. (There are instructions for both in many other Stable Diffusion repos, so please Google it if you're not sure.) Note there's plenty of room for optimisation on memory usage and training parameters (this is just a quick guess based on the paper, which doesn't have many details). So please experiment and let me know how it goes!

> Written by Justin Pinkney (@Buntworthy) @ Lambda Labs.

His GitHub: https://github.com/justinpinkney/stable-diffusion

The notebook: https://github.com/justinpinkney/stable-diffusion/blob/main/notebooks/imagic.ipynb

24

Ijustdowhateva t1_istcl6w wrote

The average person not only doesn't understand the significance of this tech, they don't even know it exists.

Ask your Uber driver about SD, he probably won't know what you're talking about.

This tech is going to improve at breakneck pace and absolutely take the world by complete surprise.

67

TheDividendReport t1_istvvzb wrote

At the rate image generation is progressing, trying to corner a specific application will be difficult. I say this after seeing DALL-E take payments for their service for maybe 3 months before Stable Diffusion completely kneecapped them with an open-source program that does their tech better.

24

bitofaknowitall t1_isu063g wrote

Lol, one hour later it was added as a feature to Stable Diffusion.

27

Desperate_Donut8582 t1_isuiw49 wrote

I wonder how that will shape society, whether software developers could make things that detect those kinds of changes, and how the way people communicate will differ. I doubt the majority of people are gonna use it tho.

2

Coolmac t1_isujo1r wrote

I think that detection of these images will happen, but it is always going to follow rather than lead.

Human nature of believing whatever confirms our biases will make this tech truly scary.

And yes I'm pessimistic 😂

6

Diamond-Is-Not-Crash t1_isuqfxy wrote

Oh dear. The possibilities of this model are quite sinister. Oh well, such is progress /s

3

Romando1 t1_isv3fct wrote

Could this do it in real time with video frames?

7

HyperImmune t1_isw4u1h wrote

This is literally the second announcement of something like this TODAY. UniTune, which also edits images with a prompt, was announced too. And here I thought we were already at breakneck speeds before today. Wild stuff.

3

Defiant_Station_5895 t1_isxcpdo wrote

If we get to the point where we can’t trust any information on the internet, or trust that images and videos are genuine, where does that leave society? If this can’t be controlled, are we heading for a dark place where all we can believe is what we can actually see? Is this a major impending crisis?

1

Defiant_Station_5895 t1_itano3t wrote

I get your point, and I did think that as I was writing this. But my trust hasn’t broken down to the point where I disconnect from the internet entirely. I can watch the news and be fairly confident those people said what they said and this is not a computer-generated Joe Biden saying America is at war with North Korea.

1

ObjectiveDeal t1_itlueyk wrote

Tired of this shit. Hurry up, we don’t have time.

1

turnip_burrito t1_iuhipf4 wrote

To remain a functional society, we'll have to trust centralized news outlets more than social media sources, or have computer programs which validate images/videos based on either metadata or statistical noise. Maybe even a suite of these programs and several centralized news outlets verifying. Or maybe some form of content delivery that encrypts legitimate videos at the time of recording, so that when we receive a video encrypted in this way, we know it's unaltered? I'm tired so I haven't thought this through.
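The "mark it at recording time so we know it's unaltered" idea is closer to signing than to encryption. A minimal sketch of that, assuming a secret key shared between the recording device and the verifier (the key and the function names here are made up for illustration; a real provenance scheme would more likely use public-key signatures so anyone can verify):

```python
import hashlib
import hmac

# Hypothetical secret held by the recording device and the verifier.
DEVICE_KEY = b"example-device-key"


def sign_video(video_bytes: bytes, key: bytes = DEVICE_KEY) -> str:
    """Compute a hex HMAC-SHA256 tag over the raw video bytes at recording time."""
    return hmac.new(key, video_bytes, hashlib.sha256).hexdigest()


def is_unaltered(video_bytes: bytes, tag: str, key: bytes = DEVICE_KEY) -> bool:
    """Recompute the tag and compare it to the stored one in constant time."""
    expected = sign_video(video_bytes, key)
    return hmac.compare_digest(expected, tag)


# Stand-in for real video data.
original = b"raw video frames..."
tag = sign_video(original)

untouched_ok = is_unaltered(original, tag)
tampered_ok = is_unaltered(original + b" edited", tag)
```

Here `untouched_ok` comes out true and `tampered_ok` false, since any change to the bytes changes the tag. It only shows the file wasn't modified after tagging; it says nothing about whether what was recorded was staged.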

Anyway, when video synthesis is perfected, we will need to treat video and images exactly like we do text now. Goodbye to the days of automatically trusting all video as authentic, unfortunately. :(

2