bjergerk1ng

bjergerk1ng t1_jalzy46 wrote on March 2, 2023 at 11:38 AM

Reply to comment by xEdwin23x in [D] What are the most known architectures of Text To Image models ? by AImSamy

That's not diffusion though

bjergerk1ng t1_jakt9hi wrote on March 2, 2023 at 3:22 AM

Reply to comment by currentscurrents in [D] What are the most known architectures of Text To Image models ? by AImSamy

Source about Google using ViT?

bjergerk1ng t1_jakszgr wrote on March 2, 2023 at 3:20 AM

Reply to comment by LetterRip in [D] OpenAI introduces ChatGPT and Whisper APIs (ChatGPT API is 1/10th the cost of GPT-3 API) by minimaxir

Is it possible that they also switched from a non-Chinchilla-optimal davinci to a Chinchilla-optimal ChatGPT? That would make the model at least 4x smaller.

bjergerk1ng OP t1_j90mnb7 wrote on February 18, 2023 at 9:01 AM

Reply to comment by anonymousTestPoster in [D] Formalising information flow in NN by bjergerk1ng

He linked https://arxiv.org/abs/1905.04271, not sure what is happening lol.

bjergerk1ng OP t1_j8zo74z wrote on February 18, 2023 at 2:43 AM

Reply to comment by [deleted] in [D] Formalising information flow in NN by bjergerk1ng

Good point. What's shown in the paper (I just skimmed through it) seems quite promising; I wonder why this approach isn't seen more in the literature.

bjergerk1ng t1_j7ar6ps wrote on February 5, 2023 at 11:35 AM

Reply to comment by Myxomatosiss in [D] Are large language models dangerous? by spiritus_dei

Hi ChatGPT