bjergerk1ng
bjergerk1ng t1_jakt9hi wrote
Reply to comment by currentscurrents in [D] What are the most known architectures of Text To Image models ? by AImSamy
Do you have a source for Google using ViT?
bjergerk1ng t1_jakszgr wrote
Reply to comment by LetterRip in [D] OpenAI introduces ChatGPT and Whisper APIs (ChatGPT API is 1/10th the cost of GPT-3 API) by minimaxir
Is it possible that they also switched from a non-Chinchilla-optimal davinci to a Chinchilla-optimal ChatGPT? A Chinchilla-optimal model would be at least 4x smaller.
bjergerk1ng OP t1_j90mnb7 wrote
Reply to comment by anonymousTestPoster in [D] Formalising information flow in NN by bjergerk1ng
He linked https://arxiv.org/abs/1905.04271, not sure what is happening lol.
bjergerk1ng OP t1_j8zo74z wrote
Reply to comment by [deleted] in [D] Formalising information flow in NN by bjergerk1ng
Good point. What's shown in the paper (I just skimmed through it) seems quite promising; I wonder why this approach isn't seen more in the literature.
Submitted by bjergerk1ng t3_11542tv in MachineLearning
bjergerk1ng t1_j7ar6ps wrote
Reply to comment by Myxomatosiss in [D] Are large language models dangerous? by spiritus_dei
Hi ChatGPT
bjergerk1ng t1_jalzy46 wrote
Reply to comment by xEdwin23x in [D] What are the most known architectures of Text To Image models ? by AImSamy
That's not diffusion though