clueless1245 t1_jcteoda wrote on March 19, 2023 at 12:48 PM

Reply to comment by farmingvillein in [R] ChatGLM-6B - an open source 6.2 billion parameter Eng/Chinese bilingual LLM trained on 1T tokens, supplemented by supervised fine-tuning, feedback bootstrap, and RLHF. Runs on consumer grade GPUs by MysteryInc152

Open source doesn't mean freely used. That's the whole reason there's an F in FOSS.

clueless1245 t1_jbdysis wrote on March 8, 2023 at 10:40 AM

Reply to comment by ImpossibleCat7611 in [D] I'm a dentist and during my remaining lifetime I would like to take part in laying groundwork for future autonomic robots powered by AI that are capable of performing dental procedures. What technologies should I start to learn? by Armauer

Important though to note are literally not enough people just taking stuff implemented in scikitlearn or whatever and applying that to their own problems, and in and of itself that can be novel and interesting even if its not a shiny new model.

> As I said domain knowledge and/or providing data and relying on the technical expertise of others is the most valuable direction to go.

Its mainly the way you wrote your comment that left a bad taste in my mouth, this line specifically is probs a fine recommendation for OP.

clueless1245 t1_jbdwxye wrote on March 8, 2023 at 10:13 AM

Reply to comment by ImpossibleCat7611 in [D] I'm a dentist and during my remaining lifetime I would like to take part in laying groundwork for future autonomic robots powered by AI that are capable of performing dental procedures. What technologies should I start to learn? by Armauer

Do you actually work in ML research lol? About as important as fundamental research on architectures losses and optimisers is the applied end of things and tons of applied stuff is absolutely something other domain experts can contribute to, non ML non CS expertise is absolutely essential to i.e. the stuff my group does. "State of the art on some famous benchmark" is not the be all and end all of this field and "only a small minority is able to make significant contributions" is an absurdly incorrect statement.

clueless1245 t1_jb5khy8 wrote on March 6, 2023 at 4:33 PM

Reply to comment by [deleted] in [R] We found nearly half a billion duplicated images on LAION-2B-en. by von-hust

You want this done in a controlled, methodical and documented manner, not earlier research which showed SD 1.5 to verbatim copy every line and minute contour of wood grain in a specific copyrighted "wooden table" background, found after training to be repeated tens of thousands of times in the input dataset (due to websites selling phone cases photoshopping phones onto it).

clueless1245 t1_j9x3dlc wrote on February 25, 2023 at 4:34 AM

Reply to comment by mosquitoLad in [D] What is the correct term for a non-GAN system where two or more networks compete as part of training? by mosquitoLad

Its also an issue for generator training though if the discriminator gets 100% all the time, if I remember correctly. Theres various stuff you can look up to make training more stable which I don't have on hand rn.

clueless1245 t1_j9wzcbn wrote on February 25, 2023 at 3:57 AM

Reply to comment by mosquitoLad in [D] What is the correct term for a non-GAN system where two or more networks compete as part of training? by mosquitoLad

Idk what he means specifically by the "gradient being passed between" two agents but in a GAN (part of) the loss function of the generator is the inverse of (part of) the loss function of the discriminator, so the gradients calculated at generator output and discriminator output are linked.

A less formal way of saying it: The generator's gradient depends on the discriminator's loss.

This should be true for any adversarial game, I would think?

clueless1245 t1_j83y36r wrote on February 11, 2023 at 1:44 PM

Reply to M1 MAX vs M2 MAX by markupdev

Much better to rent lol.

clueless1245 t1_j7iz62v wrote on February 7, 2023 at 3:13 AM

Reply to comment by currentscurrents in [N] Getty Images sues AI art generator Stable Diffusion in the US for copyright infringement by Wiskkey

Got it, thanks! Yeah, I guess it's more complicated than I thought.

clueless1245 t1_j7irhmo wrote on February 7, 2023 at 2:14 AM

Reply to comment by supersoldierboy94 in [D] Yann Lecun seems to be very petty against ChatGPT by supersoldierboy94

Hope you get the help you need 🙏.

clueless1245 t1_j7ipckn wrote on February 7, 2023 at 1:58 AM

Reply to [D] Yann Lecun seems to be very petty against ChatGPT by supersoldierboy94

Lol at your previous posts. https://www.reddit.com/r/LateStageCapitalism/comments/zdeix8/ai_art_is_very_dystopian/j044ec2/

You are obviously disturbed and just latching on to arguing for thousands of words online as an outlet.

Cannot wait for next month when Ukraine or COVID-19 is back on the news cycle and you move on to /r/worldnews.

clueless1245 t1_j7il1dv wrote on February 7, 2023 at 1:23 AM

Reply to [P] I have implemented an RL agent for trading EUR/USD and I don't know what to do next... by Kiizmod0

Your model is learning to do is predict future market data from past market data, which fundamentally is not worthwhile because market data hinges on real-world news. If you want massive quantities of real-world news data in a structured/tagged format, look at GDELT.

https://www.gdeltproject.org/

Also, look at using Kaggle's GPU notebooks instead of Google's. You get 30 hours a week if you verify with your phone number, instead of Google's arbitrary secret heuristic based cutoff. Or look at something like runpod or vast.ai, rates for non secure GPUs are like a few cents an hour and datacenter GPUs not that expensive either.

P.S There are arbitrage opportunities you can spot using purely market data, but those are generally very short-term, don't warrant powerful models to detect and are pounced on by trading bots run by trading firms.

clueless1245 t1_j7ijokz wrote on February 7, 2023 at 1:13 AM

Reply to comment by [deleted] in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada

Nope, they're exactly the same as far as advancing human knowledge goes.

clueless1245 t1_j7id4sr wrote on February 7, 2023 at 12:24 AM

Reply to comment by [deleted] in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada

Lol what? That's the exact rationale "Open"AI used for not releasing the model weights for Dalle-2 (and instead selling it to Microsoft).