Single_Blueberry t1_jcjsxa1 wrote on March 17, 2023 at 10:39 AM

>the fact that GPT 4 may be two magnitude orders bigger than GPT 3

I'm not aware of any reliable sources that claim that.

Intuitively I don't see why it would stop hallucinating. I imagine the corpus - as big as it may be - doesn't contain a lot of examples for the concept of "not knowing the answer".

That's something people use a lot in private conversation, but not in written language on the public internet or books. Which afaik is where most of the data comes from.

Available_Lion_652 t1_jcjtc6h wrote on March 17, 2023 at 10:44 AM

I don t understand why people down voted. I saw a claim that GPT 4 was trained on 25k Nvidia A100 for several months. It has used x100 more compute power than GPT3, based on that post. 20 B Llama model was trained on 1.4 trillions tokens. So yeah, I think that my post is based on these claims

Single_Blueberry t1_jcjvh6o wrote on March 17, 2023 at 11:09 AM

Again, can't find a reliable source for that.

I personally doubt that GPT-4 is significantly larger than GPT 3.x, simply because that would also further inflate inference cost, which you generally want to avoid in a product (as opposed to a research feat).

Better architecture, better RLHF, more and better train data, more train compute? Seems all reasonable.

Orders of magnitudes larger again? Don't think so.

[D] GPT-4 is really dumb

Available_Lion_652 t1_jcjrfnx wrote on March 17, 2023 at 10:20 AM

NotARedditUser3 t1_jcjsqta wrote on March 17, 2023 at 10:37 AM

yumiko14 t1_jcju8tw wrote on March 17, 2023 at 10:55 AM

NotARedditUser3 t1_jckof25 wrote on March 17, 2023 at 3:09 PM

[deleted] OP t1_jcknwkx wrote on March 17, 2023 at 3:06 PM

[deleted] OP t1_jcko3jr wrote on March 17, 2023 at 3:07 PM

Available_Lion_652 t1_jcjuxp5 wrote on March 17, 2023 at 11:03 AM

NotARedditUser3 t1_jckne7y wrote on March 17, 2023 at 3:02 PM

Available_Lion_652 t1_jckrfwd wrote on March 17, 2023 at 3:29 PM

Available_Lion_652 t1_jcjt3yi wrote on March 17, 2023 at 10:41 AM

kaoD t1_jcjvsmo wrote on March 17, 2023 at 11:13 AM

Available_Lion_652 t1_jcjw5cf wrote on March 17, 2023 at 11:17 AM

Single_Blueberry t1_jcjsxa1 wrote on March 17, 2023 at 10:39 AM

Available_Lion_652 t1_jcjtc6h wrote on March 17, 2023 at 10:44 AM

Single_Blueberry t1_jcjvh6o wrote on March 17, 2023 at 11:09 AM