starstruckmon t1_jcswg1g wrote on March 19, 2023 at 8:54 AM

Reply to comment by Taenk in [Research] Alpaca 7B language model running on my Pixel 7 by simpleuserhere

I've heard from some experienced testers that the 33B model is shockingly bad compared to even the 13B one, despite what the benchmarks say. Their advice is to either use the 65B one (very good, apparently) or stick to 13B/7B. Not for any technical reason, just because of the random luck involved in training these models and the resulting quality.

I wonder if there's any truth to it. If you've tested it yourself, I'd love to hear what you thought.

Taenk t1_jctdmvi wrote on March 19, 2023 at 12:38 PM

I haven't tried the larger models, unfortunately. However, I wonder how the model could be "shockingly bad" despite having roughly 2.5 times the parameter count.

starstruckmon t1_jcte34d wrote on March 19, 2023 at 12:42 PM

🤷

Sometimes models just come out crap. Like BLOOM, which has almost the same number of parameters as GPT-3 but is absolute garbage in any practical use case. Like a kid from two smart parents who turns out dumb. Just blind chance.

Or they could be wrong. 🤷