

/f/deeplearning

Do we really need 100B+ parameters in a large language model?

Submitted by Vegetable-Skill-9700 t3_121agx4 on March 25, 2023 at 4:24 AM in deeplearning

54 comments

43


fysmoe1121 t1_jdm2mtw wrote on March 25, 2023 at 12:16 PM

deep double descent => bigger is better

