Submitted by michaelthwan_ai t3_121domd in MachineLearning
tonicinhibition t1_jdn4v86 wrote
Reply to comment by Veggies-are-okay in [N] March 2023 - Recent Instruction/Chat-Based Models and their parents by michaelthwan_ai
There's a YouTuber named Letitia, with a little Miss Coffee Bean character, who covers new models at a decent level.
CodeEmporium does a great job at introducing aspects of the GPT/ChatGPT architecture with increasing depth. Some of the videos have code.
Andrej Karpathy walks you through building GPT in code.
As for the lesser-known models, I just read the abstracts and skim the papers. It's a lot of the same stuff with slight variations.
michaelthwan_ai OP t1_jdpy5dy wrote
Thanks for sharing the above!
My choice is yk (Yannic Kilcher). His "AI News" videos give a brief introduction, and he sometimes goes through certain papers in detail. Very insightful!