SAbdusSamad
SAbdusSamad OP t1_j759v4v wrote
Reply to comment by Erosis in [D] Understanding Vision Transformer (ViT) - What are the prerequisites? by SAbdusSamad
I agree that having a background in RNNs and attention with RNNs can make the learning process for transformers, and by extension ViT, much easier.
SAbdusSamad OP t1_j75922f wrote
Reply to comment by atharvat80 in [D] Understanding Vision Transformer (ViT) - What are the prerequisites? by SAbdusSamad
These courses seem to have excellent content. I will definitely consider these as great resources.
SAbdusSamad OP t1_j758r29 wrote
Reply to comment by the_architect_ai in [D] Understanding Vision Transformer (ViT) - What are the prerequisites? by SAbdusSamad
Great advice. This seems to be a good starting point.
SAbdusSamad OP t1_j757w05 wrote
Reply to comment by SimonJDPrince in [D] Understanding Vision Transformer (ViT) - What are the prerequisites? by SAbdusSamad
I recently obtained a PDF of the book and began searching for information on ViT. Unfortunately, it appears that the book does not cover this topic. However, I plan to utilize the Transformer chapter to gain an understanding of ViT.
SAbdusSamad OP t1_j71z0zp wrote
Reply to comment by JustOneAvailableName in [D] Understanding Vision Transformer (ViT) - What are the prerequisites? by SAbdusSamad
Well, I do have idea about CNNs. I have limited knowledge of RNNs. But I don't have knowledge of Attention is All You Need.
Submitted by SAbdusSamad t3_10siibd in MachineLearning
SAbdusSamad OP t1_j79ub7q wrote
Reply to comment by SimonJDPrince in [D] Understanding Vision Transformer (ViT) - What are the prerequisites? by SAbdusSamad
I apologize for that oversight. Yes, the book does cover Transformers for images.