masterofn1 t1_jdu8jug wrote on March 27, 2023 at 6:08 AM Reply to [D] Simple Questions Thread by AutoModerator How does a Transformer architecture handle inputs of different lengths? Is the sequence length limit inherent to the model architecture or more because of resource issues like memory? Permalink 2
masterofn1 t1_jdu8jug wrote
Reply to [D] Simple Questions Thread by AutoModerator
How does a Transformer architecture handle inputs of different lengths? Is the sequence length limit inherent to the model architecture or more because of resource issues like memory?