PassingTumbleweed t1_j41pibv wrote
Yes. This thread made me think of Universal Transformers which has dynamic halting and has been around for a while now: https://openreview.net/forum?id=HyzdRiR9Y7
Raphaelll_ t1_j45u38j wrote
Did this ever get any traction?
PassingTumbleweed t1_j46sco1 wrote
That depends on what you mean. I don't think any of the LLMs use it, but it has some citations and follow-up literature.
Viewing a single comment thread. View all comments