[R] [N] In this paper, we show how a conversational model, 3.5x smaller than SOTA, can be optimized to outperform the baselines through Auxiliary Learning. Published in the ACL Anthology: "Efficient Task-Oriented Dialogue Systems with Response Selection as an Auxiliary Task."
Submitted by radi-cho t3_116kqvm in MachineLearning
Viewing a single comment thread. View all comments