fundamental_entropy t1_jasqy64 wrote on March 3, 2023 at 8:12 PM

Reply to comment by average-joee in What do you recommend for a text summarization task? by average-joee

Flan models are trained in almost every open dataset available in Generic English tasks. Recent research suggests models trained to perform multiple tasks (in fact ratios of different tasks too affect see flan 2022 paper) are better than models trained only on a given task. Flan T5 beats T5 in almost every task and sometimes Flan T5 XXL matches gpt3 type of prompt generation.

average-joee OP t1_jasr95t wrote on March 3, 2023 at 8:14 PM

Many thanks for your input!