KD_A OP t1_jeggt1k wrote

Yes, exactly. There's nothing else to it haha

I only wish the API had an interface to let you cache the prompt's attention keys and values (the KV cache). That'd save you money and make CAPPr strictly cheaper than sampling for classification tasks.
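With a local model you can already do this via Hugging Face transformers' `past_key_values`. A minimal sketch of the idea, not CAPPr's actual code; the model name, prompt, and class completions are all placeholders:

```python
# Rough sketch of prompt KV caching with a local Hugging Face causal LM.
# The model name, prompt, and class completions are placeholders.
import copy

import torch
import torch.nn.functional as F
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

prompt = "This product review is"  # placeholder
completions = [" positive", " negative"]  # placeholder classes

with torch.no_grad():
    prompt_ids = tokenizer(prompt, return_tensors="pt").input_ids
    prompt_out = model(prompt_ids, use_cache=True)  # run the prompt once

    for completion in completions:
        # Copy the cache: newer transformers versions mutate it in place.
        past = copy.deepcopy(prompt_out.past_key_values)
        comp_ids = tokenizer(completion, return_tensors="pt").input_ids
        # Feed only the completion tokens; attention reuses the cache.
        comp_out = model(comp_ids, past_key_values=past)
        # The last prompt position predicts the first completion token;
        # each completion position (except the last) predicts the next.
        logits = torch.cat(
            [prompt_out.logits[:, -1:], comp_out.logits[:, :-1]], dim=1
        )
        log_probs = F.log_softmax(logits, dim=-1)
        token_log_probs = log_probs.gather(2, comp_ids.unsqueeze(-1))
        print(completion, token_log_probs.mean().item())
```

The prompt only gets one forward pass no matter how many classes there are; each class then costs a pass over just its own few tokens.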

2

PassingTumbleweed t1_jegonam wrote

Cool! I wonder if you've thought about synonyms. It seems like there might be a lot of cases where classes with more synonyms (or even cases of plurality, e.g., bird vs. birds) are at a disadvantage.

2

KD_A OP t1_jegsqe6 wrote

That's a good criticism. I'd guess that this issue is quite problem-dependent, and I'd hope that an LM is good enough to discriminate between the correct-but-many-synonyms class and the wrong-but-few-synonyms class. (We're using the word "synonym", but we really mean "high-probability token path given the prompt".) It's hard for me to come up with examples where this problem arises in a real classification task, but they may be out there.

2

PassingTumbleweed t1_jegvhb5 wrote

What I was thinking is that some kind of hierarchical LLM taxonomy might be interesting, where you can re-jigger the conditional probability tree onto an arbitrary vocabulary of token sequences.

2

KD_A OP t1_jegxas8 wrote

Interesting, and I think I know what you mean. One naive idea is a "top-k tokens" system: for each completion, consider the k highest-probability tokens (conditional on previous ones) at each completion token position, then sum the average likelihoods across all k^n paths (n = number of completion tokens). That would be one way to address this synonym problem. But of course it results in way more computation.
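A rough sketch of just the path enumeration part (placeholder model and prompt; mapping the enumerated paths back onto classes is the part that'd still need working out):

```python
# Rough sketch of top-k path enumeration (placeholder model and
# prompt; naive recursion, so cost grows as k**depth).
import torch
import torch.nn.functional as F
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()


@torch.no_grad()
def topk_paths(input_ids, k: int, depth: int):
    """Enumerate all k**depth continuations of input_ids, returning
    (token path, summed log-probability) pairs."""
    if depth == 0:
        return [([], 0.0)]
    logits = model(input_ids).logits[0, -1]  # next-token logits
    log_probs = F.log_softmax(logits, dim=-1)
    top = torch.topk(log_probs, k)
    paths = []
    for log_prob, token in zip(top.values.tolist(), top.indices.tolist()):
        next_ids = torch.cat([input_ids, torch.tensor([[token]])], dim=1)
        for rest, rest_log_prob in topk_paths(next_ids, k, depth - 1):
            paths.append(([token] + rest, log_prob + rest_log_prob))
    return paths


prompt = "The animal in the photo is a"  # placeholder
prompt_ids = tokenizer(prompt, return_tensors="pt").input_ids
for path, log_prob in topk_paths(prompt_ids, k=3, depth=2):
    print(repr(tokenizer.decode(path)), log_prob / 2)  # avg per token
```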

Edit: actually, thinking a bit more, I think the synonym problem is more-or-less a non-issue for LMs trained to do next-token prediction.

2

PassingTumbleweed t1_jeh0p1j wrote

I'm curious to get your thoughts on a simple example where you have three classes: cat, dog, and bird. What happens if the top-1 prediction is "eagle"? Does that probability mass get discarded? Because it should probably go into the bird category.

1

KD_A OP t1_jeh0ygl wrote

Yup, it gets totally discarded. Hopefully, the conditional probability of bird is higher than that of cat or dog.
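To make the discarding concrete, here's a rough sketch (placeholder model and prompt, and it assumes each label happens to be a single token in the vocabulary):

```python
# Rough sketch of the discarded-mass point (placeholder model and
# prompt). Only the class tokens' probabilities are read off the
# next-token distribution; mass on anything else (e.g. " eagle")
# contributes to no class.
import torch
import torch.nn.functional as F
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

prompt = "It has feathers and a beak. It's a"  # placeholder
with torch.no_grad():
    input_ids = tokenizer(prompt, return_tensors="pt").input_ids
    next_token_probs = F.softmax(model(input_ids).logits[0, -1], dim=-1)

for word in [" cat", " dog", " bird", " eagle"]:
    # Assumes one token; multi-token labels would need chaining.
    token_id = tokenizer(word).input_ids[0]
    print(word, next_token_probs[token_id].item())
# " eagle"'s mass goes to none of the three classes; the hope is
# that " bird" still comes out on top.
```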

2

PassingTumbleweed t1_jeh1248 wrote

One thing I've seen with these LLMs is that you can prompt them with the classes using sort of a multiple-choice style. It would be interesting to experiment with whether this stabilizes the outputs and reduces the number of out-of-vocabulary predictions you get.
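For example, something like this (wording is purely illustrative):

```python
# Illustrative multiple-choice style prompt (wording is a placeholder):
prompt = """Classify the animal in the text. Answer with exactly one of:
cat, dog, bird.

Text: An eagle soared over the lake.
Answer:"""
```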

2