maskedpaki t1_j8by3r7 wrote on February 13, 2023 at 4:04 AM

This is like 2 weeks old. If it really does surpass GPT-3 with under a billion parameters, then why isn't this in the headlines?

d00m_sayer t1_j8bzbnm wrote on February 13, 2023 at 4:15 AM

Because people cannot use it yet 🙄

maskedpaki t1_j8c0eso wrote on February 13, 2023 at 4:24 AM

I've seen so many things like this that end up surpassing GPT-3 only on some narrow benchmark with more optimised prompting, rather than just being a better model overall.

I hope I'm wrong this time

94746382926 t1_j8c1t4r wrote on February 13, 2023 at 4:36 AM

Yeah we need more benchmarks.

beezlebub33 t1_j8d62pw wrote on February 13, 2023 at 12:59 PM

Benchmarks are really hard and expensive. And they are not fun or exciting for the people involved; the groups that make them really deserve more credit.

gay_manta_ray t1_j8c1uv8 wrote on February 13, 2023 at 4:37 AM

This benchmark seems pretty comprehensive.

maskedpaki t1_j8g2v6v wrote on February 14, 2023 at 1:14 AM

No, it actually seems like a pretty narrow science benchmark.

If you told me the zero-shot MMLU score was higher than the 175-billion-parameter GPT-3.5 with under a billion parameters, then I'd be absolutely shocked.

Ok_Criticism_1414 OP t1_j8bywbq wrote on February 13, 2023 at 4:11 AM

Because of the ChatGPT hype? Who knows. I think OpenAI has already done it; they just aren't showing it to the public. The main thing, I guess, is that Amazon took a different approach, integrating two modalities by pre-finetuning the model to be multimodal. You can read about it in the paper. Also, it looks like language plus visual context gives a huge boost. But that has already been done by the Flamingo model, so I guess the first point is the crucial one.

turnip_burrito t1_j8bz5rv wrote on February 13, 2023 at 4:13 AM

10 days old, going by version 1 on arXiv.