wyrdwulf t1_jdikhuo wrote
Reply to comment by BullockHouse in [D] I just realised: GPT-4 with image input can interpret any computer screen, any userinterface and any combination of them. by Balance-
They had another model do that already.
BullockHouse t1_jdil2ok wrote
I'm familiar! I'm curious though if it can generalize well enough to play semi-competently without specialized training. Has implications for multi-modal models and robotics.
Viewing a single comment thread. View all comments