modcowboy t1_jdkz6of wrote on March 25, 2023 at 3:49 AM
Reply to comment by MjrK in [D] I just realised: GPT-4 with image input can interpret any computer screen, any userinterface and any combination of them. by Balance-
It would probably be easier for the LLM to interact with the website directly through the browser's inspect tool (i.e. the page's DOM) than through machine vision training.