Submitted by wowimsupergay t3_127lmbl in singularity
wowimsupergay OP t1_jefz9vi wrote
Reply to comment by Ortus14 in What if language IS the only model needed for intelligence? by wowimsupergay
what I'm talking about is literally giving GPT eyes. right now it is multimodal because we can pass back RGB values and waveforms in bytes (so text). fundamentally, though, GPT is not hearing or seeing anything. but I totally get what you're saying, and I do think multimodal intelligence is the way to go.
also, thank you for letting me know that multimodal intelligences use less computation per task (or rather, make better use of computation). I did not know that.
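
to make the "RGB values in bytes (so text)" point concrete, here is a toy sketch of what that kind of serialization could look like. everything in it is an assumption for illustration: the 2x2 image is made up, the helper names `pixels_to_text` and `text_to_pixels` are hypothetical, and base64 is just one possible way to turn bytes into text. it is not a description of how any GPT model actually ingests images.

```python
import base64

# Hypothetical 2x2 image: each pixel is an (R, G, B) triple in 0..255.
pixels = [
    [(255, 0, 0), (0, 255, 0)],
    [(0, 0, 255), (255, 255, 255)],
]


def pixels_to_text(rows):
    """Flatten RGB triples into raw bytes, then base64-encode them as ASCII text."""
    raw = bytes(channel for row in rows for pixel in row for channel in pixel)
    return base64.b64encode(raw).decode("ascii")


def text_to_pixels(text, width):
    """Invert pixels_to_text: decode the text and regroup bytes into rows of RGB triples."""
    raw = base64.b64decode(text)
    triples = [tuple(raw[i:i + 3]) for i in range(0, len(raw), 3)]
    return [triples[i:i + width] for i in range(0, len(triples), width)]


encoded = pixels_to_text(pixels)            # a plain string a text-only model could receive
decoded = text_to_pixels(encoded, width=2)  # the original pixel grid, recovered losslessly
```

the round trip is lossless, but the model only ever receives a string of characters. nothing in this path resembles seeing, which is the distinction the comment is drawing.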