Viewing a single comment thread. View all comments

Akimbo333 t1_j8p643a wrote

The huge performance boost of a mere 738M model, appears to be due to it being a multimodal model. Which can use not only text but image and other means as well.

1