Post by Ethan Mollick: This paper is a nice summary of what GPT-4’s vision capability can and can’t do It does an impressiv...

Post

Ethan Mollick @emollick · Nov 17, 2023

From Twitter

This paper is a nice summary of what GPT-4’s vision capability can and can’t do It does an impressive job on overall “reasoning” about images, but also gets details wrong and is open to adversarial attacks as it focuses on lot on the text in the image

Paper Jun 2, 2023

Grounded Intuition of GPT-Vision’s Abilities with Scientific Images

by Alyssa Hwang and 2 others

Post

Replies