Post
  • From Twitter

This paper is a nice summary of what GPT-4’s vision capability can and can’t do. It does an impressive job on overall “reasoning” about images, but also gets details wrong and is open to adversarial attacks, as it focuses a lot on the text in the image.
