In this video, we take a look at 22+ examples of the most incredible use cases for ChatGPT Vision. Everything from ChatGPT doing homework for you to architecture to image-to-code. This is the most impressive launch since code interpreter.
Enjoy 🙂
Become a Patron ? – https://patreon.com/MatthewBerman
Join the Discord ? – https://discord.gg/xxysSXBxFW
Follow me on Twitter ? – https://twitter.com/matthewberman
Follow me on TikTok ? – https://www.tiktok.com/@matthewberman60
Subscribe to my Substack ?? – https://matthewberman.substack.com
Media/Sponsorship Inquiries ? – https://bit.ly/44TC45V
0:00 – Intro
0:39 – Reasoning & Human Nature
2:18 – Human Cell Diagram
2:57 – Food & Recipes
3:38 – Circuit Diagram
4:12 – Mushrooms and Effects
4:54 – Interior Design
5:30 – Human Brain Complex Diagram
6:22 – Complex Parking Signs (Logic)
6:57 – Math Homework
7:40 – Architecture
8:37 – Research Paper (Education)
9:15 – ChatGPT Voice Dialogue
10:12 – Architecture & Building
11:10 – Crossword Puzzle (Reasoning)
11:32 – Image to Code
12:03 – Design to Code
12:43 – Image Recognition
14:03 – Image to Code
14:49 – Movie/Character Recognition
15:24 – Poker & Strategy
16:12 – Image Recognition
16:58 – Image to Code
17:26 – Chart Analysis
17:53 – Final Thoughts
Links:
This is absolutely wild. I am completely speechless. pic.twitter.com/wGTAx1hFgS
— Pietro Schirano (@skirano) September 27, 2023
ChatGPT breaks down this diagram of a human cell for a 9th grader.
This is the future of education. pic.twitter.com/L0Za0ZB5rs
— Mckay Wrigley (@mckaywrigley) September 28, 2023
ChatGPT-V Multimodal discerns an entire recipe from just a picture.
We are in a new world. pic.twitter.com/rpiQHMBTHy
— Brian Roemmele (@BrianRoemmele) September 28, 2023
ChatGPT takes 10 seconds to comfortably break down something that would take you 10 minutes to find and understand from an electronics textbook.
This is the future of learning. pic.twitter.com/JkcPWqwped
— Aadit Sheth (@aaditsh) September 29, 2023
GPT-4 vision truly "hallucinating" on mushrooms. 🍄 pic.twitter.com/zKLeJepYmC
— Pietro Schirano (@skirano) September 29, 2023
GPT-4 vision for interior design. đźŹ
I love how it's incorporating what it knows about me in the suggestion because of custom instructions.
Really incredible technology. pic.twitter.com/aAFI5ZgPLW
— Pietro Schirano (@skirano) September 28, 2023
https://t.co/NiThCQwHUA pic.twitter.com/3g42lHPP3T
— Teknium (e/λ) (@Teknium1) September 28, 2023
I will never get a parking ticket again. pic.twitter.com/yl7ND2rJeQ
— Peter Yang (@petergyang) September 27, 2023
OMG https://t.co/kDuustCuM4 pic.twitter.com/Ch2i7oWaPS
— Pietro Schirano (@skirano) September 28, 2023
Using GPT-4 Vision to name never-before-seen architectural styles created with Midjourney.
It excels at identifying diverse elements and assigning names to these distinctive creations. 🏛️✨ pic.twitter.com/lLb4p8Etkf
— Pietro Schirano (@skirano) September 27, 2023
Wow it can read text on an image of a paper too small for me to see :O pic.twitter.com/0qULXtEwJg
— Teknium (e/λ) (@Teknium1) September 27, 2023
ChatGPT having a conversation with an other instance of itself…
This will be a new form of entertainment.
Live streaming AI agents talking to each other.
Via @gopatrik
— Linus Ekenstam (@LinusEkenstam) September 27, 2023
ChatGPT is insane
I gave it the plans of a house and asked for instructions on how to build it
Here's my beautiful house a few home depot trips later and 2 days of work
The future is here pic.twitter.com/vmzkThvfxL
— gaut (@0xgaut) September 27, 2023
I didnt take the time to validate its answers but it's going https://t.co/lOqXmGrbOE pic.twitter.com/p2qMgnlBd4
— Teknium (e/λ) (@Teknium1) September 27, 2023
This guy gave ChatGPT a screenshot of a UI design.
ChatGPT returned with the code to recreate the UI.
This is the future.pic.twitter.com/KUAmJEaBpV
— Aadit Sheth (@aaditsh) September 27, 2023
Update: GPT-4 Vision can absolutely convert figma designs into working React components.
On the left, the design. On the right: the output.
I specifically asked it to write the component in React using MUI components, and gave it little other direction.
It even correctly… pic.twitter.com/dgtBY3gpZy
— Gabriel Garrett (@GabGarrett) September 27, 2023
Ok… I am impressed.
I was testing how much GPT can actually "see" using one of these viral ControlNet/logo images. It took some nudging but it got it.
"Thank you for pointing it out."
Not sure how I feel about this lol pic.twitter.com/0ex378JiCP
— Pietro Schirano (@skirano) September 27, 2023
it’s over for engineers
— gaut (@0xgaut) September 27, 2023
You can make GPT-4 visions answer questions it shouldn't by writing some specific custom instructions.
I won't share how I did it, but it's possible. pic.twitter.com/Dg0wjOzYLx
— Pietro Schirano (@skirano) September 27, 2023
OK, just got GPT-4 with vision, and it is both awesome and limited in the way Bing has been (no surprise, they are the same system), but it may be a bit more capable.
It does amazingly at many things, but also can mess up tiny details which leads to hallucinations.
Some tests: pic.twitter.com/uHy3r4ELkx
— Ethan Mollick (@emollick) September 27, 2023
Gave GPT-4V an image of my day planner, it made the code, it worked pic.twitter.com/v70r6114UZ
— Teknium (e/λ) (@Teknium1) September 27, 2023
https://twitter.com/michael_gaio/status/1706736537810223613