I gave GPT-4 eyes. Here’s what I did:
- added some data to a vision model
- gave the AI camera access
- asked it questions about the scene
- it identified objects
- it searched the web for info
- it used that info to answer accurately

Watch it get 3 questions 100% correct!
And just to clarify: the vision stuff isn’t GPT-4’s work. It can’t access your camera. I hooked up a *separate* vision model that handles the camera feed. Paired the two to make the idea work.
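A minimal sketch of how a pairing like this might work (hypothetical stand-ins only — `detect_objects` and `ask_llm` are placeholders, not the actual models or code used in the demo):

```python
# Hypothetical sketch: a separate vision model detects objects in a camera
# frame, and its text output is injected into the LLM prompt. detect_objects
# and ask_llm are stand-ins for real model/API calls.

def detect_objects(frame):
    """Stand-in for a vision model (e.g. an object detector)."""
    # A real implementation would run the frame through a detection model
    # and return labels with confidence scores.
    return [("laptop", 0.97), ("coffee mug", 0.91)]

def build_prompt(detections, question):
    """Turn detector output into plain text the LLM can reason over."""
    scene = ", ".join(f"{label} ({conf:.0%})" for label, conf in detections)
    return f"Objects visible in the camera frame: {scene}.\nQuestion: {question}"

def ask_llm(prompt):
    """Stand-in for a GPT-4 API call."""
    return f"(model answer based on: {prompt!r})"

detections = detect_objects(frame=None)  # frame would come from the camera
prompt = build_prompt(detections, "What is on the desk?")
print(ask_llm(prompt))
```

The key point is that GPT-4 never sees pixels here — it only sees the detector's labels as text, which matches the clarification above.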
@mckaywrigley This is hyperbole marketing. Why not stay focused on your startup?
@mckaywrigley Which vision models are you using?
@mckaywrigley Can you archive all this data with timestamps, then every 15 seconds have GPT look at it and summarize what happened in the scene during those 15 seconds? The summary can then go into its "memory". Repeat. Then after 1 minute you ask: "what happened in the last minute?"
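The rolling-memory scheme proposed above could be sketched roughly like this (a hypothetical sketch — `summarize` stands in for a GPT call, and the data structures are my own assumptions):

```python
# Hypothetical sketch of the proposed rolling memory: buffer timestamped
# scene descriptions, summarize every 15 seconds, and answer questions
# over the accumulated summaries. summarize() stands in for a GPT call.
from collections import deque

WINDOW = 15  # seconds per summary chunk

def summarize(events):
    """Stand-in for asking GPT to summarize a 15-second chunk."""
    return "; ".join(desc for _, desc in events)

class SceneMemory:
    def __init__(self):
        self.buffer = []          # (timestamp, description) pairs not yet summarized
        self.summaries = deque()  # (chunk_start_time, summary) pairs

    def record(self, timestamp, description):
        self.buffer.append((timestamp, description))
        # Once the buffer spans a full window, summarize it and start fresh.
        if timestamp - self.buffer[0][0] >= WINDOW:
            self.summaries.append((self.buffer[0][0], summarize(self.buffer)))
            self.buffer = []

    def last_minute(self, now):
        """Answer 'what happened in the last minute?' from stored summaries."""
        recent = [s for start, s in self.summaries if now - start <= 60]
        return " | ".join(recent)

mem = SceneMemory()
for t, desc in [(0, "person enters"), (15, "person sits"), (30, "cat appears")]:
    mem.record(t, desc)
print(mem.last_minute(now=31))  # prints the summary of the first 15s chunk
```

Old summaries could themselves be re-summarized into coarser chunks to keep the "memory" from growing without bound.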
@mckaywrigley Be curious what the prompt looked like — is it all the detected objects?
@mckaywrigley Stunning! What an adventure this AI is. Have you shared any of your cobbling methods, IDE, Git repo, or languages? I'm very curious indeed. I'm using VSCode, GitHub, and learning the ways of Python. Thinking I'd like to start tinkering with it all myself. Leg up?