this post was submitted on 08 Nov 2023

Machine Learning

I keep digging and finding GPT-4V prototypes shared on X, e.g. narration for videos (source), posture correction (source), etc.

As foundation models in computer vision become even more accessible, will the field win back some attention relative to the LLM hype?

glitch83@alien.top, 10 months ago

Maybe? Vision has been around in industry a lot longer than NLP. It's permeated some challenging areas, like embedded and edge deployments, because of privacy and other requirements. If foundation models can't run on the edge, then I can imagine them affecting only a small portion of vision applications.