this post was submitted on 08 Nov 2023

Machine Learning


I keep digging around and finding GPT-4V prototypes shared on X, e.g. narration for videos (source), posture correction (source), etc.

As foundation models in computer vision become even more accessible, will the field recover some attention (relative to the LLM hype)?

[–] glitch83@alien.top 1 points 10 months ago

Maybe? Vision has been around a lot longer than NLP in industry. It’s permeated some challenging areas, like embedded and edge spaces, due to privacy and requirements. If the foundation models can’t run on the edge, then I can imagine them only affecting a small portion of vision applications.