takezo07

joined 10 months ago
[โ€“] takezo07@alien.top 1 points 9 months ago

I'm surprised there is not more options....
As there is LLMs for almost everything!

[โ€“] takezo07@alien.top 1 points 10 months ago (1 children)

I found Blip: https://replicate.com/salesforce/blip?input=form&output=preview
But that's not exactly what i'm looking for. It does image captioning very well.
Like in the their example: "a woman sitting on the beach with a dog".
But i need a list of objects and "things" like : dog, woman, beach, wave, shirt...etc.

 

Hello,

I'm looking for an alternative to Google Vision AI (LABEL_DETECTION, OBJECT_LOCALIZATION) and Amazon Rekognition (DetectLabels).
Any ideas?

Thanks!