this post was submitted on 20 Nov 2023
1 points (100.0% liked)

Machine Learning

1 readers
1 users here now

Community Rules:

founded 11 months ago
MODERATORS
 

I am a beginner to ML and cloud. I am planning on creating web APIs for some pre trained models i have gotten from the hugging face. My code need to take in text from other endpoints and return some classification data. I wanna deploy to google cloud, but unable to zero down on the best way. My options:-

  1. Dockerize. But then how do I scale it to use the gpus in the future.
  2. Colab Notebook. I was able to get the code working, but then how do I create an web endpoint? Missing knowledge here.
  3. Shud I just wrap my code in a flask endpoint and upload into an instance? Need some advice…
you are viewing a single comment's thread
view the rest of the comments
[–] grekt99@alien.top 1 points 10 months ago

Hey thank you so much for the info, that first link seems to be having wat I need. Will check it out.