btcmx

joined 10 months ago
[–] btcmx@alien.top 1 points 10 months ago

This post has a number of ML topics/skills you might (sooner or later) run into. In a sense it's kind of a roadmap/blueprint containing what you need to become an ML Engineer, (beyond learning a few topics/basics from a one(or more) courses.

 

I keep diving and finding GPT-4V prototypes shared on X: e.g. narration for videos (source), posture correction (source), etc.

As foundation models in computer vision become even more accessible, will the field recover some attention (wrt to LLMs hype)?

[–] btcmx@alien.top 1 points 10 months ago

As other have rightly pointed out, verify you're using the Data Loader the right way. Ideally you need to create a custom dataset (in PyTorch terms) and apply all the transformations in this custom dataset. This might be helpful. Also, have you tried PyTorch Lightning?

[–] btcmx@alien.top 1 points 10 months ago

For CV, Jetson family! The following source has a high level overview of challenges and solutions when deploying NVIDIA TAO on Jetson devices.

 

While diffusion models (e.g. Stable Diffusion) are all the rage, they don't seem to be prepared for downstream tasks. ControlNet looks great (on paper), but open-source implementations for mere mortals aren't ready for prime time.

Do you have examples that show the contrary? Will FAANGs and not-really-open Research Labs be the only ones capable of making it happen?