this post was submitted on 28 May 2026
14 points (67.5% liked)
Programming
27088 readers
413 users here now
Welcome to the main community in programming.dev! Feel free to post anything relating to programming here!
Cross posting is strongly encouraged in the instance. If you feel your post or another person's post makes sense in another community cross post into it.
Hope you enjoy the instance!
Rules
Rules
- Follow the programming.dev instance rules
- Keep content related to programming in some way
- If you're posting long videos try to add in some form of tldr for those who don't want to watch videos
Wormhole
Follow the wormhole through a path of communities !webdev@programming.dev
founded 3 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Qwen 3.6 and gemma4 models are the only ones usable for agentic prog sessions that I and my employer run locally. It's less stable and slower than third-party services, even on much better hardware (as it's with my employer). The best way is to go with a provider hosting deepseek flash/pro if your privacy policy allows though. It's going to be hard to beat their price.
I thought those didn't support tool calling. Has that changed?
they do
How many concurrent users and what hardware if i may ask?
it's an h100, I think, no idea about how many users
in my personal setup i use quantized versions on a 3080, which is not great, so I still lean a lot on APIs