this post was submitted on 03 Jun 2026
868 points (99.9% liked)
People Twitter
10030 readers
1585 users here now
People tweeting stuff. We allow tweets from anyone.
RULES:
- Mark NSFW content.
- No doxxing people.
- Must be a pic of the tweet or similar. No direct links to the tweet.
- No bullying or international politcs
- Be excellent to each other.
- Provide an archived link to the tweet (or similar) being shown if it's a major figure or a politician. Archive.is the best way.
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
It might not be as impossible as it sounds. Some of the "open" models are rumored to be able to code. The real problem is that you likely need something with 128 GiB VRAM to run them with a reasonably large context window.
An Nvidia B200 (192 Gigs of RAM) sells somewhere between 30-50k a pop. That's feasible for a company.
And then you can serve one inference at a time. Hopefully your devs are well distributed over timezones :-)
Wonderfull idea, may be they can connect to the same PC, and we can call it main frame or something. xD
I don't see why it wouldn't be feasible to rent someone else's computer to use for something like this, seeing how it could amortize costs over time.
Qwen's 27B model from April outperforms its 397B model from February.
Local and small were always going to win.
Qwen 3.6 ? It is unstable though. It go awry more often than the 3.5 of the same size.