Programming

26701 readers

517 users here now

Welcome to the main community in programming.dev! Feel free to post anything relating to programming here!

Cross posting is strongly encouraged in the instance. If you feel your post or another person's post makes sense in another community cross post into it.

Hope you enjoy the instance!

Rules

Follow the programming.dev instance rules
Keep content related to programming in some way
If you're posting long videos try to add in some form of tldr for those who don't want to watch videos

Wormhole

Follow the wormhole through a path of communities !webdev@programming.dev

founded 2 years ago

MODERATORS

snowe@programming.dev

Ategon@programming.dev

UlrikHD@programming.dev

bugsmith@programming.dev

Spyro@programming.dev

149

The West Forgot How to Build. Now It's Forgetting Code (techtrenches.dev)

submitted 16 hours ago by HaraldvonBlauzahn@feddit.org to c/programming@programming.dev

38 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] e8d79@discuss.tchncs.de 9 points 11 hours ago (2 children)

So how would I create such an "Open Source" model? They don't share the data used to create them do they? Let's not even get started on how much computing power I would need to train one of those things. These selfhosted models solve nothing except some data privacy issues. Sure you no longer send all your code to a shady AI company but you are still 100% dependent on them sharing their models.

[–] The_Decryptor@aussie.zone 6 points 10 hours ago* (last edited 10 hours ago) (1 children)

So how would I create such an “Open Source” model? They don’t share the data used to create them do they?

No, and going by the OSI definition of "open source AI" they don't have to, acknowledging that the training material is often copyrighted and can't be shared.

It's a strange definition of "open source", one where you're not actually allowed to see the source.

[–] Eyekaytee@aussie.zone 3 points 9 hours ago* (last edited 8 hours ago)

The model is named Apertus – Latin for “open” – highlighting its distinctive feature: the entire development process, including its architecture, model weights, training data and methods, is openly accessible and fully documented.

https://ethz.ch/en/news-and-events/eth-news/news/2025/09/press-release-apertus-a-fully-open-transparent-multilingual-language-model.html

There is also a move into synthetic data and human trained so we will have to see where the training data goes copyright wise in the future

[+] Eyekaytee@aussie.zone -6 points 10 hours ago (1 children)

Do you build your own Linux from scratch? If so why would you assume you can build an LLM from scratch?

[–] qqq@lemmy.world 6 points 9 hours ago (1 children)

It's mad easy to build your own Linux from scratch in comparison to building an LLM. You can have your own distro running in like an hour. With buildroot you can have it in even less than that.

[+] Eyekaytee@aussie.zone -6 points 8 hours ago (1 children)

I have no idea what you're talking about

[–] qqq@lemmy.world 4 points 8 hours ago (1 children)

... Then why did you use it as an example?

[–] Eyekaytee@aussie.zone -2 points 8 hours ago (1 children)

Because the average person is not building Linux from scratch nor would they know how to

[–] qqq@lemmy.world 1 points 8 hours ago (1 children)

The average person wouldn't be building an open source LLM either. I don't think I follow. I was just saying that your comparison wasn't going to hit correctly at all due to how easy it actually is to build Linux and a full Linux distribution.

[–] Eyekaytee@aussie.zone 0 points 8 hours ago* (last edited 8 hours ago)

The average person wouldn’t be building an open source LLM either

Yeah that's why I'm saying:

Do you build your own Linux from scratch? If so why would you assume you can build an LLM from scratch?

The OP is basically saying it's not really open source unless I can personally build it! Which I am saying I don't think is a requirement of open source software (your personal ability to compile software does not negate from it it's open sourceness)

tbh I wouldn't have an idea on how to build either, they are way above my skill level, i have no idea how to make a linux distro either, but i'm certain most are open source

Today, we’re launching Unsloth Studio (Beta): an open-source, no-code web UI for training, running and exporting open models in one unified local interface.

https://unsloth.ai/docs/new/studio

This was only recently released, maybe in the future we'll have training material uber compressed down in an open source format that anyone with the skill and knowledge can use and different 'distro' releases of LLM's, we already have tons of smaller models especially from European Universities and others

The EuroHPC Joint Undertaking (JU) provides access to the computing time and support services offered by the EuroHPC AI Factories. The AI Factories are open to European users from various sectors, including industry, research, academia and public authorities.

https://digital-strategy.ec.europa.eu/en/policies/ai-factories

We are only like 3-4 years into AI going mainstream if that, afaik the heat death of the universe is at least 1000 years away, we have lots of time to work and improve on them, I can only wonder where they will be at in 100 years, so I try not to make any damning facebook boomer tier statements about the future