this post was submitted on 09 Nov 2023

Machine Learning

I'm a data engineer who somehow ended up as a software developer. Many of my friends are now working with the OpenAI API to add generative capabilities to their products, but they lack A LOT of context when it comes to how LLMs actually work.

This is why I started writing popular-science-style articles that unpack AI concepts for software developers working on real-world applications. It started kind of slow; honestly, I wrote a bit too "brainy" for them, but now I've found a voice that resonates with this audience much better, and I want to ramp up my writing cadence.

I would love to hear your thoughts on what concepts I should write about next.
What gets you excited, and what do you find hard to explain to someone with a different background?

[–] rejectedlesbian@alien.top 1 points 10 months ago (4 children)

Optimizers. OMG, no one has really touched optimizers in ages.
We basically figured it's Adam/SGD and there hasn't really been any improvement on them.

I tried finding an improvement to it myself for a few months but failed miserably.
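For readers who haven't seen the two update rules mentioned here side by side, here's a minimal NumPy sketch of SGD and Adam. The function names are just for illustration; the hyperparameter defaults follow the commonly used Adam values.

```python
import numpy as np

def sgd_step(w, grad, lr=0.01):
    # Plain SGD: take a fixed-size step against the gradient.
    return w - lr * grad

def adam_step(w, grad, m, v, t, lr=0.001, b1=0.9, b2=0.999, eps=1e-8):
    # Adam keeps running averages of the gradient (m) and of its
    # elementwise square (v).
    m = b1 * m + (1 - b1) * grad
    v = b2 * v + (1 - b2) * grad ** 2
    # Bias-correct the zero-initialized averages (t is the step count,
    # starting at 1).
    m_hat = m / (1 - b1 ** t)
    v_hat = v / (1 - b2 ** t)
    # Per-parameter step size: large where recent gradients were small.
    return w - lr * m_hat / (np.sqrt(v_hat) + eps), m, v
```

The key difference is that Adam rescales each parameter's step by the recent gradient magnitude, which is a big part of why it's so hard to beat as a robust default.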

[–] satireplusplus@alien.top 1 points 10 months ago

Because it's super hard to build something that works better than Adam across many tasks. There's probably no shortage of people trying to come up with something better.

[–] koolaidman123@alien.top 1 points 10 months ago

Nothing beats AdamW + compute. Plus, with the current data-centric approach, everything kind of converges at scale.
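The AdamW tweak is small: weight decay is "decoupled", applied directly to the weights each step rather than folded into the gradient. A sketch reusing Adam-style moment updates (names and defaults are illustrative):

```python
import numpy as np

def adamw_step(w, grad, m, v, t, lr=0.001, b1=0.9, b2=0.999,
               eps=1e-8, wd=0.01):
    # Same moment updates as Adam...
    m = b1 * m + (1 - b1) * grad
    v = b2 * v + (1 - b2) * grad ** 2
    m_hat = m / (1 - b1 ** t)
    v_hat = v / (1 - b2 ** t)
    # ...but the weight decay is decoupled: it shrinks the weights
    # directly instead of being added to the gradient, so it isn't
    # rescaled by the adaptive sqrt(v_hat) term.
    w = w - lr * wd * w
    return w - lr * m_hat / (np.sqrt(v_hat) + eps), m, v
```

With L2 regularization folded into the gradient, Adam's adaptive scaling distorts the effective decay per parameter; decoupling keeps it uniform, which is the whole point of AdamW.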

[–] charlesGodman@alien.top 1 points 10 months ago

There has been LOADS of research on deep learning optimisation in recent years. TL;DR, though: nothing beats Adam.

[–] currentscurrents@alien.top 1 points 10 months ago

Learned optimizers look promising - training a neural network to train neural networks.

Unfortunately they're hard to train and nobody has gotten them to really work yet. The two main approaches are meta-training or reinforcement learning, but meta-training is very expensive and RL has all the usual pitfalls of RL.
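To make the meta-training idea concrete, here's a deliberately tiny illustration. The "learned optimizer" is just a single learnable scalar step size, and the meta-gradient is estimated by finite differences instead of backpropagating through the unroll; real learned optimizers replace the scalar with a small neural network and are far harder to train, as noted above. All names (`inner_loss`, `unroll`, `meta_train`) are made up for this sketch.

```python
import numpy as np

def inner_loss(w):
    # Toy inner task the optimizer must solve: minimize ||w||^2.
    return float((w ** 2).sum())

def unroll(theta, w0, steps=10):
    # Run the "learned" update rule for a few inner steps and sum the
    # losses along the trajectory -- the usual meta-objective.
    w, total = w0.copy(), 0.0
    for _ in range(steps):
        grad = 2 * w            # gradient of ||w||^2
        w = w - theta * grad    # learned rule: one learnable step size
        total += inner_loss(w)
    return total

def meta_train(theta=0.01, meta_lr=1e-4, meta_steps=200, eps=1e-4):
    # Meta-training loop: nudge theta to shrink the unrolled loss on
    # freshly sampled tasks. Finite differences stand in for backprop
    # through the unroll, which is what makes the real thing expensive.
    rng = np.random.default_rng(0)
    for _ in range(meta_steps):
        w0 = rng.normal(size=3)
        g = (unroll(theta + eps, w0) - unroll(theta - eps, w0)) / (2 * eps)
        theta -= meta_lr * g
    return theta
```

Even in this toy, the two pain points show up: the meta-gradient requires evaluating whole inner trajectories, and its scale depends strongly on the unroll length, which is exactly what makes meta-training expensive and unstable at realistic scale.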