overview for pm_me_your_pay

[R] Rethinking Open'sAI's Q-Learning : Insights from the Award-Winning 'Non-delusional Q-learning' Paper in c/machinelearning@academy.garden

[–] pm_me_your_pay_slips@alien.top 1 points 11 months ago (1 children)

it's very likely something like this: https://arxiv.org/pdf/2305.18290.pdf

Or finetuning on high quality datasets

[D] interview question: deploying LLM in c/machinelearning@academy.garden

[–] pm_me_your_pay_slips@alien.top 1 points 11 months ago

If you're asking that question here, you ma not be qualified for the job.

[R] Beyond U: Making Diffusion Models Faster & Lighter in c/machinelearning@academy.garden

[–] pm_me_your_pay_slips@alien.top 1 points 1 year ago (1 children)

Care to write a clear explanation of the method here?

[D] What is the future for ML researchers and startups? in c/machinelearning@academy.garden

[–] pm_me_your_pay_slips@alien.top 0 points 1 year ago (1 children)

Join a startup to work on these things. You'll very quickly realize why people are still pursuing PhDs in the field.

[R] Beyond U: Making Diffusion Models Faster & Lighter in c/machinelearning@academy.garden

[–] pm_me_your_pay_slips@alien.top 0 points 1 year ago (3 children)

This is gibberish. Was this a paper written by ChatGPT?

[D] Arbitrary Channel count in network needs to be reduced to 1 channel in c/machinelearning@academy.garden

[–] pm_me_your_pay_slips@alien.top 1 points 1 year ago

Use a transformer layer for aggregation if you want a learnable way of pooling them. Positional encoding and masking should help you with ensuring that order influences the prediction.

[D] Why choose an H100 over an A100 for LLM inference? in c/machinelearning@academy.garden

[–] pm_me_your_pay_slips@alien.top 1 points 1 year ago

A100s and H100s are great for training, but a bit of a waste for inference.