RegisteredJustToSay

joined 2 years ago

[R] Fine-tuning transformer-based models in c/machinelearning@academy.garden

[–] RegisteredJustToSay@alien.top 1 points 2 years ago

There's some good recent papers on how to tackle this. My favourite paper on the topic is probably here: Robust fine-tuning of zero-shot models - arXiv https://arxiv.org/pdf/2109.01903

But tl;dr: a weighted average of fine-tuned weights and original weights through a manually chosen weight tend to greatly mitigate this problem. I was surprised the paper didn't get more attention when it came out, but oh well.

permalink
fedilink
source
context