this post was submitted on 16 Nov 2023
1 points (100.0% liked)

Machine Learning

1 readers
1 users here now

Community Rules:

founded 1 year ago
MODERATORS
 

I'm looking for suggestions for a transformer model that I can fine-tune for a text classification task. Due to hardware constraints the model has to be fairly small. Something in the order of a 50 MB weight file.

you are viewing a single comment's thread
view the rest of the comments
[–] NoIdeaAbaout@alien.top 1 points 1 year ago

That's very small for a trasformer, as a rule of thumb, this is meaning 25M parameters. Not sure there are similar ones

you can try this:

https://arxiv.org/pdf/2006.03236.pdf