koehr

joined 1 year ago

[–] koehr@alien.top 1 points 11 months ago (1 children)

I don't think so (unfortunately). The model size doesn't change, only the way it is traversed.

submitted 11 months ago by koehr@alien.top to c/localllama@poweruser.forum

15 comments fedilink

"UltraFastBERT", apparently a variant of BERT, that uses only 0.3% of it's neurons during inference, is performing on par with similar BERT models.

I hope that's going to be available for all kinds of models in the near future!

[–] koehr@alien.top 1 points 1 year ago

Assuming it will be open.