error 404 page not found
Machine Learning
Community Rules:
- Be nice. No offensive behavior, insults or attacks: we encourage a diverse community in which members feel safe and have a voice.
- Make your post clear and comprehensive: posts that lack insight or effort will be removed. (ex: questions which are easily googled)
- Beginner or career related questions go elsewhere. This community is focused in discussion of research and new projects that advance the state-of-the-art.
- Limit self-promotion. Comments and posts should be first and foremost about topics of interest to ML observers and practitioners. Limited self-promotion is tolerated, but the sub is not here as merely a source for free advertisement. Such posts will be removed at the discretion of the mods.
While not a transformer, what about "Gaussian naive Bayes"? It's not the best classifier around but for some tasks - it's good enough. I used it to build a small search term classifier model which basically classifies e-commerce search terms against a category or tag.
You could take a look here: https://sparsezoo.neuralmagic.com/?modelSet=natural_language_processing&size=10727959%2C64684665&sort=Size%3Aasc
Smallest model there is 10MB
I think it's still the case that it has to run on x86, though I think there was talk of an arm runtime.
Also, spending on your exact needs, there might be licensing issues. Still probably worth a look though
That's very small for a trasformer, as a rule of thumb, this is meaning 25M parameters. Not sure there are similar ones
you can try this:
Could you please provide some more information as to your constraints? If space is an issue, you might be better off with a more memory friendly model, like an LSTM. You even have per-token attention with some models.
There's a really interesting sparkfun video which I'll look around for, showing a question-answering model using some sort of BERT(?) running on a Raspberry Pi Zero-type chip, with 25-50MB of flash memory.