this post was submitted on 10 Nov 2023
1 points (100.0% liked)

Machine Learning

1 readers
1 users here now

Community Rules:

founded 1 year ago
MODERATORS
 

I'm new to image classification and ML and this is going to be my first project on those topics. I'm considering using VGG16 because I saw some studies showing that it has a generally great accuracy score (80-95%) but I'm worried that the model might not be fast enough or the app file size might get massive if I want the app to be usable without internet connection.

What do you guys think?

top 5 comments
sorted by: hot top controversial new old
[–] ImaSakon@alien.top 1 points 1 year ago

VGG16 is outdated. CNNs based on architectures like EfficientNet and MobileNetV3 have superior accuracy. If attention mechanisms are acceptable in the network, Vision Transformers such as MobileViT are excellent.

[–] shubham0204_dev@alien.top 1 points 1 year ago

You may use MobileNet models as they use separable convolutions, which have lesser parameters and execution time than simple/regular convolutions. Moreover, MobileNets are easy to train and setup (tf.keras.applications.* has a pre-trained model) and can be used as a backbone model for fine-tuning on datasets other than the ImageNet.

Further, you can also explore quantization and weight pruning. These are some techniques that can be used to optimize models to have a smaller memory footprint and smaller execution time on embedded devices.

[–] puppet_pals@alien.top 1 points 1 year ago

No try mobile net

[–] IndieAIResearcher@alien.top 1 points 1 year ago

Try mobilenet of mobile vit

[–] dryden4482@alien.top 1 points 1 year ago

Look at tflight model maker. It will walk you through everything you need to make a light weight mobile friendly image classifier.

https://www.tensorflow.org/lite/models/modify/model_maker/image_classification