this post was submitted on 31 Oct 2023
2 points (100.0% liked)

Machine Learning

1 readers
1 users here now

Community Rules:

founded 1 year ago
MODERATORS
 

Does anyone know where to find the best models for unsupervised clustering problems that don't specify the number classes? For example I googled unsupervised MNIST but IIC which holds the record requires the output dimension (k=10) to be specified? Is there a name for unsupervised clustering without knowing the number of classes? (I know of density/hierarchical clustering algorithms but am unaware of many deep learning ones) And specifically are results charted anywhere? I'm researching the topic and it seems knowing the number of things you're looking for is half the battle. I can find papers on methods that aim to find the number of clusters etc but are there any benchmarks to compare?

you are viewing a single comment's thread
view the rest of the comments
[–] visarga@alien.top 1 points 1 year ago

You already mentioned hierarchical methods but I got my best class-count agnostic clustering with fclusterdata from scipy:

labels = fclusterdata(data, t=threshold, criterion='distance')