higgsfield_ai


https://higgsfield.ai/chat

Hey /r/MachineLearning, Higgsfield AI here

A few days ago, we built an easy-to-use platform for everyone in the community to finetune models. Many of you uploaded datasets, and they are waiting in the queue for training.

We received a lot of feedback, and many of you reached out, wanting the opportunity to try out the models.

We are happy to announce that we have made a chat interface for you to do just that.

Let us know what you think.

Shout out to /u/WolframRavenwolf for his efforts in comparing LLMs.

His post inspired the list of models we currently support, and we will extend it soon.

  • HuggingFaceH4/zephyr-7b-beta
  • teknium/OpenHermes-2-Mistral-7B
  • jondurbin/airoboros-m-7b-3.1.2
  • ehartford/dolphin-2.1-mistral-7b
  • migtissera/SynthIA-7B-v1.3
  • mistralai/Mistral-7B-Instruct-v0.1
  • migtissera/SynthIA-7B-v2.0
  • teknium/CollectiveCognition-v1.1-Mistral-7B
  • ehartford/dolphin-2.2-yi-34b
  • NurtureAI/openchat_3.5-16k

Stay fine-tuned for future updates :)

[–] higgsfield_ai@alien.top 1 points 10 months ago

Hey there. Good catch, we hadn't realized a newer version was available. Will update soon!

[–] higgsfield_ai@alien.top 1 points 10 months ago

From our experience, to get very good results you need:

  1. A high-quality dataset. It's worth spending more time on data cleaning: a smaller dataset of high-quality examples is way better than a huge dataset full of garbage (a minimal cleaning sketch follows this list).

  2. A full fine-tune of the model.
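To make point 1 concrete, here is a minimal sketch of such a cleaning pass (exact-duplicate removal plus a length filter) in Python. The file name and the instruction/response field names are hypothetical; adapt them to your own schema.

    # Minimal cleaning pass over an instruction/response JSONL file:
    # drop near-empty rows and exact duplicates. File and field names
    # are hypothetical placeholders.
    import json

    total, seen, cleaned = 0, set(), []
    with open("data.jsonl") as f:
        for line in f:
            total += 1
            row = json.loads(line)
            text = (row.get("instruction", "") + "\n" + row.get("response", "")).strip()
            if len(text) < 32 or text in seen:  # too short or a duplicate
                continue
            seen.add(text)
            cleaned.append(row)

    with open("data.clean.jsonl", "w") as f:
        for row in cleaned:
            f.write(json.dumps(row, ensure_ascii=False) + "\n")

    print(f"kept {len(cleaned)} of {total} rows")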

[–] higgsfield_ai@alien.top 1 points 10 months ago (3 children)

We only do full fine-tunes.

[–] higgsfield_ai@alien.top 1 points 10 months ago

We support only large models (starting from 7B).

 

https://higgsfield.ai
We have a massive GPU cluster and have developed our own infrastructure to manage it and train very large models.

Here's how it works:

  1. You upload a dataset in the preconfigured format to HuggingFace [1] (see the sketch after this list).
  2. Choose your LLM (e.g. LLaMA 70B, Mistral 7B).
  3. Place your submission into the queue.
  4. Wait for it to get trained.
  5. You get your trained model back on HuggingFace.
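As a rough sketch of step 1, here is how a dataset could be pushed to the HuggingFace Hub with the `datasets` library. The repository id and the instruction/response fields below are placeholders; the exact format we expect is described in the tutorial [1].

    # Sketch of step 1: push a dataset to the HuggingFace Hub.
    from datasets import Dataset

    # Placeholder rows -- use the schema from the tutorial [1].
    rows = [
        {"instruction": "Summarize the following text.", "response": "..."},
        {"instruction": "Translate to French: good morning.", "response": "Bonjour."},
    ]

    ds = Dataset.from_list(rows)
    # Requires `huggingface-cli login`; the repo id is a placeholder.
    ds.push_to_hub("your-username/my-finetune-dataset")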

Essentially, why would we want to do this?

  1. We already have experience training big LLMs.
  2. We can achieve near-perfect infrastructure performance for training.
  3. Sometimes our GPUs simply have nothing to train.

Thus we thought it would be cool if we could utilize our GPU cluster at 100% and give back to the open-source community (we have already built an e2e distributed training framework [2]).

This is in an early stage, so you can expect some bugs.

Any thoughts, opinions, or ideas are quite welcome!

[1]: https://github.com/higgsfield-ai/higgsfield/blob/main/tutori...

[2]: https://github.com/higgsfield-ai/higgsfield