LocalLLaMA

4 readers

4 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 2 years ago

MODERATORS

communick@poweruser.forum

We are Higgsfield AI. We have a large GPU cluster and want to finetune your dataset. (alien.top)

submitted 2 years ago by RiskApprehensive9770@alien.top to c/localllama@poweruser.forum

14 comments fedilink hide all child comments

Hey LocalLLaMA. It's Higgsfield AI, and we train huge foundational models.

We have a massive GPU cluster and developed our own infrastructure to manage the cluster and train massive models. We constantly lurked in this subreddit and learned a lot from this passionate community. Right now, we have spare GPUs, and we are excited to give back to this incredible community.

We built a simple web app where you can upload your datasets to finetune it. https://higgsfield.ai/

There's how it works:

You upload the dataset with preconfigured format into HuggingFaсe [1].
Choose your LLM (e.g. LLaMa 70B, Mistral 7B)
Place your submission into the queue
Wait for it to get trained.
Then you get your trained model there on HuggingFace.

[1]: https://github.com/higgsfield-ai/higgsfield/tree/main/tutorials

top 14 comments

sorted by: hot top controversial new old

[–] fadenb@alien.top 1 points 2 years ago

What do you consider "large" cluster? How many MW GPU capacity do you operate?

[–] herozorro@alien.top 1 points 2 years ago (1 children)

please do something like this, or provide detailed example, on how an open source framework api can be added to a coder LLM.

how do we prepare the data with code sample, docs, so the coder LLM learns it can can do code completions and answer documentation?

[–] RiskApprehensive9770@alien.top 1 points 2 years ago (1 children)

You can train on any dataset as long as it follows our format.

Soon we'll publish a video tutorial.

[–] herozorro@alien.top 1 points 2 years ago

but what would be the proper formatting example for code? just paste in a bunch of files from a repo? or should be more a cheatsheet format?

[–] mcmoose1900@alien.top 1 points 2 years ago

Well, awesome. Thanks.

I'll be over here assembling some TV show transcripts for a fandom tune.

Out of curiosity, is it a full finetune or a LoRA? What context length?

[–] tortistic_turtle@alien.top 1 points 2 years ago

cool stuff

[–] fctvzm@alien.top 1 points 2 years ago

👏

[–] MaxSan@alien.top 1 points 2 years ago

Just curious why you don't plug it into bittensor? I mean, you get best of both worlds then?

[–] dahara111@alien.top 1 points 2 years ago

Registered. I am very interested and grateful to use it, but I haven't uploaded the dataset to huggingface, so I can't use it yet.

And I don't understand the new learning paradigm that is done just by registering the model and dataset.

What is it that is running behind the scenes?
A very simple snippet OR code would be helpful to understand.

For example

If I give you a model and a dataset, the code will run something like this, and under what conditions will the training be finished.

[–] nomusichere@alien.top 1 points 2 years ago

I don't want to be the party pooper. But this site doesn't seem legit. Have any of you got any info other than provided?

[–] Terrible-Mongoose-84@alien.top 1 points 2 years ago

Hi, is it possible to learn a RAW file? I have about 2gb of artistic text marked up with tags and titles, is it possible to train Mistral on them?

[–] kristaller486@alien.top 1 points 2 years ago

This is awesome! What training parameters are you using?

[–] Scary-Knowledgable@alien.top 1 points 2 years ago

If you collapse does that end the universe????

[–] Grammar-Warden@alien.top 1 points 2 years ago

Spidey-Senses tingling: Caution