tgredditfc

joined 1 year ago
[–] tgredditfc@alien.top 1 points 11 months ago

Maybe. I haven’t done it yet, so I don’t know. You can Google around.

[–] tgredditfc@alien.top 1 points 11 months ago (2 children)

You can use the oobabooga API to do that. I haven’t done it myself, so I can’t say much about it.
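
It would look something like this, I think (a minimal sketch, untested; it assumes the web UI was started with --api and its OpenAI-compatible endpoint is on the default port 5000, and the URL and request fields are just examples):

```python
import requests

# Minimal sketch, untested: assumes text-generation-webui was launched with --api
# and its OpenAI-compatible endpoint is listening on the default port 5000.
# The URL, prompt, and parameters below are illustrative, not verified settings.
url = "http://127.0.0.1:5000/v1/chat/completions"

payload = {
    "messages": [{"role": "user", "content": "Hello, what model are you running?"}],
    "max_tokens": 200,
    "temperature": 0.7,
}

resp = requests.post(url, json=payload, timeout=120)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```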

[–] tgredditfc@alien.top 1 points 11 months ago (6 children)

You can start by reading Oobabooga’s wiki; I think it’s one of the most beginner-friendly tools. https://github.com/oobabooga/text-generation-webui/wiki/05-%E2%80%90-Training-Tab

 

I want to fine-tune some LLMs on my own dataset, which contains very long examples (a little over 2048 tokens). VRAM usage jumps up by several GB just from increasing the Cutoff Length from 512 to 1024.

Is there a way to feed those long examples into the models without increasing VRAM usage significantly?
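
The closest thing I’ve found is loading the base model in 4-bit and training a LoRA adapter with gradient checkpointing, which trades extra compute for activation memory as the cutoff length grows (just a sketch, not verified against the Training tab; the model id and LoRA settings are placeholders), but I’m wondering if there is something better:

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

# Sketch only: 4-bit base weights + LoRA + gradient checkpointing is a common way
# to limit VRAM growth when raising the sequence/cutoff length. Model id and LoRA
# hyperparameters below are placeholders, not verified settings.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-13b-hf",      # placeholder model id
    quantization_config=bnb_config,
    device_map="auto",
)
model.gradient_checkpointing_enable()  # recompute activations instead of storing them

lora_config = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```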

[–] tgredditfc@alien.top 1 points 11 months ago

If I can run them all, I’ll just pick the biggest one.

[–] tgredditfc@alien.top 1 points 11 months ago (2 children)

“Write the snake game using pygame”

[–] tgredditfc@alien.top 1 points 11 months ago

Thanks for sharing! I have been struggling with the llama.cpp loader and GGUF (using oobabooga and the same LLM model): no matter how I set the parameters or how many layers I offload to the GPUs, llama.cpp is far slower than ExLlama (v1 & v2), not just a bit slower but an order of magnitude slower. I really don’t know why.
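
For reference, this is roughly how those offload knobs map outside the web UI (a sketch via llama-cpp-python; the model path and split values are just examples). From what I understand, any layers that n_gpu_layers leaves on the CPU slow generation down drastically, so that’s the first thing I keep ruling out:

```python
from llama_cpp import Llama

# Sketch of the offload knobs the oobabooga llama.cpp loader exposes, called
# through llama-cpp-python directly. Path and split values are illustrative.
llm = Llama(
    model_path="./models/some-model.Q4_K_M.gguf",  # placeholder GGUF file
    n_gpu_layers=-1,        # -1 offloads every layer; fewer leaves layers on the CPU
    tensor_split=[12, 24],  # rough per-GPU share for a 12 GB + 24 GB pair
    n_ctx=4096,
)

out = llm("Q: Name one GGUF quantization type. A:", max_tokens=32)
print(out["choices"][0]["text"])
```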

[–] tgredditfc@alien.top 1 points 11 months ago (2 children)

In my experience it’s the fastest and llama.cpp is the slowest.

[–] tgredditfc@alien.top 1 points 11 months ago

Thank you! It looks very deep to me; I’ll look into it.

[–] tgredditfc@alien.top 1 points 11 months ago

Thanks! I have some problems loading GPTQ models with the Transformers loader.

[–] tgredditfc@alien.top 1 points 11 months ago

Thanks for sharing!

 

Currently I have 12 GB + 24 GB of VRAM, and I get out-of-memory errors all the time when trying to fine-tune 33B models. 13B is fine, but the results are not very good, so I would like to try 33B. I wonder if it’s worth replacing my 12 GB GPU with a 24 GB one. Thanks!
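
What I’m attempting is roughly this kind of sharded 4-bit load (a sketch; the model id and per-card caps are placeholders, not my exact values), with explicit limits so each card keeps some headroom for activations and optimizer state:

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# Sketch: shard a 4-bit 33B across an uneven 12 GB + 24 GB pair with explicit
# per-device caps, leaving headroom for activations / optimizer state.
# The model id and memory limits are placeholders, not verified values.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "some-org/some-33b-model",        # placeholder model id
    quantization_config=bnb_config,
    device_map="auto",
    max_memory={0: "10GiB", 1: "20GiB", "cpu": "32GiB"},
)
print(model.hf_device_map)  # shows which layers landed on which device
```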

[–] tgredditfc@alien.top 1 points 11 months ago (1 children)

I have 2 GPUs, and AWQ never works for me in Oobabooga; no matter how I split the VRAM, I get OOM in most cases.
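
The kind of split I’ve been trying looks roughly like this (a sketch going through Transformers with the autoawq package installed, not necessarily the exact path the Oobabooga loader takes; the repo id and memory caps are placeholders), and it still OOMs more often than not:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Sketch: load a pre-quantized AWQ checkpoint through Transformers (requires the
# autoawq package) with explicit per-card caps so neither card gets over-allocated.
# The repo id and limits are placeholders, not a verified working config.
model_id = "some-org/some-model-AWQ"   # placeholder AWQ repo

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",
    max_memory={0: "10GiB", 1: "22GiB"},   # leave headroom on both cards
)

inputs = tokenizer("Hello", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=16)[0]))
```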

 

In terms of AI use, especially LLMs.

$5,000 USD for the 128 GB RAM M3 MacBook Pro is still much cheaper than an A100 80 GB.
