Did you forget to unset the rope settings?
CodeLlama requires different RoPE settings (a much larger base frequency) than regular Llama.
Also check your sampler settings.
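If it helps, here's roughly what I mean as a sketch with the llama-cpp-python bindings (an assumption on my part; if you're on the plain CLI the equivalent knobs are --rope-freq-base and --rope-freq-scale, and the model path below is just a placeholder):

```python
# Rough sketch, assuming the llama-cpp-python bindings; the GGUF path is a placeholder.
from llama_cpp import Llama

llm = Llama(
    model_path="codellama-7b.Q4_K_M.gguf",  # placeholder model file
    n_ctx=4096,
    rope_freq_base=1_000_000.0,  # CodeLlama's RoPE base; regular Llama 2 uses 10_000
    rope_freq_scale=1.0,         # no linear context scaling
)

out = llm(
    "def fibonacci(n):",
    max_tokens=128,
    temperature=0.2,  # sampler settings: code models usually behave better at low temperature
    top_p=0.95,
)
print(out["choices"][0]["text"])
```

Newer builds pick the right base up from the model file's metadata, but if you override it (or an old build defaults to 10,000) the output degrades fast.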
No, I didn't even know RoPE was a thing; I'm reading about it now... if you have any tl;dr, please post it. This stuff seems pretty complicated.
I was loading the model with a llama.cpp invocation and didn't know about RoPE. What would change if I left the default values in place?
worked great for me
Snowflake has a very nice comparison of the two:
Fine-Tuning Improves the Performance of Meta’s Code Llama on SQL Code Generation
The answer is that you need more fine-tuning.
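If anyone wants the general shape of that, here's a rough LoRA sketch with the Hugging Face stack (transformers + peft + datasets). To be clear, this is not the recipe from the Snowflake post; the model name, the sql_pairs.jsonl file, and all hyperparameters are made-up placeholders:

```python
# Rough LoRA fine-tuning sketch; NOT the Snowflake recipe, everything below is a placeholder.
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)
from peft import LoraConfig, get_peft_model
from datasets import load_dataset

base = "codellama/CodeLlama-7b-hf"  # placeholder: 7B base checkpoint
tok = AutoTokenizer.from_pretrained(base)
tok.pad_token = tok.eos_token
model = AutoModelForCausalLM.from_pretrained(base)

# Attach small LoRA adapters instead of updating all of the base weights.
model = get_peft_model(model, LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM",
))

# Placeholder dataset: a local JSONL file with "question" and "sql" fields.
ds = load_dataset("json", data_files="sql_pairs.jsonl")["train"]
ds = ds.map(
    lambda ex: tok(ex["question"] + "\n" + ex["sql"], truncation=True, max_length=512),
    remove_columns=ds.column_names,
)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="codellama-sql-lora",
        per_device_train_batch_size=1,
        gradient_accumulation_steps=8,
        num_train_epochs=1,
        learning_rate=2e-4,
        logging_steps=10,
    ),
    train_dataset=ds,
    data_collator=DataCollatorForLanguageModeling(tok, mlm=False),  # copies input_ids into labels
)
trainer.train()
model.save_pretrained("codellama-sql-lora")  # saves the adapter weights only
```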