this post was submitted on 18 Nov 2023
1 points (100.0% liked)

LocalLLaMA


Community to discuss Llama, the family of large language models created by Meta AI.


Well, the code quality has gotten pretty bad, so I think it's time to cancel my subscription to ChatGPT Plus. I have an RX 6600 and a GTX 1650 Super, so I don't think local models are a possible choice (at least for the same style of coding that is done with GPT-4). But I decided to post here anyway since you guys are very knowledgeable. I was looking at cursor.ai and it seemed pretty good. I don't know how good it is now since OpenAI has decreased the performance of GPT-4, but I have heard that the API is still OK. There is also refract, which could be a similar choice. What do you recommend? I code mostly in Python and sometimes in C++.

[–] thevukaslt@alien.top 1 points 11 months ago (2 children)

I've been using the playground since March. It's pay per usage, you can configure the model, and your data is not used for training. Some months I pay 30 bucks, some 5. It depends on the model and the amount of usage.

The interface is a bit worse as it's obviously not made for consumers but it's like the most obvious "cheat code" many don't realise.

Find here: https://platform.openai.com/playground?mode=chat&model=gpt-4-1106-preview
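
For a rough sense of what pay per usage means, here is a minimal sketch of the arithmetic. The per-token prices are assumptions (roughly the list prices for gpt-4-1106-preview around the time of this thread, $0.01 per 1K input tokens and $0.03 per 1K output tokens), so check the current pricing page before relying on them. The function name and the monthly usage figures are made up for illustration.

```python
# Assumed prices in USD per 1,000 tokens; these are NOT taken from the thread.
INPUT_PRICE_PER_1K = 0.01
OUTPUT_PRICE_PER_1K = 0.03


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for a single request."""
    return (
        input_tokens / 1000 * INPUT_PRICE_PER_1K
        + output_tokens / 1000 * OUTPUT_PRICE_PER_1K
    )


if __name__ == "__main__":
    # Made-up usage: 200 requests a month, ~1,500 tokens in and ~500 tokens out each.
    monthly = 200 * estimate_cost(1500, 500)
    print(f"Estimated monthly cost: ${monthly:.2f}")
```

Under these assumptions, output tokens cost three times as much as input tokens, so long generated answers dominate the bill more than long prompts do.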

[–] Igoory@alien.top 1 points 11 months ago

I second this. I paid 5 bucks a month ago and to this day I've only used $0.63!

[–] osures@alien.top 1 points 11 months ago

what do you think are the best settings for coding?

[–] Daemonix00@alien.top 1 points 11 months ago

Code quality gotten pretty bad?

Can you elaborate a bit? I'm also using it for py and cpp.

[–] Entire_Cheetah_7878@alien.top 1 points 11 months ago

I think that the key to using any LLM for writing code is to not overwhelm the model with too many asks or feed it too much code at once, even if the context window is large. Baby steps and EXTREMELY clear requirements for each one of those steps are absolutely critical to getting the most out of any of these models.

One hack I've found is to have the LLM (or yourself) comment your code very explicitly. When you feed it back, the improvement is substantial.
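
A minimal sketch of what that can look like in practice: one small step per prompt, explicit requirements, and a heavily commented snippet. The helper `build_step_prompt` and the example snippet are hypothetical, made up for illustration, and not tied to any particular model or API.

```python
def build_step_prompt(step: str, requirements: list[str], code: str) -> str:
    """Build a prompt that asks for exactly one change to a commented snippet."""
    lines = [f"Task (one step only): {step}", "Requirements:"]
    lines += [f"- {req}" for req in requirements]
    lines += ["Code:", code]
    return "\n".join(lines)


# Explicitly commented snippet to feed back to the model.
snippet = '''\
def mean(values):
    # values: non-empty list of floats
    # returns: the arithmetic mean as a float
    return sum(values) / len(values)
'''

prompt = build_step_prompt(
    step="Make mean() return 0.0 for an empty list",
    requirements=["Do not change the function signature", "Keep the existing comments"],
    code=snippet,
)
print(prompt)
```

The idea is that the next step gets its own prompt, built the same way from the updated code, instead of piling every request into one message.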

[–] sogerr@alien.top 1 points 11 months ago (1 children)
[–] Scizmz@alien.top 1 points 11 months ago

We know better.

[–] AfterAte@alien.top 1 points 11 months ago (3 children)

Take a look at Phind.com. They use the web to enhance their model's answers. That means that you can get up-to-date information on APIs instead of relying on data with a cutoff of 2021 or 2022. You can use their in-house Phind V8 model for free, but if you want to use GPT4, you get 10 tries a day. If you want more, they have paid plans. They recently announced that their free V8 model was as good as GPT4, but other people here have disagreed with them. I have never used GPT4, but their free Phind model was better than anything local we have.

[–] Flaky_Candidate7546@alien.top 1 points 11 months ago (1 children)

Well, the cutoff of the latest version of GPT4 is April 2023, so that's not really relevant now. But I have to admit that the Phind model is really impressive.

[–] AfterAte@alien.top 1 points 11 months ago

Nice, I didn't know that.

[–] ab2377@alien.top 1 points 11 months ago

didn't know about phind.com, great model and great website, thanks for sharing.

[–] Crafty-Run-6559@alien.top 1 points 11 months ago

Phind is really great in specific instances. As soon as I give it a bunch of code with any complexity in the requests, it falls apart and does much worse than ChatGPT.

[–] Scary-Knowledgable@alien.top 1 points 11 months ago (3 children)

Deepseek Coder 33B worked well for me. I asked it to make the game snake and it did it the first time with the 4bit GPTQ - https://huggingface.co/TheBloke/deepseek-coder-33B-instruct-GPTQ

Other models are available to run on CPU/GPU - https://huggingface.co/models?search=deepseek%2033b
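
As a rough sizing check for the 4-bit quantization mentioned above, here is a back-of-the-envelope sketch. It only counts the weights and assumes exactly 33e9 parameters; real quantized files carry extra metadata, and inference also needs memory for the context (KV cache) and activations, so treat the numbers as lower bounds. The function name is made up for illustration.

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    # Assumption: a "33B" model has about 33e9 parameters.
    for bits in (16, 8, 4):
        print(f"33B at {bits}-bit: ~{weight_memory_gb(33e9, bits):.1f} GB")
```

By this estimate the 4-bit weights alone come to roughly 16.5 GB, so the model needs a large GPU or splitting the layers between GPU and CPU memory.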

[–] kingp1ng@alien.top 1 points 11 months ago (2 children)

What UI do you use with Deepseek Coder? I need one that can format markdown text well.

[–] LyPreto@alien.top 1 points 11 months ago (1 children)
[–] FullOf_Bad_Ideas@alien.top 1 points 11 months ago
[–] Scary-Knowledgable@alien.top 1 points 11 months ago

I use oobabooga. I'm actually testing it with Language Agent Tree Search to see if it can produce better outputs - https://github.com/andyz245/LanguageAgentTreeSearch

[–] You_Wen_AzzHu@alien.top 1 points 11 months ago

Game Snake will be included in all models.

[–] AapoL092@alien.top 1 points 11 months ago

What specs? I didn't have enough memory to run it. I have an RX 6600 and a Ryzen 5 5600X, plus 16 GB of RAM.

[–] ab2377@alien.top 1 points 11 months ago (1 children)

also check https://coder.deepseek.com/chat, pretty large context and blazing fast.

[–] alx_www@alien.top 1 points 11 months ago

i’m sorry but why the fortnite skin😭😭😭

[–] yahma@alien.top 1 points 11 months ago (2 children)

What vscode extensions are you using with these local models?

[–] AapoL092@alien.top 1 points 11 months ago

I'm not using anything. I have so far just used the cursor.ai vscode fork which has a cool system for using GPT-4 or 3.5.

[–] YearZero@alien.top 1 points 11 months ago

I personally use one called Continue. It's amazing!

[–] RudiAlreadyTaken@alien.top 1 points 11 months ago (1 children)

Try Pieces, it is free. It has a coding assistant (you can choose between different models) that you can feed with websites, files and snippets. The assistant is specialized for coding and delivers really good results for me, and when given additional context through files or website content it follows the instructions astonishingly well.

It is much more than just the coding assistant though; it is a tool to quickly capture code snippets and leverage them. It also has integrations for the browser, VS Code, JetBrains and Obsidian.

Can only recommend it!

[–] tronathan@alien.top 1 points 11 months ago

Looks really nice - I watched the video demo and I can't say that my coding experience really calls for any of the things in the demo. Most of what I deal with is managing integration of a large set of data models. The actual coding is the easy part, figuring out what to code is the hard part.

[–] LoSboccacc@alien.top 1 points 11 months ago

Switch to the paid API, install VS Code and a GPT integration plugin. You can use the old full GPT-4 version, and you can also combine that with free services like CodeWhisperer for basic completion, so you only pay where you're getting the most value.

[–] erinorina@alien.top 1 points 11 months ago (1 children)

I use both phind.com and poe.com (Claude). They are nice for now since they are free.

[–] LyPreto@alien.top 1 points 11 months ago

claude is dogshit for code generation from my experience

[–] SpambotSwatter@alien.top 1 points 11 months ago

Hey, another bot replied to you; /u/thumbsdrivesmecrazy is a click-farming spam bot. Please downvote its comment and click the report button, selecting Spam then Link farming.

With enough reports, the reddit algorithm will suspend this spammer.


!^(If this message seems out of context, it may be because thumbsdrivesmecrazy is farming karma and may edit their comment soon with a link)!<