this post was submitted on 23 Nov 2023
1 points (100.0% liked)

LocalLLaMA

1 readers
1 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 10 months ago
MODERATORS
 
  1. for coding
  2. for generating stories, writing email, poems etc.
  3. good overall
  4. etc.
you are viewing a single comment's thread
view the rest of the comments
[–] Illustrious-Lake2603@alien.top 1 points 10 months ago (12 children)

For Coding, DeepSeek coder 6.7b is exceptional

[–] Sufficient-Math3178@alien.top 1 points 10 months ago (5 children)

Models requiring remote code without any explanation are shady imo

[–] Knaledge@alien.top 1 points 10 months ago (2 children)
[–] Sufficient-Math3178@alien.top 1 points 10 months ago

AFAIK models used to be just plain code, when you load one, for example, it would do so by calling a method pickled inside the model file. Uploader could set up this method to do practically anything they want, and it doesn’t need to be obviously malicious since code runs just like a normal python script. For example, it could simply load/render a webp image that is designed to use the recent libwebp vulnerability.

They changed this a while back, so now you need to pass an argument when loading the model to allow this behavior, and this model requires it.

load more comments (1 replies)
load more comments (3 replies)
load more comments (9 replies)