this post was submitted on 27 Nov 2023
1 points (100.0% liked)

LocalLLaMA

3 readers
1 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 1 year ago
MODERATORS
 

Recently came across this AI Safety test report from LinkedIn: https://airtable.com/app8zluNDCNogk4Ld/shrYRW3r0gL4DgMuW/tblpLubmd8cFsbmp5

From this report it seems Llama 2 (7B version?) lacks some safety checks compared to OpenAI models. Same with Mistral. Did anyone find the same result? Has it been a concern for you?

you are viewing a single comment's thread
view the rest of the comments
[–] phree_radical@alien.top 1 points 11 months ago (1 children)

??

It's comparing base models (which are not trained to follow or refuse instructions) against instruction-tuned ones (OpenAI)

[–] CookieCat171@alien.top 1 points 11 months ago (1 children)
[–] phree_radical@alien.top 1 points 11 months ago

Looks like you've now made some changes. Columns now read "Llama2-7b-chat" instead of "llama2." Also, chat responses below the completions, chastising the inappropriate messages. However, a completion was generated, first, and the item is still marked as "fail." Very poor show