this post was submitted on 28 Nov 2023
1 points (100.0% liked)

Machine Learning

1 readers
1 users here now

Community Rules:

founded 1 year ago
MODERATORS
 

Hi everybody!

Inspired by a recent thread, mentioning the insane goliath abilities I decided to merge four SFT Yi models to make 2 seperate 55B Yi, one with context 200K and one with 32K.

Try them out and let me know!

you are viewing a single comment's thread
view the rest of the comments
[–] BalorNG@alien.top 1 points 1 year ago

Did you do post-merge retraining? Without at least some results are going to be poor...