this post was submitted on 28 Nov 2023
Machine Learning

Hi everybody!

Inspired by a recent thread mentioning the insane abilities of Goliath, I decided to merge four SFT Yi models to make two separate 55B Yi models, one with 200K context and one with 32K.

Try them out and let me know!

top 3 comments
[–] PierroZ-PLKG@alien.top 1 points 1 year ago

What are the eval results?

[–] BalorNG@alien.top 1 points 1 year ago

Did you do any post-merge retraining? Without at least some, the results are going to be poor...

[–] _RADIANTSUN_@alien.top 1 points 1 year ago

Very cool. How did you merge them?