metalman123

joined 1 year ago
[–] metalman123@alien.top 1 points 11 months ago

Most models are going to require some heavy prompting to get them even close to the prose you're looking for.

I've heard good things about Nous-Capybara-34B and Goliath 120b

[–] metalman123@alien.top 1 points 11 months ago

Was wondering how long this would take to show up.

[–] metalman123@alien.top 1 points 11 months ago

Do you plan to fine-tune the 7b as well?

[–] metalman123@alien.top 1 points 11 months ago

This style of captioning could be amazing for text-to-image datasets, and I wouldn't be surprised to see them take a jump in quality as well.

[–] metalman123@alien.top 1 points 11 months ago (1 children)

Goliath 120b's writing really is impressive, up there with Claude and GPT-4. Toppy is also pretty good for a 7b.

[–] metalman123@alien.top 1 points 11 months ago

They have a premium 70b model that they have shown investors and enterprise customers.

They will likely open-source everything except the strongest model they have, similar to what Phind does.

I just hope we can still access the 70b model through the Azure API.

[–] metalman123@alien.top 1 points 11 months ago

We learned that merging models absolutely works and that the Yi 34b model appears to be the real deal.

(Maybe we should merge some Yi fine-tunes in the future.)

[–] metalman123@alien.top 1 points 11 months ago

Intel has entered the game. Things are getting interesting.

If we ever get access to a Mistral or Yi 70b± model, I think a lot of companies are going to be in trouble with their current models.

[–] metalman123@alien.top 1 points 1 year ago

Can't wait to see the benchmarks on these things.

[–] metalman123@alien.top 1 points 1 year ago

Really want to see the numbers with Mistral instead of Llama 7b.

[–] metalman123@alien.top 1 points 1 year ago (1 children)

This looks promising. I'd like to see a similar setup on the new Mistral models if they ever release them.

[–] metalman123@alien.top 1 points 1 year ago (1 children)

When do you expect to have benchmarks?
