this post was submitted on 20 Nov 2023
1 points (100.0% liked)

LocalLLaMA

4 readers
4 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 2 years ago
MODERATORS
 

I've been experimenting with Zephyr and pretty surprised by the great performance. One problem I have with Zephyr is the difficulty to build structured outputs. For example if I ask it to return only True or False, it will return a lengthy explanation. This makes it tough to use Zephyr as part of a production system, since incorrectly structured outputs have huge implications.

Has anyone found some tricks to make Zephyr produce outputs in a defined format and do so deterministically?

top 4 comments
sorted by: hot top controversial new old
[–] Combinatorilliance@alien.top 1 points 2 years ago

There are all sorts of approaches

  • microsoft guidance
  • llama.cpp grammar constraints
  • someone recently made their own approach and posted it here, search for CAPPr
[–] AtticusAtom@alien.top 1 points 2 years ago

The correct answer here is reading logprobs/logits over the true/false tokens after parsing the input.

Using a grammar is overkill and a classification model might not get you the fidelity you want.

[–] InterestingBasil@alien.top 1 points 2 years ago

Fine tune of course

[–] DarthNebo@alien.top 1 points 2 years ago

I reverted to mistral-instruct