Llama.cpp's example server supports batching and custom grammar.
Its a work in progress for Aphrodite: https://github.com/PygmalionAI/aphrodite-engine/issues/36#issuecomment-1747429134
Community to discuss about Llama, the family of large language models created by Meta AI.
Llama.cpp's example server supports batching and custom grammar.
Its a work in progress for Aphrodite: https://github.com/PygmalionAI/aphrodite-engine/issues/36#issuecomment-1747429134