I deployed Llama 2 (GGUF, running on CPU) as an Amazon ECS Fargate service
I just pushed my entire Docker build to ECR as an image and fired up my container.
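For anyone who wants to try the same thing, here is a rough sketch of what the Fargate side could look like in Python. It is not the setup from this post: the family name, container name, port, CPU/memory sizes, and the image URI placeholder are all assumptions for illustration. It only builds the task definition as a plain dict; the `register` helper passes it to boto3's `register_task_definition`, which needs boto3 installed and AWS credentials configured.

```python
import json


def build_task_definition(image_uri, execution_role_arn=None,
                          family="llama2-gguf-cpu",   # hypothetical name
                          cpu_units=4096,              # 4 vCPU (1024 units = 1 vCPU)
                          memory_mib=16384,            # 16 GiB, assumed to fit the GGUF file
                          port=8080):                  # assumed port of the inference server
    """Return a Fargate-compatible ECS task definition as a plain dict."""
    task_def = {
        "family": family,
        "requiresCompatibilities": ["FARGATE"],
        "networkMode": "awsvpc",  # Fargate tasks use awsvpc networking
        "cpu": str(cpu_units),    # task-level sizes are passed as strings
        "memory": str(memory_mib),
        "containerDefinitions": [
            {
                "name": "llama",  # hypothetical container name
                "image": image_uri,
                "essential": True,
                "portMappings": [{"containerPort": port, "protocol": "tcp"}],
            }
        ],
    }
    # A task execution role is what lets Fargate pull a private image from ECR.
    if execution_role_arn:
        task_def["executionRoleArn"] = execution_role_arn
    return task_def


def register(task_def):
    """Register the task definition with ECS (requires boto3 and AWS credentials)."""
    import boto3  # imported lazily so the builder above works without boto3
    return boto3.client("ecs").register_task_definition(**task_def)


if __name__ == "__main__":
    # Placeholder URI: substitute your own account, region, and repository.
    td = build_task_definition("<account-id>.dkr.ecr.<region>.amazonaws.com/<repo>:latest")
    print(json.dumps(td, indent=2))
```

The CPU and memory values have to be one of the combinations Fargate supports, and the memory needs to be large enough to hold the quantized model plus its context, so treat the defaults above as a starting guess rather than a recommendation.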
What about AWS Lambda? I tried it and it works. It is economical and fast.