Make your llama generation time fly with AWS Inferentia2 | Pasteblog