The cost of self-hosting the Llama-3 8B-Instruct model is about $17 per 1M tokens when using EKS, compared to $1 per 1M tokens with ChatGPT. Self-hosting the hardware can reduce the cost to less than $0.01 per 1M tokens, but it takes about 5.5 years to break even.

6m read timeFrom blog.lytix.co
Post cover image
5 Comments

Sort: