SelfHostLLM is a tool that helps developers calculate GPU memory requirements and maximum concurrent requests for self-hosted large language model inference. It supports popular models like Llama, Qwen, DeepSeek, and Mistral, letting users plan AI infrastructure around their own hardware and model configurations.
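The kind of estimate such a calculator produces can be sketched with a common back-of-envelope formula: weight memory is parameter count times bytes per parameter, and each in-flight request additionally holds a KV cache proportional to its context length. This is an illustrative approximation, not necessarily SelfHostLLM's exact method; the function name and parameters below are hypothetical.

```python
def estimate_capacity(model_params_b, vram_gb, context_len,
                      n_layers, n_kv_heads, head_dim, bytes_per_param=2):
    """Rough VRAM budget for transformer inference (fp16/bf16 by default)."""
    # Weight memory: parameter count (in billions) x bytes per parameter.
    weights_gb = model_params_b * bytes_per_param
    # KV cache per token: K and V vectors for every layer and KV head.
    kv_bytes_per_token = 2 * n_layers * n_kv_heads * head_dim * bytes_per_param
    kv_per_request_gb = kv_bytes_per_token * context_len / 1e9
    # Whatever VRAM is left after weights is divided among request KV caches.
    free_gb = vram_gb - weights_gb
    max_requests = max(0, int(free_gb // kv_per_request_gb))
    return weights_gb, kv_per_request_gb, max_requests

# Example: a Llama-2-7B-style model (32 layers, 32 KV heads, head dim 128)
# serving 4096-token contexts on a single 24 GB GPU.
weights, kv_req, n = estimate_capacity(7, 24, 4096, 32, 32, 128)
print(f"weights ~ {weights:.0f} GB, KV cache/request ~ {kv_req:.1f} GB, "
      f"max concurrent requests ~ {n}")
```

Real capacity is lower in practice (activation memory, framework overhead, fragmentation), which is why a dedicated calculator with per-model presets is useful.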