In this tutorial, we show how to access and use the new Serverless Inference feature from DigitalOcean’s GenAI Platform.

DigitalOcean Community's platform is a central hub for developers and sysadmins using DigitalOcean's cloud infrastructure, offering insights into cloud computing, DevOps practices, and open-source technologies. Through tutorials, Q&A, and community forums, DO_Community offers insights into deploying and managing applications on DigitalOcean's cloud platform. Developers can learn about Linux server administration, containerization, and automation tools to build and scale applications in the cloud.

DigitalOcean Community

DigitalOcean's GenAI Platform now offers Serverless Inference, allowing developers to access AI models without managing infrastructure. The tutorial covers creating API keys through both console and API methods, then demonstrates how to use Python with the OpenAI client library to query models like Llama3-8B. This serverless approach eliminates deployment headaches while providing access to NVIDIA GPU-powered inference for various AI applications.

Serverless Inference with the DigitalOcean GenAI Platform

Step 2A: Create a Model Access key with the API

Step 2B: Create a Model Access key with the Cloud Console

Step 3: Generating Text with Python and Serverless Inference