Arcee AI's Trinity Large-Thinking reasoning model is now available in Public Preview on DigitalOcean's Agentic Inference Cloud via Serverless Inference. The model ranks #2 on PinchBench for agentic tasks at approximately $0.90/M output tokens, making it significantly cheaper than top-ranked alternatives. It supports extended reasoning, multi-turn tool use, and long-running agentic workloads. Developers can query it immediately via API or console without provisioning infrastructure, and the Apache 2.0 licensed weights are available on Hugging Face for self-hosting or fine-tuning.
Table of contents
Why this model, why nowBuilt for real-world agent workloads on DigitalOceanA new phase of AI infrastructureGet started in secondsSort: