Clarifai and Vultr announced benchmark results at NVIDIA GTC in Washington, D.C., showcasing their collaboration on GPU-based AI inference. The Clarifai Reasoning Engine, which is optimized for agentic AI inference, was benchmarked on Vultr’s large-scale dedicated GPU clusters. In independent tests, the engine processed 544 tokens per second with a low time to first token and industry-leading cost efficiency, surpassing other GPU-based platforms. The results are part of Clarifai’s 11.9 release, which adds capabilities for advanced AI systems, including expanded support for Vultr cloud instances and compatibility with various toolkits. Designed for enterprise-scale workloads, the Clarifai Reasoning Engine continuously optimizes performance without sacrificing accuracy. According to the companies, the partnership sets a new standard for AI inference and enables faster innovation in reasoning and generative AI.

