**OpenAI Unveils AgentKit: A Comprehensive Platform for Crafting, Deploying, and Optimizing AI Agents**
OpenAI has introduced AgentKit, a cohesive platform that bundles a visual Agent Builder, an embeddable ChatKit UI, and expanded Evals into a single workflow for shipping production agents. The launch includes Agent Builder in beta and the rest generally available.
**Agent Builder (beta): A Visual Canvas for Complex Workflows**
Agent Builder, now in beta, offers a visual canvas for composing intricate, multi-step, multi-agent workflows using drag-and-drop nodes and connectors. Key features include:
– **Per-node guardrails**: Ensuring safety and policy enforcement at each step.
– **Preview runs**: Testing workflows before deployment.
– **Inline eval configuration**: Fine-tuning evaluation settings directly on the canvas.
– **Full versioning**: Tracking changes and iterations seamlessly.
Teams can start from templates or a blank canvas, with the Responses API powering execution. OpenAI highlights internal and customer usage, demonstrating how Agent Builder compresses iteration cycles when transitioning from prototype to production.
With Agent Builder, users can drag and drop nodes, connect tools, and publish their agentic workflows using ChatKit and the Agents SDK. OpenAI showcased an example of Albertsons using AgentKit to create an agent that improves ice cream sales by analyzing seasonality, historical trends, and external factors.
**Agents SDK: A Code-First Alternative**
For those preferring a code-first approach, the Agents SDK offers type-safe libraries in Node, Python, and Go. OpenAI positions the SDK as faster to integrate than manual prompt-and-tool orchestration, while sharing the same execution substrate (Responses API).
**ChatKit (GA): A Brand-Customizable Chat Interface**
ChatKit, now generally available, is a drop-in, brand-customizable chat interface for deploying agentic experiences on the web or in apps. It handles streaming, threads, and “thinking” UIs, with organizations using it for support and internal assistants.
**Built-in Tools and Connectors**
Agent workflows can call web search, file search, image generation, code interpreter, “computer use,” and external connectors, including Model Context Protocol (MCP) servers. This reduces glue code for common tasks.
**Connector Registry (beta): Centralized Admin Governance**
The Connector Registry, now in beta, offers centralized admin governance across ChatGPT and the API for data sources such as Dropbox, Google Drive, SharePoint, Microsoft Teams, and third-party MCPs. Rollout begins for customers with the Global Admin Console.
**Evals (GA) and Optimization**
New Evals capabilities include datasets, trace grading for end-to-end workflow assessment, automated prompt optimization, and third-party model evaluation. OpenAI emphasizes continuous measurement to raise task accuracy.
**Pricing and Availability**
ChatKit and the new Evals features are generally available, while Agent Builder is in beta. All are included under standard API model pricing, i.e., pay for model/compute usage rather than separate SKUs.
**Pieces of the Puzzle**
Design: Use Agent Builder to visually assemble agents and guardrails, or write agents with the Agents SDK against the Responses API. Deploy: Embed with ChatKit to deliver a production chat surface without building a frontend from scratch. Optimize: Instrument with Evals (datasets, trace grading, graders) and iterate prompts based on graded traces.
**Safety Considerations**
OpenAI’s launch materials pair Agent Builder with guardrails that can detect jailbreaks, mask/flag PII, and enforce policies at the node/tool boundary. Admins govern connections and data flows through the Connector Registry spanning both ChatGPT and the API.
**Our Assessment**
AgentKit is a consolidated stack that packages a visual Agent Builder for graph-based workflows, an embeddable ChatKit UI, and an Agents SDK sitting on top of the Responses API. This reduces bespoke orchestration and frontend work while keeping evaluation in-loop via datasets and trace grading. The value lies in operational aspects such as versioned node graphs, built-in tools, connector governance, and standardized eval hooks, which previously required custom infrastructure.
In conclusion, OpenAI’s AgentKit is a powerful, visual-first stack for building, deploying, and evaluating AI agents. By offering a cohesive platform with Agent Builder, ChatKit, and expanded Evals, OpenAI simplifies the process of creating and optimizing complex agentic workflows.