Overview of Deploying Agents in OCI Generative AI
You can deploy agents by using OCI Generative AI Applications, which provide a managed runtime for containerized agent workloads.
To deploy an agent, package it as a container image, upload it to Oracle Cloud Infrastructure Registry (OCIR), and deploy it by using the OCI Console, API, or CLI.
During deployment, configure:
- Scaling
- Storage
- Networking
- Authentication
After deployment, the service provisions an endpoint (for example, an HTTP URL) that clients or other agents can use to invoke the agent.
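As a rough illustration of what invoking the agent might look like from a client, the following Python sketch builds and sends a JSON POST request using only the standard library. The endpoint URL, the {"input": ...} payload shape, and the bearer-token header are assumptions made for this example; use the URL the service shows for your application and the request format and authentication scheme that your agent actually expects.

```python
import json
import urllib.request
from typing import Optional

# Hypothetical endpoint URL; the real one is provided after provisioning.
ENDPOINT_URL = "https://example.invalid/agent/invoke"


def build_invoke_request(
    url: str, prompt: str, token: Optional[str] = None
) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for a deployed agent."""
    # Assumption: the agent accepts a JSON body of the form {"input": "<text>"}.
    body = json.dumps({"input": prompt}).encode("utf-8")
    headers = {"Content-Type": "application/json"}
    if token is not None:
        # Assumption: bearer-token auth; substitute the configured scheme.
        headers["Authorization"] = f"Bearer {token}"
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


def invoke(url: str, prompt: str, token: Optional[str] = None,
           timeout: float = 30.0) -> dict:
    """Send the request and decode the JSON response body."""
    request = build_invoke_request(url, prompt, token)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Keeping request construction separate from sending makes it easy to inspect or log the request before any network call is made.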
How It Works
After developing an agent locally (for example, by using LangGraph or similar frameworks), you create a Generative AI application to define the runtime configuration.
You then create a deployment by selecting a container image. After the deployment is provisioned, the application endpoint becomes available for invoking the agent, and the active deployment serves the requests sent to that endpoint.
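The relationship between an application, its deployments, and the endpoint can be pictured with a small conceptual model. This is not the OCI SDK or API: the class names, fields, and methods below are hypothetical and exist only to illustrate the sequence described above (create the application, create a deployment from an image, wait for provisioning, then use the endpoint).

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Deployment:
    """Conceptual stand-in for a deployment created from a container image."""
    image: str
    provisioned: bool = False


@dataclass
class Application:
    """Conceptual stand-in for an application holding runtime configuration."""
    name: str
    deployments: List[Deployment] = field(default_factory=list)
    active: Optional[Deployment] = None

    def create_deployment(self, image: str) -> Deployment:
        # A new deployment starts out unprovisioned and is not yet active.
        deployment = Deployment(image=image)
        self.deployments.append(deployment)
        return deployment

    def mark_provisioned(self, deployment: Deployment) -> None:
        # In this toy model, a provisioned deployment becomes the active one.
        deployment.provisioned = True
        self.active = deployment

    def endpoint_available(self) -> bool:
        # The endpoint can serve requests only once an active,
        # provisioned deployment exists.
        return self.active is not None and self.active.provisioned
```

In this model, endpoint_available() stays False until a deployment has been marked as provisioned, which mirrors the ordering in the text.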
Walkthrough
Use Generative AI Applications to deploy agents as managed containerized applications in OCI Generative AI.
With Generative AI Applications, you build a container image for the agent, upload it to Oracle Cloud Infrastructure Registry (OCIR), and deploy that image as a Generative AI application by using the OCI Console, API, or CLI.
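Before you upload an image, you tag it with a full registry reference. The following Python sketch assembles such a reference, assuming the commonly documented OCIR layout of registry domain, tenancy namespace, repository name, and tag. The function name and the example values are illustrative; confirm the registry domain and namespace for your own region and tenancy.

```python
def image_reference(registry_domain: str, namespace: str,
                    repository: str, tag: str) -> str:
    """Assemble '<registry-domain>/<namespace>/<repository>:<tag>'."""
    parts = {
        "registry_domain": registry_domain,
        "namespace": namespace,
        "repository": repository,
        "tag": tag,
    }
    for name, value in parts.items():
        # Reject empty values and embedded whitespace early, since either
        # would produce a reference that container tooling cannot parse.
        if not value or any(ch.isspace() for ch in value):
            raise ValueError(f"invalid {name}: {value!r}")
    return f"{registry_domain}/{namespace}/{repository}:{tag}"


# Example values are placeholders, not a real tenancy or repository.
reference = image_reference("iad.ocir.io", "mytenancy", "agents/demo", "1.0")
```

The resulting string is what you would pass to your container tooling when tagging and pushing the image.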
When you deploy an agent, you can configure how the application runs and how clients access it, including:
- Scaling
- Storage
- Networking
- Authentication
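One way to keep these four settings together before entering them in the Console, API, or CLI is a small, validated configuration object. The sketch below is purely illustrative: every field name and default is hypothetical and does not correspond to actual OCI Generative AI parameters.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AgentRuntimeConfig:
    """Hypothetical grouping of the four configuration areas."""
    min_replicas: int = 1          # scaling (hypothetical field)
    max_replicas: int = 1          # scaling (hypothetical field)
    storage_gb: int = 0            # storage (hypothetical field)
    private_network: bool = False  # networking (hypothetical field)
    require_auth: bool = True      # authentication (hypothetical field)

    def __post_init__(self) -> None:
        # Basic sanity checks so mistakes surface before deployment.
        if self.min_replicas < 1:
            raise ValueError("min_replicas must be at least 1")
        if self.max_replicas < self.min_replicas:
            raise ValueError("max_replicas must be >= min_replicas")
        if self.storage_gb < 0:
            raise ValueError("storage_gb must not be negative")


config = AgentRuntimeConfig(min_replicas=1, max_replicas=3, storage_gb=10)
```

Validating the values in one place makes inconsistent settings, such as a maximum replica count below the minimum, fail early rather than at deployment time.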
After the deployment is provisioned, OCI Generative AI provides an endpoint, such as an HTTP URL, that clients can use to invoke the deployed agent.
Deploying an agent this way is useful when you want a managed runtime for a containerized agent application, with OCI handling the deployment configuration and endpoint provisioning.
For more information, see the topics about Generative AI Applications and deploying containerized agent applications.