Enterprise AI Models in OCI Generative AI

Use Enterprise AI Models in OCI Generative AI to access pretrained hosted models, import supported custom models, and deploy models for enterprise inference workloads.

This section provides links to the models available in OCI Generative AI and to the resources used to deploy, manage, and scale those models in OCI.

What You Can Do with Enterprise AI Models

Use Enterprise AI Models when you want to:

  • Run inference with pretrained hosted models
  • Import and host supported custom models
  • Choose between on-demand and dedicated serving modes
  • Deploy models on dedicated AI clusters for production workloads
  • Manage endpoints and private network access
  • Review model and regional availability
  • Understand pricing and performance considerations

OCI Generative AI supports core model tasks such as:

  • Chat for conversational generation
  • Embeddings for semantic search, recommendation, classification, and clustering
  • Rerank for ordering documents by relevance to a query
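The embeddings and rerank tasks above both come down to scoring documents against a query. As a minimal, service-agnostic sketch (the toy vectors and function names below are illustrative, not the OCI Generative AI API), embedding vectors can be compared with cosine similarity and documents ordered by that score:

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def rerank(query_embedding, documents):
    """Order (doc_id, embedding) pairs by relevance to the query embedding."""
    scored = [(doc_id, cosine_similarity(query_embedding, emb))
              for doc_id, emb in documents]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)

# Toy embeddings; a real workload would obtain these from an embedding model.
query = [1.0, 0.0, 0.5]
docs = [("doc-a", [0.9, 0.1, 0.4]), ("doc-b", [0.0, 1.0, 0.0])]
ranking = rerank(query, docs)  # doc-a ranks first for this query
```

The same pattern underlies semantic search, recommendation, and clustering: everything reduces to distances between embedding vectors.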

Model Usage Options

OCI Generative AI supports multiple ways to use models:

  • Pretrained hosted models for managed inference through OCI
  • Imported models for supported custom model deployment
  • On-demand mode for pay-per-use access on shared, OCI-managed infrastructure
  • Dedicated mode for isolated model serving on dedicated AI clusters

These options let you move from experimentation to production while selecting the level of control, performance isolation, and infrastructure management that fits your workload.
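The choice between serving modes is typically expressed in the inference request itself. The following sketch is illustrative only: the `InferenceTarget` class, its field names, and the placeholder identifiers are hypothetical and do not reflect the OCI SDK's actual request types.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class InferenceTarget:
    """Hypothetical request target; not the OCI SDK API."""
    serving_mode: str                    # "ON_DEMAND" or "DEDICATED"
    model_id: Optional[str] = None       # set for on-demand requests
    endpoint_id: Optional[str] = None    # set for dedicated requests

def choose_target(needs_isolation: bool) -> InferenceTarget:
    """Pick dedicated serving for isolated production traffic, on-demand otherwise."""
    if needs_isolation:
        # Dedicated mode routes to an endpoint backed by a dedicated AI cluster.
        return InferenceTarget(serving_mode="DEDICATED",
                               endpoint_id="example-dedicated-endpoint-id")
    # On-demand mode addresses a pretrained hosted model on shared infrastructure.
    return InferenceTarget(serving_mode="ON_DEMAND",
                           model_id="example-pretrained-chat-model")

target = choose_target(needs_isolation=True)
```

A workload might start with on-demand requests during experimentation and switch to a dedicated endpoint once it needs predictable throughput and performance isolation.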

Model Infrastructure and Management

Enterprise AI Models in OCI Generative AI are supported by deployment and management resources such as:

  • Dedicated AI Clusters for isolated model hosting
  • Endpoints for serving model traffic
  • Private Endpoints for secure network access
  • Regional model availability for deployment planning
  • Performance and cost guidance for production workloads

Topics in This Section

Use the following topics to learn about Enterprise AI Models in OCI Generative AI: