Self-Hosted AI Agents: Why You Shouldn't Need an Enterprise Contract

The Self-Hosted Catch-22

You’ve decided you need to run AI agents on your own infrastructure. Maybe it’s compliance requirements. Maybe it’s data sovereignty. Maybe you just don’t want sensitive customer conversations flowing through third-party SaaS platforms.

So you evaluate the leading AI agent platforms. And you discover the catch:

Self-hosted deployment requires an Enterprise contract.

LangGraph Platform? Self-hosted is Enterprise tier only — requires contacting sales. Microsoft Agent Framework? Azure-first, on-prem is secondary. Most platforms treat self-hosted as a premium feature, not a default option.

This creates a frustrating situation: You can’t evaluate self-hosted capabilities without committing to a sales conversation. You can’t prototype on your infrastructure without negotiating a contract. And you definitely can’t move fast.

Who Actually Needs Self-Hosted?

Let’s be honest about the market reality: 76% of enterprises prefer to buy SaaS solutions rather than build. Most organizations solve data privacy concerns through contracts — BAAs, SOC 2 compliance, HIPAA certifications — rather than architecture.

But there’s a real ~10-20% of enterprises where self-hosted isn’t a preference — it’s a requirement:

Defense and Intelligence

Air-gapped networks. Security clearances. Classified data. These organizations can’t send data to external endpoints, period. Contractual compliance isn’t enough — architectural isolation is required.

Government (Certain Workloads)

Data sovereignty mandates. FedRAMP requirements. Some government workloads have explicit requirements about where data can be processed and stored.

Healthcare (Some Use Cases)

While many healthcare organizations use HIPAA-compliant SaaS (LiveKit is HIPAA compliant, for example), some use cases — especially those involving the most sensitive patient data or edge deployments — require on-premise processing.

Financial Services (High-Sensitivity)

Material non-public information. Trading signals. Some financial data is sensitive enough that even compliant SaaS creates unacceptable risk.

Legal (Privilege Concerns)

Attorney-client privilege considerations. Some law firms won’t send privileged communications through any external system, regardless of compliance certifications.

Internal AI Products

Companies building AI into their products — not just using AI for internal operations — often need to run on their own infrastructure for IP control, cost management, and architectural flexibility.

The Real Barriers to Self-Hosted AI

If you’re in the ~20% that genuinely needs self-hosted, you face several challenges that SaaS users don’t:

1. Infrastructure Complexity

Self-hosted AI agents need:

Container orchestration (Kubernetes)
Session state persistence (Redis, PostgreSQL)
Secret management
Networking and ingress
Monitoring and alerting

SaaS platforms abstract this away. Self-hosted means you own it.

2. Operational Expertise

Running AI agents in production requires understanding:

LLM-specific failure modes
Token management and cost optimization
Prompt versioning and rollback
Multi-provider failover

Most platform teams don’t have this expertise. Most AI teams don’t have operational experience.

3. Vendor Lock-In Risk

Self-hosted solutions from SaaS-first vendors often have subtle dependencies on their cloud services — telemetry that phones home, features that require cloud connections, updates that assume internet access.

4. Testing and Quality

SaaS platforms often bundle testing capabilities. Self-hosted deployments need separate tooling for load testing, evaluation, and quality assurance.

What Self-Hosted Should Look Like

Self-hosted AI agent infrastructure should be a first-class deployment option, not an afterthought. Here’s what that looks like:

Kubernetes-Native

If you’re running self-hosted, you’re probably running Kubernetes. The platform should work natively with K8s — Custom Resource Definitions, Helm charts, GitOps-compatible manifests.

# Deploy an agent with kubectl
apiVersion: omnia.altairalabs.ai/v1alpha1
kind: AgentRuntime
metadata:
  name: support-agent
spec:
  promptPackRef:
    name: support-prompts
  providerRef:
    name: claude-provider
  session:
    type: redis
    storeRef:
      name: redis-connection

No Phone-Home Requirements

Truly self-hosted means no telemetry sent to vendor servers, no license checks against external endpoints, no features that mysteriously stop working when the internet is unavailable.

Air-Gap Compatible

For the highest-security deployments, the platform should work in fully air-gapped environments — including Ollama or other locally-hosted LLMs when you can’t call external APIs.

Bring Your Own Everything

Your LLM provider: OpenAI, Anthropic, Google, or self-hosted models
Your session store: Redis, PostgreSQL, or in-memory for testing
Your monitoring stack: Prometheus, Grafana, or whatever you already use
Your auth system: OAuth, SAML, or your existing identity provider

Open Source Core

The deployment infrastructure should be open source — auditable, modifiable, and not subject to license changes. Enterprise features can be commercial, but the core should be open.

The TCO Reality Check

Self-hosted isn’t free. Even with open-source infrastructure, you’re paying in:

Infrastructure costs: Compute, storage, networking
Operational overhead: Someone has to keep it running
Talent requirements: You need people who understand both AI and operations

For many organizations, the SaaS premium is worth it to avoid these costs.

But for the organizations that genuinely need self-hosted — where compliance or data sovereignty are non-negotiable — the alternative isn’t “SaaS vs. self-hosted.” The alternative is “run AI agents on your infrastructure” or “don’t run AI agents at all.”

In that case, the question isn’t whether to pay the TCO. It’s whether self-hosted infrastructure exists that makes the TCO manageable.

The Path Forward

If you need self-hosted AI agent infrastructure:

1. Require Kubernetes-native. If a platform can’t deploy via kubectl apply and manage resources with standard K8s tooling, it’s not truly self-hosted — it’s SaaS with extra steps.

2. Verify air-gap compatibility. Can you run it without any external network calls? If not, understand exactly what external dependencies exist.

3. Confirm open-source availability. Can you audit the code? Run it without a license server? Fork it if the vendor disappears?

4. Evaluate framework support. Can you run LangChain, CrewAI, and custom agents? Or are you locked to one framework?

5. Check for testing integration. How will you validate agent quality? Is load testing and evaluation included, or do you need separate tooling?

Self-hosted AI infrastructure should be a deployment option, not a premium feature. You shouldn’t need an enterprise contract to run agents on your own computers.

Key Takeaways

Most platforms gate self-hosted behind Enterprise contracts — making evaluation difficult
~20% of enterprises genuinely need self-hosted due to compliance, data sovereignty, or architectural requirements
Self-hosted infrastructure should be Kubernetes-native, with no phone-home requirements
Open-source core protects against vendor lock-in and enables auditing
TCO is real — but for organizations that need self-hosted, the alternative is not running AI at all