AI Agent Governance (MCP)

From shadow MCP to governed AI agents.

Tap the shadow side, or any layer of the Cloudflare control plane.

Layer 2 · Catalog

MCP Server Portal

One Access-protected URL fronts every approved MCP server. Access policy sets which servers each user sees, admins curate the tool surface with aliases and allowlists, and Code Mode collapses large catalogs into a search/execute interface instead of loading every tool schema.

One URL, many approved servers
Per-user server visibility
Curated tools & aliases
Code Mode for large catalogs

Plan your MCP governance program See the reference architecture

On this page

Who this is for

Security teams evaluating risk from AI agents reaching corporate systems.

Platform and developer-experience teams supporting employees using MCP-capable AI clients.

Compliance leaders concerned about software provenance, audit trails, and unauthorised data access via agents.

Organizations standardizing on Cloudflare Zero Trust as the access control plane for both humans and AI.

Why ungoverned AI agents are a problem

Shadow MCP

Local MCP servers run on individual employee laptops, invisible to security teams and outside any centralized control or audit.

Software provenance gaps

MCP servers are often downloaded directly from public Git repositories without vetting, version tracking, or update governance.

No agent-aware authentication

Agents typically inherit a user's broad credentials with no per-server identity delegation, scope, or step-up authentication.

Unbounded data access

Once an agent connects to an MCP server for a corporate resource, there is little control over what data it reads, transforms, or sends to an LLM.

Direct upstream bypass

A portal-only deployment can still be bypassed: a blocked portal user may connect directly to the upstream MCP server URL. Cloudflare's own documentation calls this out — upstream MCP servers must also be Access-protected.

Context bloat in large catalogs

Loading every upstream tool schema into the agent context burns tokens and degrades tool selection as the catalog grows. Without a search/execute layer, large MCP portals become expensive and noisy.

No centralized audit

Without a unified control plane, security teams cannot answer "which agents called which tools on whose behalf with what data."

How Nanosek delivers MCP governance

Phase 1

Discover MCP activity

Use Cloudflare Gateway and DNS logs to surface local and remote MCP connections from the user population.
Inventory MCP-capable AI clients in use — Claude Desktop, Cursor, custom agents, internal tooling.
Identify which corporate resources (Slack, Confluence, internal APIs, code hosts) are being reached via MCP today.

Phase 2

Move from local to remote MCP servers

Host MCP servers as remote deployments — on Cloudflare Workers, on supported providers, or in-house with public endpoints.
Catalog vetted MCP server sources and versions; replace unvetted local clones.
Define ownership, change control, and update process per MCP server.

Phase 3

Front each MCP server with Cloudflare Access

Wrap every remote MCP server endpoint with Cloudflare Access — an MCP-server or self-hosted application with Managed OAuth for servers you host (the server validates the Access JWT rather than running its own OAuth), or an Access for SaaS (OIDC) app for provider-hosted MCP services.
Protect the upstream MCP server directly — not just the portal route — so a blocked portal user cannot fall back to the direct URL.
Use service tokens for controlled machine-to-machine MCP access; treat them as separate from human user flows.

Phase 4

Centralize with MCP Server Portals

Configure an MCP Server Portal that aggregates approved MCP servers behind a single Access-protected URL.
Use Access policy to control per-user server visibility; admins rename tools with aliases, hide tools, or run an allowlist-only mode.
For OAuth-capable upstream MCP servers, prefer per-user authentication. If an admin credential is used, scope it tightly — every portal user reaches the upstream server through it.
For large catalogs, enable Code Mode so the portal exposes a search/execute interface instead of loading every upstream tool schema into the model context.

Phase 5

Route portal traffic through Gateway for logs and DLP

Enable Gateway routing on the MCP Portal so MCP calls appear in Gateway HTTP logs.
Apply Gateway HTTP policies with DLP profiles targeted at the upstream MCP server host — not just the portal URL.
Note: HTTP DLP profiles apply to MCP Portal traffic. AI prompt DLP profiles do not.

Phase 6

Preserve user context downstream

For MCP servers that call internal Access-protected applications, configure Linked App Tokens so the original user identity is carried into the downstream app.
Self-hosted MCP: forward Cf-Access-Jwt-Assertion as Cf-Access-Token. SaaS MCP: forward the OAuth access_token as Authorization: Bearer.
The downstream app keeps enforcing its own per-user RBAC — the chain is only as tight as that application's permission model.

Phase 7

Wrap the LLM side with AI Gateway (adjacent)

Send the agent's outbound LLM provider calls through Cloudflare AI Gateway for caching, retries, rate limits, model fallback, and analytics.
AI Gateway is adjacent to MCP authorization — it is not the data-protection layer for MCP traffic. Use Gateway HTTP DLP for that.

Phase 8

Operate and audit

Send MCP Server Portal logs, Access events, Gateway HTTP logs, and AI Gateway events to SIEM. Logpush for MCP Portal logs is plan-dependent (Enterprise) — confirm the customer's plan.
Build dashboards for "which agent, called which tool, on whose behalf, with what data" — and DLP hits per route.
Review unusual agent behavior, scope creep, and exception requests on a defined cadence.

Architecture for governed AI agents

AI clients (Claude Desktop, Cursor, custom MCP hosts) connect to a single MCP Server Portal URL rather than individual MCP servers.

The portal sits behind Cloudflare Access. MCP clients authenticate using Access Managed OAuth (OAuth 2.0 authorization code flow) — the client may open a browser window for Access login, but this is not the same as a browser-cookie flow. Device authentication via Cloudflare One Client does not silently replace the portal login flow.

Access policy controls per-user server visibility in the portal. Admins curate the tool surface with aliases, toggles, and allowlists. For large catalogs, Code Mode exposes a search/execute interface instead of every upstream schema.

Each upstream MCP server is independently protected by Cloudflare Access — an MCP-server or self-hosted application with Managed OAuth for servers you host, or an Access for SaaS (OIDC) app for provider-hosted servers — so the direct URL cannot bypass the portal.

Gateway routing is enabled on the portal. MCP traffic appears in Gateway HTTP logs and is eligible for Gateway HTTP policies. DLP HTTP profiles target the upstream MCP server host to detect and block sensitive data before it leaves the edge. AI prompt DLP profiles do not apply to MCP Portal traffic.

When an MCP server calls a downstream Access-protected application, Linked App Tokens carry the original user identity (Cf-Access-Token header for self-hosted; Authorization: Bearer for SaaS). The downstream app enforces its own per-user RBAC.

Adjacent layer: outbound LLM provider traffic from the agent runs through Cloudflare AI Gateway for caching, retries, rate limits, model fallback, and analytics. AI Gateway is not the MCP authorization layer.

All activity is logged centrally — Access events, MCP Portal logs, Gateway HTTP logs, and AI Gateway events — and exported to SIEM (Logpush for MCP Portal logs is plan-dependent / Enterprise).

Cloudflare controls used for MCP governance

Remote MCP servers on Cloudflare

Replace local MCP deployments with managed, observable, versioned remote endpoints. Cloudflare Workers and the Agents SDK are the recommended runtime.

Cloudflare Access

Identity-aware authentication for the portal and every upstream MCP server. Servers you host use an Access MCP-server or self-hosted application with Managed OAuth — the server validates the Access JWT rather than running its own OAuth. Provider-hosted MCP services use an Access for SaaS app with the OIDC protocol.

MCP Server Portals

Centralize approved MCP servers behind one endpoint, control server visibility per user, and let admins curate tools and prompts (aliases, toggles, allowlist mode).

Code Mode

Reduces context bloat for large MCP catalogs by exposing a search/execute interface instead of every upstream schema. Cloudflare's published example: 52 tools → 2 portal tools, ~9,400 tokens → ~600 tokens. Actual savings depend on catalog size and agent prompting.

Gateway routing for MCP Portal

Route MCP Portal traffic through Cloudflare Gateway so HTTP policies, logs, and DLP can apply before traffic reaches upstream MCP servers.

HTTP DLP profiles

Attached to Gateway HTTP policies. Detect and block sensitive data sent to upstream MCP servers. AI prompt DLP profiles do not apply to MCP Portal traffic.

Linked App Tokens

Preserve user context when a protected MCP server calls downstream Access-protected applications. Attaches to self-hosted downstream apps only.

Service tokens

Controlled machine-to-machine MCP access. Service-token sessions are not human user sessions; for OAuth-capable upstream servers, the portal relies on the configured admin credential.

AI Gateway (adjacent)

Controls and observes LLM provider traffic — caching, retries, model fallback, rate limits, analytics. Adjacent to MCP authorization; not a replacement for Access, Gateway, or MCP Portal.

Logpush

Sends Access, MCP Portal, Gateway HTTP, and AI Gateway events to SIEM. Logpush for MCP Portal logs is plan-dependent (Enterprise).

Terraform / API automation

Keeps MCP Portal config, Access policies, Gateway routing, and DLP rules version-controlled and reviewable.

Control	When Nanosek uses it
Remote MCP servers on Cloudflare	Replace local MCP deployments with managed, observable, versioned remote endpoints. Cloudflare Workers and the Agents SDK are the recommended runtime.
Cloudflare Access	Identity-aware authentication for the portal and every upstream MCP server. Servers you host use an Access MCP-server or self-hosted application with Managed OAuth — the server validates the Access JWT rather than running its own OAuth. Provider-hosted MCP services use an Access for SaaS app with the OIDC protocol.
MCP Server Portals	Centralize approved MCP servers behind one endpoint, control server visibility per user, and let admins curate tools and prompts (aliases, toggles, allowlist mode).
Code Mode	Reduces context bloat for large MCP catalogs by exposing a search/execute interface instead of every upstream schema. Cloudflare's published example: 52 tools → 2 portal tools, ~9,400 tokens → ~600 tokens. Actual savings depend on catalog size and agent prompting.
Gateway routing for MCP Portal	Route MCP Portal traffic through Cloudflare Gateway so HTTP policies, logs, and DLP can apply before traffic reaches upstream MCP servers.
HTTP DLP profiles	Attached to Gateway HTTP policies. Detect and block sensitive data sent to upstream MCP servers. AI prompt DLP profiles do not apply to MCP Portal traffic.
Linked App Tokens	Preserve user context when a protected MCP server calls downstream Access-protected applications. Attaches to self-hosted downstream apps only.
Service tokens	Controlled machine-to-machine MCP access. Service-token sessions are not human user sessions; for OAuth-capable upstream servers, the portal relies on the configured admin credential.
AI Gateway (adjacent)	Controls and observes LLM provider traffic — caching, retries, model fallback, rate limits, analytics. Adjacent to MCP authorization; not a replacement for Access, Gateway, or MCP Portal.
Logpush	Sends Access, MCP Portal, Gateway HTTP, and AI Gateway events to SIEM. Logpush for MCP Portal logs is plan-dependent (Enterprise).
Terraform / API automation	Keeps MCP Portal config, Access policies, Gateway routing, and DLP rules version-controlled and reviewable.

Security outcomes with the Cloudflare control plane

Single sign-on identity on every MCP connection — users authenticate through Cloudflare Access, not per-laptop credentials.

One curated catalog: only approved MCP servers are reachable, with per-user visibility set by Access policy.

Least-privilege tool surface — admins alias, hide, or allowlist tools so an agent sees only what it should.

Sensitive data stays in — HTTP DLP profiles inspect MCP traffic through Gateway before it reaches upstream servers.

Centralized, structured logs — Access events, MCP Portal logs, and Gateway HTTP logs feed your SIEM (Logpush export is Enterprise).

No silent bypass — every upstream server is Access-protected directly, so a blocked user can't reach the direct URL.

User identity preserved downstream — Linked App Tokens carry the original user into apps that keep enforcing their own RBAC.

Lower context cost at scale — Code Mode collapses large catalogs into a search/execute interface (Cloudflare's example: 52 tools, roughly 9,400 to 600 tokens).

Central revocation — pull an agent's access from one control plane without redeploying servers.

Shadow MCP surfaced — Gateway HTTP visibility flags unmanaged MCP by path and JSON-RPC method before you enforce a portal.