The Authorisation Layer: The Infrastructure AI Agents Are Missing

Every serious software stack has layers. Compute, storage, networking, identity, observability. Each is a horizontal concern that cuts across every application built on top of it. You do not implement TLS per application. You do not build your own DNS resolver per service. You use the layer.

AI agents are being deployed without one of the most critical layers: authorisation.

Not authentication. Not access control on an API wrapper. Not a system prompt with instructions. A proper authorisation layer, one that intercepts every action an agent intends to take, evaluates it against policy, and makes an enforcement decision before the action executes.

This is not a feature. It is the missing infrastructure primitive of the current AI agent stack. And without it, enterprises are deploying agents that are, in the most literal sense, ungoverned.

Xybern is the authorisation layer for enterprise AI agents. This piece defines what that means, why it matters, and what the architecture looks like in practice.

What Agents Actually Do

Before you can govern agents, you need to be precise about what they do. An agent is not a chatbot. It is not a function that takes input and returns text. An agent is an autonomous process that:

Takes actions in external systems. APIs, databases, file systems, email platforms, ERP systems, trading infrastructure. Real systems with real consequences.

Makes decisions based on model reasoning. The model decides what to do next. That decision is conditioned on a prompt, a context window, tool outputs, and training data the operator does not fully control.

Operates over extended time horizons. A single agent run can involve dozens of sequential actions, each one building on the results of the last.

Chains with other agents. Multi-agent pipelines are the standard architecture for complex workflows. One agent's output becomes another agent's input and action.

Operates at machine speed. An agent does not pause between actions the way a human does. It executes as fast as the infrastructure allows.

This combination is what makes ungoverned agents dangerous. The blast radius of an agent error is not bounded by human reaction time. A misconfigured finance agent can execute hundreds of transactions before anyone notices. A poorly governed legal agent can file documents, send correspondence, or trigger contractual obligations that are difficult or impossible to reverse. A compromised customer-facing agent can exfiltrate data across thousands of interactions.

The question is not whether AI agents should be governed. It is where in the stack governance lives.

The Four Failures

When enterprise teams today try to govern their agents, they typically reach for one of four approaches. All four are insufficient.

Failure 1: System Prompt Instructions

"Do not take actions worth more than $10,000." "Always ask for confirmation before sending emails."

System prompt instructions are not enforceable. They are suggestions to the model. A sufficiently complex context window, an adversarial input, a jailbreak attempt, or a model update can cause the model to violate the instruction. You cannot audit adherence to a system prompt. You cannot demonstrate to a regulator that an agent followed a rule embedded in natural language. Prompts are not policy.

This is not a criticism of prompt engineering as a discipline. It is a statement about what prompts are structurally capable of. Governance requires deterministic enforcement. A language model is not a deterministic enforcement engine.

Failure 2: Application-Level Guards

Developers add conditional logic around action calls:

     if amount > 10000: raise Exception

This works until the codebase grows, until a developer forgets to add the guard on a new code path, until the business rule changes and the guard is not updated everywhere it appears, or until the agent takes an action the developer did not anticipate when writing the guard.

Application-level guards do not compose. They are not centralised. They cannot be audited as a whole. Changing a business rule requires a deployment. There is no single place to ask "what are the current governance rules for this agent?" because the rules are distributed across the codebase.

Failure 3: Traditional IAM

AWS IAM, Azure AD, Okta, and equivalent systems are designed for human identities and service accounts. They answer the question: is this principal allowed to call this API endpoint?

They do not answer: should this agent be allowed to send this specific email, with this specific content, to this specific recipient, at this specific time, given what it has already done in this session, and given that the amount involved exceeds the threshold defined in the policy that applies to agents with this trust score?

IAM operates at the permission level. Agents need governance at the action level. The agent has permission to call the email API. The governance question is whether it should, given the full context of this specific action.
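
The gap between the two levels can be made concrete with a minimal sketch. Everything named here (the PERMISSIONS table, the EmailAction fields, the one-external-email-per-session rule, the example.com internal domain) is an assumption chosen for illustration, not part of any real IAM product or of Xybern.

```python
from dataclasses import dataclass

# Hypothetical permission table: this is all an IAM-style check looks at.
PERMISSIONS = {"support-agent": {"email.send"}}


@dataclass
class EmailAction:
    agent_id: str
    recipient: str
    body: str
    external_emails_sent_this_session: int = 0


def iam_allows(agent_id: str, permission: str) -> bool:
    """Permission-level check: may this principal call this API at all?"""
    return permission in PERMISSIONS.get(agent_id, set())


def action_allowed(action: EmailAction) -> bool:
    """Action-level check: should this specific email be sent, in context?"""
    if not iam_allows(action.agent_id, "email.send"):
        return False
    is_external = not action.recipient.endswith("@example.com")
    # Illustrative rule: at most one external email per session.
    if is_external and action.external_emails_sent_this_session >= 1:
        return False
    return True


first = EmailAction("support-agent", "customer@other.org", "Hello")
second = EmailAction("support-agent", "customer@other.org", "Hello again",
                     external_emails_sent_this_session=1)

print(iam_allows("support-agent", "email.send"))      # True for both emails
print(action_allowed(first), action_allowed(second))  # True False
```

The permission check returns the same answer for both emails. Only the action-level check, which sees the recipient and the session history, can tell them apart.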

Failure 4: Post-Hoc Logging

Logging what agents did after the fact is observability, not governance. An audit trail tells you what went wrong. An authorisation layer prevents it from happening. These are not substitutes for each other.

Post-hoc logging is necessary. It is not sufficient. Regulated enterprises cannot tell a compliance officer that they discovered the problem from the logs. The question regulators ask is not "what happened?" It is "what controls prevented the wrong thing from happening?"

The Authorisation Layer Pattern

What is actually needed is a dedicated authorisation layer: a horizontal infrastructure component that sits between AI agents and the systems they act on, intercepts every intended action, evaluates it in real time against active policies, and returns an enforcement decision before the action executes.

This is not a new concept in systems architecture. Policy enforcement is well understood in human IAM, in network security (firewalls, WAFs, zero-trust proxies), and in data governance (column-level access control, DLP). What is new is applying it to AI agent actions at runtime, with the speed, context-awareness, and policy expressiveness that agents require.

The authorisation layer has four responsibilities:

Intercept. Every agent action, before execution, is submitted to the authorisation layer. Not a sample. Not high-risk actions only. Every action. The enforcement call happens in the agent's hot path, before the downstream system is touched.

Evaluate. The layer evaluates the action against the full set of active policies. Policies express rules based on action type, content, metadata, agent identity, time, chain context, and trust score. Evaluation is deterministic, consistent, and auditable.

Decide. The layer returns one of three decisions: allow the action to proceed, block the action and return a reason the agent can handle, or escalate the action for human review and pause the agent until a human resolves it.

Record. Every decision is written to an immutable audit log before it is returned to the agent. The record includes the full action context, the policies evaluated, the decision, the reasoning, and a trust score. This record cannot be altered after the fact.
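
The four responsibilities can be sketched in a few dozen lines. This is an illustration under stated assumptions, not Xybern's implementation: the Action fields, the two example policies, the strictest-decision-wins combination rule, and the in-memory AUDIT_LOG are all hypothetical.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable, List, Optional, Tuple


class Decision(Enum):
    ALLOW = "allow"
    BLOCK = "block"
    ESCALATE = "escalate"


@dataclass(frozen=True)
class Action:
    agent_id: str
    action_type: str
    metadata: dict


# A policy returns (decision, reason) when it matches, otherwise None.
Policy = Callable[[Action], Optional[Tuple[Decision, str]]]

# Assumed combination rule: the strictest matching decision wins.
SEVERITY = {Decision.ALLOW: 0, Decision.ESCALATE: 1, Decision.BLOCK: 2}


def large_payment(action: Action):
    if action.action_type == "payment" and action.metadata.get("amount", 0) > 10_000:
        return Decision.ESCALATE, "amount exceeds 10000"
    return None


def sanctioned_recipient(action: Action):
    if action.metadata.get("recipient_jurisdiction") == "sanctioned":
        return Decision.BLOCK, "recipient jurisdiction is sanctioned"
    return None


AUDIT_LOG = []  # in-memory stand-in for an append-only store


def enforce(action: Action, policies: List[Policy]) -> Decision:
    """Evaluate every policy, decide, and record before returning."""
    matches = [m for m in (p(action) for p in policies) if m is not None]
    decision, reason = max(matches, key=lambda m: SEVERITY[m[0]],
                           default=(Decision.ALLOW, "no policy matched"))
    # The record is written before the decision reaches the caller.
    AUDIT_LOG.append({"action": action, "decision": decision, "reason": reason})
    return decision


policies = [large_payment, sanctioned_recipient]
d1 = enforce(Action("fin-1", "payment", {"amount": 500}), policies)
d2 = enforce(Action("fin-1", "payment", {"amount": 50_000}), policies)
d3 = enforce(Action("fin-1", "payment",
                    {"amount": 50_000, "recipient_jurisdiction": "sanctioned"}),
             policies)
print(d1.value, d2.value, d3.value)  # allow escalate block
```

The sketch allows anything no policy matches. A stricter deployment might default to block or escalate instead; that is a policy choice, not something the pattern dictates.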

The Architecture

A concrete authorisation layer for AI agents looks like this:

 Agent Code
     │
     │  enforce(action)         ← SDK / auto-capture layer
     ▼
 Authorisation Layer
     │
     ├── 1. Identity resolution   (agent_id, role, trust score)
     ├── 2. Policy evaluation     (all active policies)
     ├── 3. Decision generation   (allow / block / escalate)
     ├── 4. Vault write           (immutable record, before response)
     │
     ▼
 Decision returned to agent
     │
     ├── ALLOW    → agent proceeds, action executes
     ├── BLOCK    → agent stops, reason logged, error handled
     └── ESCALATE → agent pauses, human reviews, agent resumes or stops
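
On the agent side, the enforce(action) step in the diagram could be wired in as a wrapper around each tool. The sketch below is a hypothetical illustration of that idea: the governed decorator, the exception names, and toy_enforce (a local stand-in for the remote authorisation call) are all assumptions, not the Xybern SDK.

```python
from functools import wraps


class ActionBlocked(Exception):
    """Raised when the layer blocks an action."""


class ActionEscalated(Exception):
    """Raised when the layer pauses an action for human review."""


def governed(action_type, enforce):
    """Wrap a tool so that enforce() runs before the tool body does."""
    def decorator(tool):
        @wraps(tool)
        def wrapper(**kwargs):
            decision = enforce({"type": action_type, "args": kwargs})
            if decision == "block":
                raise ActionBlocked(action_type)
            if decision == "escalate":
                raise ActionEscalated(action_type)
            return tool(**kwargs)  # only reached on "allow"
        return wrapper
    return decorator


def toy_enforce(action):
    # Stand-in for the call to the authorisation layer (thresholds are made up).
    amount = action["args"].get("amount", 0)
    if amount > 100_000:
        return "block"
    if amount > 10_000:
        return "escalate"
    return "allow"


executed = []  # records which payments actually reached the downstream system


@governed("payment", toy_enforce)
def send_payment(amount, recipient):
    executed.append((amount, recipient))
    return "sent"


print(send_payment(amount=50, recipient="acme"))  # sent
try:
    send_payment(amount=500_000, recipient="acme")
except ActionBlocked as exc:
    print("blocked:", exc)
print(executed)  # only the allowed payment ran
```

Because the check lives in the wrapper rather than in each tool body, a new tool is governed as soon as it is decorated, and the tool body never runs for a blocked or escalated action.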

The key architectural properties:

The layer is a separate service, not agent code. Policies are centralised. One change applies everywhere, immediately, without a deployment.

The audit log is independent of the agent. Even if the agent code is replaced or compromised, the record of what was attempted is intact.

Enforcement is consistent across agents. Whether you have one agent or a thousand, running on one machine or across a distributed system, the same policies apply.

The layer composes across frameworks. CrewAI agents, LangGraph workflows, AutoGen systems, custom pipelines, all governed by the same layer without rebuilding enforcement for each.
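
The text does not say how the audit record is made immutable. One common technique, shown here purely as an assumption, is a hash chain: each entry's hash covers the previous entry's hash, so editing or reordering any record invalidates everything after it. The function names and record fields are illustrative.

```python
import hashlib
import json

GENESIS = "0" * 64  # placeholder hash for the first entry's predecessor


def append_record(log, record):
    """Append a record whose hash covers the previous entry's hash."""
    prev_hash = log[-1]["hash"] if log else GENESIS
    payload = json.dumps(record, sort_keys=True)
    digest = hashlib.sha256((prev_hash + payload).encode("utf-8")).hexdigest()
    log.append({"record": record, "prev_hash": prev_hash, "hash": digest})


def verify(log):
    """Recompute every hash; an edited or reordered entry breaks the chain."""
    prev_hash = GENESIS
    for entry in log:
        payload = json.dumps(entry["record"], sort_keys=True)
        expected = hashlib.sha256((prev_hash + payload).encode("utf-8")).hexdigest()
        if entry["prev_hash"] != prev_hash or entry["hash"] != expected:
            return False
        prev_hash = entry["hash"]
    return True


log = []
append_record(log, {"agent_id": "fin-1", "action": "payment", "decision": "allow"})
append_record(log, {"agent_id": "fin-1", "action": "payment", "decision": "block"})
print(verify(log))  # True

log[0]["record"]["decision"] = "block"  # simulate tampering with history
print(verify(log))  # False
```

A hash chain makes tampering evident rather than impossible, and on its own it does not detect truncation of the most recent entries. A production log would also need write-once storage or an externally anchored head hash.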

Policy Expressiveness

The quality of an authorisation layer is determined by how expressive its policy language is. A layer that can only block specific action types is not sufficient for enterprise AI governance.

Production-grade authorisation requires policies that express:

Content rules. Block any action where the email body contains personal financial identifiers. Flag any prompt where the content matches known exfiltration patterns.

Metadata rules. Block any transaction where amount > 100000 and recipient_jurisdiction == "sanctioned". Escalate any action where data_classification == "confidential" and the agent's trust score is below 70.

Temporal rules. Allow database writes only between 09:00 and 17:00 on weekdays. Auto-expire all permissions for this agent after 30 minutes. Do not allow this agent to retry a blocked action more than twice in one hour.

Identity and role rules. This agent role has payments.read but not payments.execute. This agent has been granted temporary elevated permissions by a human operator, valid for 20 minutes, for this specific task.

Chain and session rules. If this agent has already sent an external communication in this session, escalate the next one for review. If the trust score has fallen below 60 across the last five decisions, require human approval for all subsequent actions.

Delegation rules. Agent A may delegate data.read to Agent B for this task, but not data.write. The delegation expires at session end. Agent B cannot further delegate.

This is the policy surface that enterprise AI governance actually requires. It is far beyond what application-level conditionals or system prompts can express, and it cannot be assembled from traditional IAM primitives.
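
Several of the rule types above can be expressed as plain predicates over a shared context. The following is a sketch, assuming a hypothetical Context shape, hypothetical action type names (db.write, email.external), naive local datetimes, and a strictest-decision-wins combination; a real policy language would look different.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class Context:
    action_type: str
    metadata: dict
    trust_score: int
    now: datetime
    external_messages_this_session: int = 0


def metadata_rule(ctx):
    m = ctx.metadata
    if m.get("amount", 0) > 100_000 and m.get("recipient_jurisdiction") == "sanctioned":
        return "block"
    if m.get("data_classification") == "confidential" and ctx.trust_score < 70:
        return "escalate"
    return None


def temporal_rule(ctx):
    # Database writes only between 09:00 and 17:00 on weekdays.
    if ctx.action_type != "db.write":
        return None
    is_weekday = ctx.now.weekday() < 5  # Monday is 0, Saturday is 5
    in_hours = 9 <= ctx.now.hour < 17
    return None if (is_weekday and in_hours) else "block"


def session_rule(ctx):
    # Escalate any external communication after the first in a session.
    if ctx.action_type == "email.external" and ctx.external_messages_this_session >= 1:
        return "escalate"
    return None


RULES = [metadata_rule, temporal_rule, session_rule]
ORDER = {"allow": 0, "escalate": 1, "block": 2}


def evaluate(ctx):
    """Run every rule and return the strictest result, or allow if none match."""
    results = [rule(ctx) for rule in RULES]
    return max((r for r in results if r), key=ORDER.get, default="allow")


wednesday_10am = datetime(2024, 1, 3, 10, 0)
saturday_10am = datetime(2024, 1, 6, 10, 0)

print(evaluate(Context("db.write", {}, 90, wednesday_10am)))  # allow
print(evaluate(Context("db.write", {}, 90, saturday_10am)))   # block
print(evaluate(Context("report.read", {"data_classification": "confidential"},
                       65, wednesday_10am)))                  # escalate
```

Each rule reads only the context it is handed, which is what lets the full set be listed, versioned, and audited in one place.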

Human-in-the-Loop as a Governance Primitive

Escalation is not failure. It is a deliberate governance mechanism.

For certain actions, the correct answer is neither allow nor block but escalate. A finance agent about to execute a large cross-border transfer. A legal agent about to file a regulatory document. An operations agent about to change production infrastructure. These are situations where human judgement adds value and where the cost of an autonomous wrong decision exceeds the cost of a brief pause for human review.

The authorisation layer handles escalation as a first-class workflow:

The agent submits the action and receives an escalate decision with a decision ID.
The action is paused. The agent waits, polling for a resolution.
A human operator reviews the action in a dedicated interface, with full context: what the agent is trying to do, why, what policies triggered escalation, the agent's recent decision history, and the trust score.
The operator approves or rejects. The decision is recorded immediately.
The agent resumes on approval or terminates on rejection.
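
The steps above can be sketched as a submit-and-poll loop. This is a toy, in-process model: the PENDING store, the function names, the decision ID format, and the choice to treat a timeout as a rejection are all assumptions made for illustration.

```python
import threading
import time

# In-memory stand-in for the layer's escalation store (hypothetical).
PENDING = {}


def submit(action):
    """Pretend the layer escalated the action; return a decision ID."""
    decision_id = f"dec-{len(PENDING) + 1}"
    PENDING[decision_id] = {"action": action, "status": "pending"}
    return decision_id


def resolve(decision_id, approved):
    """Called from the human operator's review interface."""
    PENDING[decision_id]["status"] = "approved" if approved else "rejected"


def wait_for_resolution(decision_id, poll_interval=0.01, timeout=1.0):
    """Poll until a human resolves the escalation; reject on timeout."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        status = PENDING[decision_id]["status"]
        if status != "pending":
            return status
        time.sleep(poll_interval)
    return "rejected"  # fail closed: no answer means the action does not run


decision_id = submit({"type": "wire_transfer", "amount": 250_000})
# Simulate an operator approving 50 ms later from another thread.
threading.Timer(0.05, resolve, args=(decision_id, True)).start()

outcome = wait_for_resolution(decision_id)
print(outcome)  # approved
```

Failing closed on timeout is one possible design choice. It trades availability for safety, which suits the high-stakes actions that escalation is meant for.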

The escalation loop is the bridge between fully autonomous agents and fully manual processes. It allows enterprises to deploy agents with confidence into high-stakes workflows, because the layer guarantees that actions above defined risk thresholds will surface to humans before executing.

Enterprise Implications

For engineering and security teams, the authorisation layer is infrastructure. For legal, compliance, and risk teams, it is the mechanism that makes AI agent deployment defensible.

Regulatory alignment. GDPR, SOX, HIPAA, DORA, the EU AI Act, and emerging AI-specific regulations differ in scope, but they share a common theme: organisations are expected to demonstrate meaningful control over the automated systems that touch regulated data and decisions. A centralised authorisation layer with an immutable audit log is the mechanism that makes this demonstration possible. "Every action was evaluated against defined policy before execution, and the full record is here" is a statement a compliance officer can work with.

Liability containment. When an AI agent causes harm, the first question from legal counsel and regulators is what controls were in place. Application-level guards distributed across a codebase are difficult to inventory and impossible to demonstrate comprehensively. A centralised policy engine with a complete decision log can be inventoried and demonstrated directly.

Operational confidence. Engineering teams can ship new agent capabilities faster when governance is a layer, not a per-agent implementation concern. You define the policy once. It applies to every agent that calls the enforcement endpoint. New agents inherit the governance posture of the organisation without requiring per-agent policy work.

Security boundary. AI agents are attack surfaces. Prompt injection attacks attempt to make agents take actions their operators did not intend. An authorisation layer that evaluates the attempted action itself, rather than trusting the model's stated intent, provides a security boundary that prompt-level defences cannot. Even if the model is manipulated into attempting a malicious action, the layer can block it before it executes, provided a policy covers that action.

Audit readiness. When a compliance team asks what your agents did last quarter, and why, you need an answer you can provide in hours, not weeks. The authorisation layer's immutable decision log is that answer: every action, every decision, every policy evaluated, every escalation, and every resolution is retrievable and verifiable.

Breakglass: Governance for Emergencies

In production systems, there will be situations where the correct response is a controlled, temporary override of normal governance. A trading system needs to execute a time-sensitive action outside normal hours. An incident response agent needs elevated permissions to remediate a live security event. A senior operator needs to unblock an agent immediately during a critical workflow.

The authorisation layer handles this through a breakglass protocol: a human-initiated, time-limited override with mandatory justification, immediate audit recording, automatic expiry, and a post-incident review requirement. The override does not disable governance. It creates a fully logged exception, visible in the audit trail and attributable to the human who initiated it.

Breakglass without an immutable audit trail is not governance. It is an undocumented gap. The authorisation layer closes it.
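
A breakglass override with these properties might be modelled as follows. This is a minimal sketch: the Override fields, the function names, and the in-memory AUDIT list are hypothetical, and the post-incident review step is left out.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone

AUDIT = []  # in-memory stand-in for the immutable audit log


@dataclass(frozen=True)
class Override:
    operator: str        # the human who initiated the override
    agent_id: str
    justification: str
    expires_at: datetime


def open_breakglass(operator, agent_id, justification, minutes, now):
    """Create a time-limited override; refuse it without a justification."""
    if not justification.strip():
        raise ValueError("breakglass requires a justification")
    override = Override(operator, agent_id, justification,
                        now + timedelta(minutes=minutes))
    AUDIT.append(("breakglass_opened", override, now))  # recorded immediately
    return override


def is_active(override, now):
    """Expiry is automatic: the override stops applying at expires_at."""
    return now < override.expires_at


opened_at = datetime(2024, 1, 3, 22, 0, tzinfo=timezone.utc)
override = open_breakglass("alice", "trading-1",
                           "settle position before market close", 20, opened_at)

print(is_active(override, opened_at + timedelta(minutes=19)))  # True
print(is_active(override, opened_at + timedelta(minutes=20)))  # False
```

Because expiry is a comparison against a timestamp rather than a cleanup job, there is no window in which a forgotten override stays live.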

What Xybern Implements

Xybern is the authorisation layer for enterprise AI agents. Every agent action is enforced, audited, and governed before it executes.

The enforcement API intercepts actions via POST /v1/enforce/intercept, evaluates them against active policies in real time, and returns a decision with trust score and reasoning in under 10 milliseconds on the fast path. The SDK integrates with CrewAI, LangGraph, AutoGen, LlamaIndex, and custom pipelines. Auto-capture intercepts tool calls at the framework level without requiring manual enforcement calls at each action site.

The policy engine supports content, metadata, temporal, identity, chain, and delegation rules. The Policy-as-Code SDK allows engineering teams to define policies as version-controlled Python, deployed atomically with full provenance tracking. Shadow mode evaluates new policies against live traffic without enforcing them, so false positives surface before a policy goes live rather than after.
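
The general idea behind shadow evaluation can be shown with a short sketch. It is a generic illustration, not Xybern's Policy-as-Code SDK: the two policies, the thresholds, and the enforce_with_shadow helper are invented for the example.

```python
def current_policy(action):
    return "block" if action["amount"] > 100_000 else "allow"


def candidate_policy(action):
    # Stricter draft rule that is being evaluated, not enforced.
    return "block" if action["amount"] > 50_000 else "allow"


def enforce_with_shadow(action, live, shadow, shadow_log):
    """Return the live decision; record the shadow decision when it differs."""
    live_decision = live(action)
    shadow_decision = shadow(action)
    if shadow_decision != live_decision:
        shadow_log.append({"action": action, "live": live_decision,
                           "shadow": shadow_decision})
    return live_decision  # the shadow policy never affects the outcome


shadow_log = []
traffic = [{"amount": 10_000}, {"amount": 75_000}, {"amount": 200_000}]
decisions = [enforce_with_shadow(a, current_policy, candidate_policy, shadow_log)
             for a in traffic]

print(decisions)        # ['allow', 'allow', 'block']
print(len(shadow_log))  # 1
```

The single logged disagreement (the 75,000 action) is exactly what a reviewer would inspect before promoting the candidate policy: either it is a true positive the old rule missed, or a false positive the new rule would introduce.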

The LLM Gateway sits in front of model providers and enforces policy on every prompt and completion, governing not just what agents do but what models receive and return.

The Provenance Vault records every enforcement decision before it is returned to the agent. The record is immutable. The Sentinel dashboard gives security teams a real-time view of every decision, every escalation, every policy evaluation. The human-in-the-loop interface closes the escalation loop in seconds.

The Infrastructure Question

The authorisation layer is horizontal infrastructure. Like a firewall, a service mesh, or an identity provider, it is a component that every team in an organisation benefits from without building it themselves.

The question for engineering and security leaders is not whether AI agents need governance. It is where governance lives in the stack.

If the answer is distributed across agent code, implemented differently per team, expressed as system prompt instructions and application-level conditionals, the organisation is accumulating governance debt at the rate it deploys agents. Every new agent is a new surface without a consistent enforcement posture. Every policy change is a deployment. Every audit is a reconstruction.

If the answer is a centralised authorisation layer, with a policy engine, an immutable audit log, and a human review interface, the organisation has infrastructure that scales with its AI adoption. New agents inherit governance. Policy changes are immediate. Audits are retrievable.

That infrastructure exists. It is the authorisation layer. And it is the piece that is missing from every AI agent stack deployed without one.

Xybern is the authorisation layer for enterprise AI agents. Every agent action is enforced, audited, and governed before it executes. Learn more at xybern.com or read the technical documentation at docs.xybern.com.

