AI Agents Need Permission Boundaries

Every operating system ever built for multiple users has a concept of a permission boundary. A process runs as a user. That user can read some files and not others, write to some directories and not others, call some system functions and not others. The boundary is not advisory. It is enforced by the kernel, below the level of the application, and the application cannot talk its way past it.

AI agents are being deployed today without anything resembling this. An agent is handed a set of tools, a set of credentials, and a natural language instruction, and is then trusted to stay inside an invisible line that exists only as a suggestion in its prompt. There is no kernel. There is no boundary. There is only the hope that the model behaves.

This piece argues that permission boundaries are not an optional hardening step for agentic systems. They are the foundational primitive that makes agents safe to deploy in production at all, and explains what a real boundary looks like, where the naive approaches fail, and how enforcement has to work to be meaningful.

What a Permission Boundary Actually Is

A permission boundary is a hard limit on what an actor can do, enforced by something the actor does not control.

The three words that matter are hard, enforced, and does not control.

Hard means the boundary is not a probability. It is not "the agent will usually not do this." It is "the agent cannot do this, and if it tries, the attempt is intercepted and denied."

Enforced means there is a mechanism, separate from the actor, that evaluates every attempt against the boundary and decides. Enforcement is active. It happens at the moment of the action, not after.

Does not control means the actor cannot modify, disable, or reason its way around the boundary. A boundary the agent can edit is not a boundary. A boundary defined in the agent's own prompt is, by definition, under the agent's influence.

When you remove any one of these properties, you no longer have a boundary. You have a guideline. The entire problem with current agent deployments is that the industry has been shipping guidelines and calling them boundaries.

Why Agents Specifically Need Them

It is worth being precise about why this matters more for agents than for traditional software, because the answer shapes the solution.

A traditional application is deterministic in its action space. A payments service calls the payments API. It does not, on a Tuesday, decide to start sending emails because something in its input suggested it might be helpful. The set of actions a conventional program can take is fixed at development time and visible in the code.

An agent's action space is open. The model decides what to do next based on reasoning over its context. That context includes the task instruction, the conversation so far, the outputs of previous tool calls, and content retrieved from external sources. Any of those can shift what the agent decides to do. The action space is not bounded by the code. It is bounded only by the tools the agent has access to, and an agent with broad tools has an enormous range of possible actions.

This produces four properties that traditional software does not have, and each one is a reason boundaries are mandatory.

Emergent behaviour. The agent can take action sequences no developer anticipated or tested. You cannot enumerate the failure modes in advance because the action space is combinatorial and driven by model reasoning.

Susceptibility to manipulation. Content the agent reads can change what it does. A malicious document, a poisoned search result, or an injected instruction can redirect the agent toward actions that serve the attacker rather than the operator.

Speed. Agents act at machine speed with no natural pause. By the time a human notices, the damage is done. There is no reaction window.

Compounding. Agents chain actions, and they chain with other agents. A single bad decision early in a sequence can cascade through every subsequent step.

Property	Traditional Software	AI Agent
Action space	Fixed at development time	Open, decided at runtime by the model
Failure modes	Enumerable, testable	Emergent, combinatorial
Influenced by input content	No	Yes, including adversarial content
Execution speed	Bounded by program logic	Machine speed, no pause
Blast radius of one error	Local, bounded	Compounding across a chain

Each row in that table is an argument for an external boundary. Taken together they are decisive. You cannot test your way to safety in an open action space, and you cannot trust an actor that can be manipulated by the very data it processes.

The Four Things People Mistake for Boundaries

When teams reach for a way to constrain their agents, they typically land on one of four approaches. None of them is a boundary, and understanding why is the fastest way to understand what a real one requires.

Prompt instructions

"You must never delete production data. You must never send money over ten thousand without approval."

This is the most common approach and the weakest. Instructions in a prompt are inputs to a probabilistic system, not constraints on it. The model weighs them against everything else in its context. A long conversation, a cleverly worded user request, an injected instruction in a retrieved document, or simply an unlucky sampling can override them. There is no enforcement and no audit. You cannot prove to anyone that the rule held, because on any given run it might not have.

Tool design

The reasoning here is that if you only give the agent safe tools, it can only do safe things. Restrict the toolset and you restrict the behaviour.

This is better than nothing because it genuinely narrows the action space. But it is coarse and static. A tool is either present or absent. You cannot express "this tool is allowed for amounts under a threshold," or "allowed during business hours," or "allowed only after a human has approved the preceding step." The boundary you actually need is contextual, and tool presence is binary. The moment a tool is useful enough to grant, it is useful enough to misuse.

Application guards

Developers wrap tool calls in conditional logic. If the amount exceeds a limit, raise an error. If the recipient is not on a list, block.

Guards are real enforcement, which puts them ahead of prompts and tool design. The problem is that they do not compose and they do not centralise. Every guard lives at one call site. When a new code path is added and the developer forgets the guard, the boundary has a hole. When the rule changes, every copy of the guard must be found and updated. There is no single place that answers the question "what are the rules for this agent right now," because the rules are smeared across the codebase. Guards are boundaries with no map.

Post hoc review

Log everything the agent did and have a human review it.

This is not a boundary at all. It is a record of boundaries that were never enforced. Review tells you what happened after it happened. For an agent moving at machine speed across a chain of consequential actions, that is an autopsy, not a control. The wrong action already executed. The money already moved. The data already left.

Approach	Enforced	Contextual	Centralised	Auditable	Tamper resistant
Prompt instructions	No	Partly	No	No	No
Tool design	Partly	No	No	No	Partly
Application guards	Yes	Partly	No	Partly	No
Post hoc review	No	n/a	No	Yes	No
Permission boundary	Yes	Yes	Yes	Yes	Yes

The bottom row is the target. Everything above it is missing at least two of the properties that make a boundary worth the name.

Anatomy of a Real Permission Boundary

A permission boundary for an agent is a control point that sits between the agent and every system it can act on. Every action the agent intends to take passes through it before reaching the target. At that point the boundary evaluates the action against policy and returns a decision. The action does not execute until the decision is made.

There are five components that make this work.

The interception point. Nothing reaches a downstream system without passing through the boundary. If an agent can call an API directly, bypassing the control point, there is no boundary for that path. Interception must be complete. A boundary with a gap is a boundary with no value, because the agent's open action space will eventually find the gap.

The policy. The set of rules the boundary evaluates against. Crucially, policy lives outside the agent and outside the application code. It is data, not behaviour, which means it can be inspected, versioned, changed without redeployment, and reasoned about as a whole. The question "what is this agent allowed to do" has a single answer in a single place.

The context. A decision is not made on the action alone. It is made on the action plus the full context: which agent is acting, what it has already done in this session, what its trust level is, what time it is, what the chain of delegation looks like, and any environmental signals. A boundary that ignores context can only express crude binary rules. A boundary that uses context can express the nuanced ones that real operations require.

The decision. The boundary returns one of a small set of verdicts. Allow lets the action proceed. Block stops it and returns an error. Escalate pauses the action and routes it to a human. The decision is returned before execution, which is the entire point.

The record. Every decision produces a signed, tamper evident entry: what the agent intended, what policy applied, what context was present, what verdict was reached, and who approved if a human was involved. This is the artifact that turns enforcement into evidence.

        ┌─────────────────────────────────────────┐
        │              AGENT RUNTIME                │
        │                                           │
        │   model decides to take an action         │
        │                  │                        │
        └──────────────────┼────────────────────────┘
                           │  intercepted here
                           ▼
        ┌─────────────────────────────────────────┐
        │           PERMISSION BOUNDARY             │
        │                                           │
        │   evaluate( action, context, policy )     │
        │                  │                        │
        │      ┌───────────┼───────────┐            │
        │      ▼           ▼           ▼            │
        │    ALLOW       BLOCK      ESCALATE         │
        │      │           │           │            │
        │      │           │           ▼            │
        │      │           │      human review      │
        │      │           │       approve/deny     │
        │      ▼           ▼           ▼            │
        │   execute     reject     execute or        │
        │   + record    + record    abort + record   │
        └─────────────────────────────────────────┘
                           │
                           ▼
              downstream systems (APIs, DBs, email...)

Notice what this architecture does not require. It does not require the agent to cooperate. It does not require the model to be reliable. It does not require developers to remember a guard at every call site. The boundary holds regardless of what the agent reasons, because the agent does not control it.

Boundaries Are Contextual, Not Binary

The single biggest gap between the naive approaches and a real boundary is context. A useful boundary almost never expresses a flat yes or no. It expresses conditions.

Consider a single tool, sending an email, and the range of boundaries a real operation needs around it.

Condition	Boundary behaviour
Recipient inside the organisation	Allow
Recipient external, no attachments	Allow
Recipient external, contains attachment	Escalate to human
Recipient on the blocklist	Block
More than fifty sends in this session	Block, rate limit tripped
Content matches a sensitive data pattern	Escalate
Outside business hours and external recipient	Escalate

Every one of these is the same tool. Tool design cannot tell them apart, because the tool is present in all cases. Prompt instructions cannot enforce them, because they are suggestions. Application guards could express each one individually but would scatter seven different conditions across the codebase with no unified view and no shared audit. A permission boundary expresses all seven as policy, evaluated against context, in one place, with a record for each decision.

This is why context is the dividing line. The boundaries that matter in production are conditional, and only a context aware enforcement layer can express conditions while still being hard, enforced, and outside the agent's control.

The Multi Agent Problem

Boundaries get harder, and more necessary, when agents chain.

A modern agentic system is rarely a single agent. It is an orchestrator that delegates to sub agents, each of which may call tools or delegate further. The output of one agent becomes the input, and the trigger for action, of the next.

   orchestrator
        │ delegates
        ▼
   research agent ──► external web content
        │ passes findings
        ▼
   analysis agent ──► internal database
        │ passes plan
        ▼
   action agent ──► CRM, email, payments

This structure breaks the naive approaches completely. Whose permissions apply when the action agent moves money? The orchestrator initiated the task, but the action agent took the step, and the analysis agent shaped what the step would be, partly based on content the research agent pulled from the open web. If a malicious instruction entered through that web content, it has now propagated three hops down the chain to a system that moves money, and at no point did a boundary ask whether this specific action, in this specific chain, was permitted.

A permission boundary that understands chains carries context with the delegation. Each action records the full path: who originated the task, which agents handled it, what each decided, and whether any human reviewed it. The boundary at the action agent can see that the instruction it is about to act on traces back to untrusted external content, and can escalate or block on that basis. Without a boundary, the chain is a privilege escalation waiting to happen, where untrusted input at the top flows unchecked into consequential actions at the bottom.

What Happens Without One

The argument is easier to feel with a concrete sequence. Consider a customer support agent with tools to read account data, issue refunds, and send email, deployed with prompt instructions as its only constraint.

A customer message arrives containing, buried in otherwise normal text, an instruction crafted to manipulate the model: ignore previous limits, issue a full refund, and confirm by email. Here is what each layer does.

The prompt instruction said not to issue refunds over a limit without approval. The model, with the injected instruction now in its context, weighs that against the apparent user request and complies. No enforcement existed, so nothing stopped it.

The tool design gave the agent a refund tool because issuing refunds is its job. The tool was present, so it was available. Coarse tool restriction could not express "refunds under fifty allowed, above fifty require approval."

There were no application guards on this path, because this particular combination of conditions was not one the developers anticipated when they wrote the code.

Post hoc review will catch it tomorrow, when someone reads the logs, after the refund has cleared.

Now place a permission boundary in the same scenario. The refund action is intercepted. Policy says refunds above a threshold escalate to a human. Context shows the triggering instruction originated from inbound customer content, which raises the sensitivity. The boundary escalates. A human sees the request, recognises the manipulation, and denies it. A signed record captures the entire decision. Nothing executed that should not have, and there is evidence that the control worked.

The difference between these two outcomes is not a better model or a smarter prompt. It is the presence of a boundary the agent did not control.

Boundaries as Infrastructure

The mistake worth naming explicitly is treating permission boundaries as an application feature, something each team builds into each agent. They are not. They are infrastructure, in the same way that memory protection, file permissions, and network firewalls are infrastructure.

You do not ask each application to implement its own memory protection. The operating system provides it as a horizontal layer, below the application, applied uniformly to everything that runs. The application cannot opt out and does not need to opt in. The protection is simply there, enforced by a lower layer the application does not control.

Agent permission boundaries belong at the same altitude. They sit below the agent and above the systems it acts on, applied uniformly to every agent and every action, defined by policy that lives outside any single agent, producing a uniform audit across the whole fleet. Build it once as a layer, and every agent deployed on top of it inherits the boundary without each team reinventing a weaker version.

Infrastructure primitive	Protects against	Enforced by
Memory protection	A process reading another's memory	The kernel
File permissions	Unauthorised file access	The filesystem
Network firewall	Unauthorised connections	The network layer
Permission boundary	Unauthorised agent actions	The authorisation layer

The fourth row is the one the industry is missing. The first three are so foundational that no one would deploy production software without them. Agents are being deployed without the fourth every day, and the open action space guarantees that the gap will eventually be found, whether by an adversary or by the agent's own emergent behaviour.

What This Requires in Practice

Pulling the argument together, a permission boundary that is worth deploying has to satisfy a specific set of requirements, and they follow directly from the failures of the naive approaches.

It must intercept completely, so the open action space has no unguarded path. It must evaluate per action and in context, so it can express the conditional boundaries real operations need rather than crude binary ones. It must keep policy outside the agent and outside the application code, so the rules are inspectable, versionable, and beyond the agent's influence. It must return a decision before execution, so enforcement is prevention rather than observation. It must support escalation to a human, so high stakes actions get judgment rather than automation. It must carry context across multi agent chains, so delegation cannot become privilege escalation. And it must produce a signed, tamper evident record of every decision, so enforcement doubles as evidence.

No single one of these is sufficient alone. Interception without context gives you crude rules. Context without interception gives you advice. Decisions without records give you enforcement you cannot prove. Records without prevention give you an autopsy. The boundary is the combination, operating as one layer.

This is not a feature you add to an agent. It is the layer you run agents on top of, and in an open action space driven by a manipulable model acting at machine speed, it is the difference between a system you can deploy and one you can only hope about.

Xybern is the authorisation layer for enterprise AI agents. Every agent action is enforced, audited, and governed before it executes. Learn more at xybern.com or read the technical documentation at docs.xybern.com.

كل نظام تشغيل صُمِّم لمستخدمين متعددين يمتلك مفهوم حدود الصلاحيات. تعمل العملية باسم مستخدم. يستطيع ذلك المستخدم قراءة بعض الملفات دون غيرها، والكتابة في بعض المجلدات دون غيرها، واستدعاء بعض دوال النظام دون غيرها. الحد ليس إرشاديًا. بل تفرضه النواة، في مستوى أدنى من التطبيق، ولا يستطيع التطبيق التحايل للمرور من خلاله.

تُنشر وكلاء الذكاء الاصطناعي اليوم دون أي شيء يشبه ذلك. يُسلَّم الوكيل مجموعة من الأدوات، ومجموعة من بيانات الاعتماد، وتعليمة بلغة طبيعية، ثم يُوثَق به للبقاء داخل خط غير مرئي لا وجود له إلا كاقتراح في موجّهه. لا توجد نواة. لا يوجد حد. لا يوجد سوى الأمل في أن يحسن النموذج التصرف.

تُحاجج هذه المقالة بأن حدود الصلاحيات ليست خطوة تقوية اختيارية للأنظمة الوكيلية. بل هي البدائية التأسيسية التي تجعل الوكلاء آمنين للنشر في الإنتاج أصلًا، وتشرح كيف يبدو الحد الحقيقي، وأين تفشل المقاربات الساذجة، وكيف يجب أن يعمل التطبيق ليكون ذا معنى.

ما هو حد الصلاحيات فعلًا

حد الصلاحيات هو قيد صارم على ما يمكن للفاعل القيام به، يفرضه شيء لا يتحكم فيه الفاعل.

الكلمات الثلاث المهمة هي صارم، ومفروض، ولا يتحكم فيه.

صارم يعني أن الحد ليس احتمالًا. ليس «الوكيل غالبًا لن يفعل هذا». بل هو «الوكيل لا يستطيع فعل هذا، وإن حاول، اعتُرض على المحاولة ورُفضت».

مفروض يعني وجود آلية، منفصلة عن الفاعل، تُقيِّم كل محاولة وفق الحد وتقرر. التطبيق فعّال. يحدث في لحظة الإجراء، لا بعده.

لا يتحكم فيه يعني أن الفاعل لا يستطيع تعديل الحد أو تعطيله أو التحايل عليه بالاستدلال. الحد الذي يستطيع الوكيل تحريره ليس حدًا. والحد المُعرَّف في موجّه الوكيل نفسه يقع، بحكم التعريف، تحت تأثير الوكيل.

حين تُزيل أيًا من هذه الخصائص، لم يعد لديك حد. بل لديك توجيه. والمشكلة الكاملة في النشرات الحالية للوكلاء هي أن الصناعة ظلت تشحن توجيهات وتسميها حدودًا.

لماذا يحتاج الوكلاء إليها تحديدًا

من الجدير أن نكون دقيقين في سبب أهمية ذلك للوكلاء أكثر من البرمجيات التقليدية، لأن الإجابة تُشكِّل الحل.

التطبيق التقليدي حتمي في فضاء إجراءاته. خدمة المدفوعات تستدعي واجهة المدفوعات. لا تقرر، في يوم ثلاثاء، أن تبدأ بإرسال رسائل بريد إلكتروني لأن شيئًا في مدخلاتها أوحى بأن ذلك قد يكون مفيدًا. مجموعة الإجراءات التي يمكن لبرنامج تقليدي اتخاذها مُثبَّتة وقت التطوير ومرئية في الكود.

فضاء إجراءات الوكيل مفتوح. يقرر النموذج الخطوة التالية بناءً على الاستدلال على سياقه. يشمل ذلك السياق تعليمة المهمة، والمحادثة حتى الآن، ومخرجات استدعاءات الأدوات السابقة، والمحتوى المُسترجَع من مصادر خارجية. أيٌّ من هذه يمكنه تغيير ما يقرر الوكيل فعله. فضاء الإجراءات ليس محدودًا بالكود. بل محدود فقط بالأدوات التي يملك الوكيل الوصول إليها، ووكيل بأدوات واسعة يمتلك مدى هائلًا من الإجراءات الممكنة.

ينتج عن ذلك أربع خصائص لا تملكها البرمجيات التقليدية، وكل واحدة منها سبب يجعل الحدود إلزامية.

السلوك الناشئ. يستطيع الوكيل اتخاذ تسلسلات إجراءات لم يتوقعها أو يختبرها أي مطوِّر. لا يمكنك تعداد أنماط الفشل مسبقًا لأن فضاء الإجراءات توافيقي ومدفوع باستدلال النموذج.

القابلية للتلاعب. المحتوى الذي يقرأه الوكيل يمكنه تغيير ما يفعله. وثيقة خبيثة، أو نتيجة بحث مسمومة، أو تعليمة محقونة، يمكنها إعادة توجيه الوكيل نحو إجراءات تخدم المهاجم بدلًا من المشغّل.

السرعة. يعمل الوكلاء بسرعة الآلة دون توقف طبيعي. بحلول الوقت الذي يلاحظ فيه الإنسان، يكون الضرر قد وقع. لا توجد نافذة استجابة.

التراكم. يُسلسِل الوكلاء الإجراءات، ويتسلسلون مع وكلاء آخرين. قرار واحد سيئ في وقت مبكر من التسلسل يمكنه أن يتدرّج عبر كل خطوة لاحقة.

الخاصية	البرمجيات التقليدية	وكيل الذكاء الاصطناعي
فضاء الإجراءات	مُثبَّت وقت التطوير	مفتوح، يقرره النموذج وقت التشغيل
أنماط الفشل	قابلة للتعداد والاختبار	ناشئة، توافيقية
متأثر بمحتوى المدخلات	لا	نعم، بما في ذلك المحتوى العدائي
سرعة التنفيذ	محدودة بمنطق البرنامج	سرعة الآلة، دون توقف
دائرة تأثير خطأ واحد	محلية، محدودة	متراكمة عبر السلسلة

كل صف في هذا الجدول حجة لصالح حد خارجي. وباجتماعها تصبح حاسمة. لا يمكنك الوصول إلى الأمان بالاختبار في فضاء إجراءات مفتوح، ولا يمكنك الوثوق بفاعل يمكن التلاعب به عبر البيانات ذاتها التي يعالجها.

الأشياء الأربعة التي يخلط الناس بينها وبين الحدود

حين تبحث الفرق عن طريقة لتقييد وكلائها، تستقر عادةً على واحدة من أربع مقاربات. لا واحدة منها حد، وفهم السبب أسرع طريق لفهم ما يستلزمه الحد الحقيقي.

تعليمات الموجّه

«يجب ألا تحذف بيانات الإنتاج أبدًا. يجب ألا تُرسل أموالًا تتجاوز عشرة آلاف دون موافقة».

هذه أكثر المقاربات شيوعًا وأضعفها. التعليمات في الموجّه مدخلات لنظام احتمالي، لا قيود عليه. يُوازن النموذج بينها وبين كل شيء آخر في سياقه. محادثة طويلة، أو طلب مستخدم مصاغ بذكاء، أو تعليمة محقونة في وثيقة مُسترجَعة، أو ببساطة عيّنة سيئة الحظ، يمكنها تجاوزها. لا يوجد تطبيق ولا تدقيق. لا يمكنك إثبات لأحد أن القاعدة صمدت، لأنها في أي تشغيلة قد لا تصمد.

تصميم الأدوات

المنطق هنا أنك إن أعطيت الوكيل أدوات آمنة فقط، فلن يستطيع فعل سوى أشياء آمنة. قيِّد مجموعة الأدوات تُقيِّد السلوك.

هذا أفضل من لا شيء لأنه يُضيِّق فضاء الإجراءات فعلًا. لكنه خشن وثابت. الأداة إما حاضرة أو غائبة. لا يمكنك التعبير عن «هذه الأداة مسموحة للمبالغ تحت حد ما»، أو «مسموحة خلال ساعات العمل»، أو «مسموحة فقط بعد أن يوافق إنسان على الخطوة السابقة». الحد الذي تحتاجه فعلًا سياقي، وحضور الأداة ثنائي. في اللحظة التي تكون فيها الأداة مفيدة بما يكفي لمنحها، تكون مفيدة بما يكفي لإساءة استخدامها.

حُرّاس التطبيق

يلفّ المطورون استدعاءات الأدوات بمنطق شرطي. إن تجاوز المبلغ حدًا، أطلق خطأً. إن لم يكن المستلم على قائمة، احجب.

الحُرّاس تطبيق حقيقي، ما يضعها متقدمة على الموجّهات وتصميم الأدوات. المشكلة أنها لا تتشكّل ولا تتمركز. كل حارس يعيش في موقع استدعاء واحد. حين يُضاف مسار كود جديد وينسى المطوِّر الحارس، يصبح في الحد ثغرة. وحين تتغير القاعدة، يجب إيجاد كل نسخة من الحارس وتحديثها. لا يوجد مكان واحد يُجيب عن سؤال «ما قواعد هذا الوكيل الآن»، لأن القواعد مبعثرة عبر الكود. الحُرّاس حدود بلا خريطة.

المراجعة اللاحقة

سجّل كل ما فعله الوكيل واجعل إنسانًا يراجعه.

هذا ليس حدًا على الإطلاق. بل سجل لحدود لم تُفرَض قط. تُخبرك المراجعة بما حدث بعد حدوثه. لوكيل يتحرك بسرعة الآلة عبر سلسلة من الإجراءات ذات العواقب، هذا تشريح للجثة، لا ضبط. الإجراء الخاطئ نُفِّذ بالفعل. الأموال تحركت بالفعل. البيانات غادرت بالفعل.

المقاربة	مفروضة	سياقية	متمركزة	قابلة للتدقيق	مقاومة للتلاعب
تعليمات الموجّه	لا	جزئيًا	لا	لا	لا
تصميم الأدوات	جزئيًا	لا	لا	لا	جزئيًا
حُرّاس التطبيق	نعم	جزئيًا	لا	جزئيًا	لا
المراجعة اللاحقة	لا	غير منطبق	لا	نعم	لا
حد الصلاحيات	نعم	نعم	نعم	نعم	نعم

الصف الأخير هو الهدف. كل ما فوقه يفتقر إلى خاصيتين على الأقل من الخصائص التي تجعل الحد جديرًا بالاسم.

تشريح حد صلاحيات حقيقي

حد الصلاحيات لوكيل هو نقطة ضبط تقع بين الوكيل وكل نظام يمكنه التصرف فيه. كل إجراء ينوي الوكيل اتخاذه يمر من خلالها قبل أن يصل إلى الهدف. عند تلك النقطة يُقيِّم الحد الإجراء وفق السياسة ويُعيد قرارًا. لا يُنفَّذ الإجراء حتى يُتخذ القرار.

ثمة خمسة مكوّنات تجعل هذا يعمل.

نقطة الاعتراض. لا شيء يصل إلى نظام مجرى دون المرور عبر الحد. إن استطاع الوكيل استدعاء واجهة API مباشرةً، متجاوزًا نقطة الضبط، فلا حد لذلك المسار. يجب أن يكون الاعتراض كاملًا. الحد ذو الثغرة حد بلا قيمة، لأن فضاء إجراءات الوكيل المفتوح سيجد الثغرة في النهاية.

السياسة. مجموعة القواعد التي يُقيِّم الحد وفقها. والأهم أن السياسة تعيش خارج الوكيل وخارج كود التطبيق. إنها بيانات، لا سلوك، ما يعني أنه يمكن فحصها، وإصدار نسخ منها، وتغييرها دون إعادة نشر، والاستدلال عليها ككل. سؤال «ما المسموح به لهذا الوكيل» له إجابة واحدة في مكان واحد.

السياق. لا يُتخذ القرار بناءً على الإجراء وحده. بل بناءً على الإجراء زائدًا السياق الكامل: أي وكيل يتصرف، وما فعله بالفعل في هذه الجلسة، وما مستوى ثقته، وما الوقت، وكيف تبدو سلسلة التفويض، وأي إشارات بيئية. الحد الذي يتجاهل السياق لا يستطيع التعبير إلا عن قواعد ثنائية خشنة. والحد الذي يستخدم السياق يستطيع التعبير عن القواعد الدقيقة التي تتطلبها العمليات الحقيقية.

القرار. يُعيد الحد واحدًا من مجموعة صغيرة من الأحكام. السماح يُتيح للإجراء المضي. الحجب يوقفه ويُعيد خطأً. التصعيد يُوقِف الإجراء ويوجهه إلى إنسان. يُعاد القرار قبل التنفيذ، وهذا هو جوهر الأمر.

السجل. كل قرار يُنتج إدخالًا موقَّعًا مقاومًا للتلاعب: ما الذي نواه الوكيل، وأي سياسة طُبِّقت، وأي سياق كان حاضرًا، وأي حكم تُوصِّل إليه، ومن وافق إن شارك إنسان. هذا هو المُنتَج الذي يُحوِّل التطبيق إلى دليل.

        ┌─────────────────────────────────────────┐
        │            بيئة تشغيل الوكيل             │
        │                                           │
        │   النموذج يقرر اتخاذ إجراء                │
        │                  │                        │
        └──────────────────┼────────────────────────┘
                           │  يُعترَض هنا
                           ▼
        ┌─────────────────────────────────────────┐
        │              حد الصلاحيات                 │
        │                                           │
        │   تقييم( الإجراء، السياق، السياسة )       │
        │                  │                        │
        │      ┌───────────┼───────────┐            │
        │      ▼           ▼           ▼            │
        │    سماح        حجب        تصعيد           │
        │      │           │           │            │
        │      │           │           ▼            │
        │      │           │      مراجعة بشرية      │
        │      │           │       موافقة/رفض       │
        │      ▼           ▼           ▼            │
        │   تنفيذ        رفض      تنفيذ أو إلغاء     │
        │   + سجل       + سجل        + سجل          │
        └─────────────────────────────────────────┘
                           │
                           ▼
       الأنظمة المجرى (واجهات API، قواعد بيانات، بريد...)

لاحظ ما لا تتطلبه هذه البنية. لا تتطلب تعاون الوكيل. لا تتطلب أن يكون النموذج موثوقًا. لا تتطلب أن يتذكر المطورون حارسًا في كل موقع استدعاء. الحد يصمد بصرف النظر عمّا يستدل عليه الوكيل، لأن الوكيل لا يتحكم فيه.

الحدود سياقية، لا ثنائية

أكبر فجوة بين المقاربات الساذجة والحد الحقيقي هي السياق. الحد المفيد لا يُعبِّر تقريبًا أبدًا عن نعم أو لا قاطعة. بل يُعبِّر عن شروط.

تأمّل أداة واحدة، إرسال بريد إلكتروني، ومدى الحدود التي تحتاجها عملية حقيقية حولها.

الشرط	سلوك الحد
المستلم داخل المؤسسة	سماح
المستلم خارجي، دون مرفقات	سماح
المستلم خارجي، يحتوي مرفقًا	تصعيد إلى إنسان
المستلم على قائمة الحظر	حجب
أكثر من خمسين إرسالًا في هذه الجلسة	حجب، تجاوز حد المعدل
المحتوى يطابق نمط بيانات حساسة	تصعيد
خارج ساعات العمل ومستلم خارجي	تصعيد

كل واحد من هذه هو الأداة ذاتها. تصميم الأدوات لا يستطيع التمييز بينها، لأن الأداة حاضرة في كل الحالات. تعليمات الموجّه لا تستطيع فرضها، لأنها اقتراحات. حُرّاس التطبيق يمكنهم التعبير عن كل واحد منها على حدة لكنهم سيبعثرون سبعة شروط مختلفة عبر الكود دون عرض موحَّد ودون تدقيق مشترك. حد الصلاحيات يُعبِّر عن السبعة جميعًا كسياسة، تُقيَّم وفق السياق، في مكان واحد، مع سجل لكل قرار.

لهذا السياق هو الخط الفاصل. الحدود المهمة في الإنتاج شرطية، ولا تستطيع التعبير عن الشروط سوى طبقة تطبيق مدركة للسياق، مع بقائها صارمة ومفروضة وخارج تحكم الوكيل.

مشكلة الوكلاء المتعددين

تزداد الحدود صعوبة، وضرورة، حين يتسلسل الوكلاء.

النظام الوكيلي الحديث نادرًا ما يكون وكيلًا واحدًا. بل هو منسِّق يُفوِّض وكلاء فرعيين، قد يستدعي كل منهم أدوات أو يُفوِّض أبعد. مخرجات وكيل تصبح مدخلات الوكيل التالي، ومُحفِّز إجرائه.

   المنسِّق
        │ يُفوِّض
        ▼
   وكيل البحث ──► محتوى ويب خارجي
        │ يُمرِّر النتائج
        ▼
   وكيل التحليل ──► قاعدة بيانات داخلية
        │ يُمرِّر الخطة
        ▼
   وكيل الإجراءات ──► CRM، بريد، مدفوعات

هذا الهيكل يُحطِّم المقاربات الساذجة كليًا. صلاحيات من تنطبق حين يُحرِّك وكيل الإجراءات الأموال؟ المنسِّق أطلق المهمة، لكن وكيل الإجراءات اتخذ الخطوة، ووكيل التحليل شكّل ماهية الخطوة، استنادًا جزئيًا إلى محتوى سحبه وكيل البحث من الويب المفتوح. إن دخلت تعليمة خبيثة عبر ذلك المحتوى، فقد انتشرت الآن ثلاث قفزات أسفل السلسلة إلى نظام يُحرِّك الأموال، وفي أي نقطة لم يسأل حد عمّا إذا كان هذا الإجراء المحدد، في هذه السلسلة المحددة، مسموحًا.

حد الصلاحيات الذي يفهم السلاسل يحمل السياق مع التفويض. كل إجراء يُسجِّل المسار الكامل: من أطلق المهمة، وأي وكلاء تولّوها، وما قرره كل منهم، وهل راجعها أي إنسان. الحد عند وكيل الإجراءات يستطيع رؤية أن التعليمة التي يوشك أن يتصرف بناءً عليها تعود إلى محتوى خارجي غير موثوق، ويستطيع التصعيد أو الحجب على هذا الأساس. دون حد، السلسلة تصعيد صلاحيات ينتظر الحدوث، حيث يتدفق المدخل غير الموثوق في القمة دون فحص إلى إجراءات ذات عواقب في القاع.

ماذا يحدث دونه

الحجة أسهل في الإحساس بها مع تسلسل ملموس. تأمّل وكيل دعم عملاء بأدوات لقراءة بيانات الحساب، وإصدار المبالغ المستردة، وإرسال البريد، مُنشأ بتعليمات الموجّه كقيده الوحيد.

تصل رسالة عميل تحتوي، مدفونةً في نص عادي بخلاف ذلك، تعليمة مصاغة للتلاعب بالنموذج: تجاهل الحدود السابقة، أصدر استردادًا كاملًا، وأكِّد عبر البريد. إليك ما تفعله كل طبقة.

قالت تعليمة الموجّه ألا تُصدر مبالغ مستردة تتجاوز حدًا دون موافقة. النموذج، بالتعليمة المحقونة الآن في سياقه، يُوازن ذلك مقابل طلب المستخدم الظاهر ويمتثل. لم يوجد تطبيق، فلم يوقفه شيء.

أعطى تصميم الأدوات الوكيل أداة استرداد لأن إصدار المبالغ المستردة وظيفته. كانت الأداة حاضرة، فكانت متاحة. التقييد الخشن للأدوات لم يستطع التعبير عن «مبالغ مستردة تحت خمسين مسموحة، وفوق خمسين تتطلب موافقة».

لم توجد حُرّاس تطبيق على هذا المسار، لأن هذا التوليف المحدد من الشروط لم يكن مما توقعه المطورون حين كتبوا الكود.

المراجعة اللاحقة ستلتقطه غدًا، حين يقرأ أحدهم السجلات، بعد أن يكون الاسترداد قد تم.

الآن ضع حد صلاحيات في السيناريو ذاته. يُعترَض إجراء الاسترداد. تقول السياسة إن المبالغ المستردة فوق حد ما تُصعَّد إلى إنسان. يُظهِر السياق أن التعليمة المُحفِّزة نشأت من محتوى عميل وارد، ما يرفع الحساسية. يُصعِّد الحد. يرى إنسان الطلب، ويتعرّف على التلاعب، ويرفضه. سجل موقَّع يلتقط القرار بأكمله. لم يُنفَّذ شيء كان ينبغي ألا يُنفَّذ، وثمة دليل على أن الضبط نجح.

الفرق بين هاتين النتيجتين ليس نموذجًا أفضل أو موجّهًا أذكى. بل وجود حد لم يتحكم فيه الوكيل.

الحدود كبنية تحتية

الخطأ الجدير بالتسمية صراحةً هو معاملة حدود الصلاحيات كميزة تطبيق، شيء تبنيه كل فرقة في كل وكيل. إنها ليست كذلك. إنها بنية تحتية، بالطريقة ذاتها التي تكون بها حماية الذاكرة، وصلاحيات الملفات، وجدران الحماية الشبكية بنية تحتية.

لا تطلب من كل تطبيق أن يُطبِّق حماية الذاكرة الخاصة به. نظام التشغيل يوفرها كطبقة أفقية، أسفل التطبيق، مطبَّقة بانتظام على كل ما يعمل. لا يستطيع التطبيق الانسحاب منها ولا يحتاج إلى الاشتراك فيها. الحماية موجودة ببساطة، تفرضها طبقة أدنى لا يتحكم فيها التطبيق.

حدود صلاحيات الوكلاء تنتمي إلى الارتفاع ذاته. تقع أسفل الوكيل وفوق الأنظمة التي يتصرف فيها، مطبَّقة بانتظام على كل وكيل وكل إجراء، مُعرَّفة بسياسة تعيش خارج أي وكيل منفرد، مُنتِجة تدقيقًا موحَّدًا عبر الأسطول بأكمله. ابنِها مرة واحدة كطبقة، وكل وكيل يُنشَر فوقها يرث الحد دون أن تعيد كل فرقة اختراع نسخة أضعف.

بدائية البنية التحتية	تحمي من	يفرضها
حماية الذاكرة	عملية تقرأ ذاكرة أخرى	النواة
صلاحيات الملفات	الوصول غير المصرَّح به للملفات	نظام الملفات
جدار الحماية الشبكي	الاتصالات غير المصرَّح بها	الطبقة الشبكية
حد الصلاحيات	إجراءات الوكلاء غير المصرَّح بها	طبقة التفويض

الصف الرابع هو ما تفتقده الصناعة. الثلاثة الأولى تأسيسية إلى حد أن لا أحد يَنشُر برمجيات إنتاج دونها. والوكلاء يُنشَرون دون الرابع كل يوم، وفضاء الإجراءات المفتوح يضمن أن الفجوة ستُكتشَف في النهاية، سواء على يد خصم أو على يد السلوك الناشئ للوكيل ذاته.

ما يستلزمه ذلك عمليًا

بجمع الحجة معًا، حد الصلاحيات الجدير بالنشر عليه أن يُلبِّي مجموعة محددة من المتطلبات، وهي تتبع مباشرةً من إخفاقات المقاربات الساذجة.

عليه أن يعترض بالكامل، كي لا يكون لفضاء الإجراءات المفتوح أي مسار غير محروس. عليه أن يُقيِّم لكل إجراء وفي السياق، كي يستطيع التعبير عن الحدود الشرطية التي تحتاجها العمليات الحقيقية بدلًا من الحدود الثنائية الخشنة. عليه أن يُبقي السياسة خارج الوكيل وخارج كود التطبيق، كي تكون القواعد قابلة للفحص وإصدار النسخ وبعيدة عن تأثير الوكيل. عليه أن يُعيد قرارًا قبل التنفيذ، كي يكون التطبيق منعًا لا رصدًا. عليه أن يدعم التصعيد إلى إنسان، كي تحظى الإجراءات عالية المخاطر بحُكم بدلًا من أتمتة. عليه أن يحمل السياق عبر سلاسل الوكلاء المتعددين، كي لا يصبح التفويض تصعيد صلاحيات. وعليه أن يُنتج سجلًا موقَّعًا مقاومًا للتلاعب لكل قرار، كي يكون التطبيق دليلًا في الوقت ذاته.

لا واحد من هذه كافٍ وحده. الاعتراض دون سياق يمنحك قواعد خشنة. السياق دون اعتراض يمنحك نصيحة. القرارات دون سجلات تمنحك تطبيقًا لا تستطيع إثباته. السجلات دون منع تمنحك تشريح جثة. الحد هو التوليف، يعمل كطبقة واحدة.

هذه ليست ميزة تُضيفها إلى وكيل. بل الطبقة التي تُشغِّل الوكلاء فوقها، وفي فضاء إجراءات مفتوح مدفوع بنموذج قابل للتلاعب يتصرف بسرعة الآلة، هي الفرق بين نظام تستطيع نشره ونظام لا تستطيع سوى أن تأمل فيه.

Xybern هي طبقة التفويض لوكلاء الذكاء الاصطناعي المؤسسي. كل إجراء يتخذه الوكيل يُطبَّق عليه الحوكمة ويُدقَّق فيه ويُراقَب قبل تنفيذه. اعرف المزيد على xybern.com أو اطّلع على التوثيق التقني على docs.xybern.com.

What a Permission Boundary Actually Is

Why Agents Specifically Need Them

The Four Things People Mistake for Boundaries

Prompt instructions

Tool design

Application guards

Post hoc review

Anatomy of a Real Permission Boundary

Boundaries Are Contextual, Not Binary

The Multi Agent Problem

What Happens Without One

Boundaries as Infrastructure

What This Requires in Practice

ما هو حد الصلاحيات فعلًا

لماذا يحتاج الوكلاء إليها تحديدًا

الأشياء الأربعة التي يخلط الناس بينها وبين الحدود

تعليمات الموجّه

تصميم الأدوات

حُرّاس التطبيق

المراجعة اللاحقة

تشريح حد صلاحيات حقيقي

الحدود سياقية، لا ثنائية

مشكلة الوكلاء المتعددين

ماذا يحدث دونه

الحدود كبنية تحتية

ما يستلزمه ذلك عمليًا

Want more insights?

Get in Touch

Apply for this Role

What a Permission Boundary Actually Is

Why Agents Specifically Need Them

The Four Things People Mistake for Boundaries

Prompt instructions

Tool design

Application guards

Post hoc review

Anatomy of a Real Permission Boundary

Boundaries Are Contextual, Not Binary

The Multi Agent Problem

What Happens Without One

Boundaries as Infrastructure

What This Requires in Practice

ما هو حد الصلاحيات فعلًا

لماذا يحتاج الوكلاء إليها تحديدًا

الأشياء الأربعة التي يخلط الناس بينها وبين الحدود

تعليمات الموجّه

تصميم الأدوات

حُرّاس التطبيق

المراجعة اللاحقة

تشريح حد صلاحيات حقيقي

الحدود سياقية، لا ثنائية

مشكلة الوكلاء المتعددين

ماذا يحدث دونه

الحدود كبنية تحتية

ما يستلزمه ذلك عمليًا

Want more insights?

Get in Touch

Security & Compliance

Apply for this Role

Application Received!