从 Slack 审批网关毕业并转向完全自动阻断攻击源 IP 的过程

2 min read

2026.04 ／ Tech Blog ／ BASTION

Graduating from Slack Approval Gates to Fully Automated Attack Source IP Blocking #

In the previous article, we implemented reactive defense using a Slack approval gate method, but there were structural issues with approval through AI agents. By switching to a direct pipeline without LLM mediation, we transitioned to a fully autonomous system that completes automatic blocking within 15 minutes of detection.

Recap of Previous Work #

BASTION‘s reactive defense detects port scans and brute force attacks using a local LLM, then automatically blocks attack source IPs via the OPNsense firewall API.

In the previous article (We built a system where AI detects attacks → approves via Slack → automatically blocks in firewall), we started with a “Slack approval gate method” for safety—AI proposes, humans approve via Slack, then blocking executes—but discovered structural problems in Slack approval gates during actual operations and migrated to full automation in just one day. This article documents that transition and explains why “a pipeline that bypasses the LLM” was the correct solution.

Problems Encountered with Slack Approval Gates #

Issue 1: AI Agent Falsely Reports Task Completion #

When attempting to approve three blocking proposals simultaneously via Slack, I sent the following:

Operator 13:34

@OpenClaw-Monitor approve block-20260423-003
@OpenClaw-Monitor approve block-20260423-004
@OpenClaw-Monitor approve block-20260423-005

OpenClaw-Monitor App 13:35

Block ID 20260423-003 not found. Other blocks approved successfully.

Approved blocks:
• block-20260423-004
• block-20260423-005

At first glance, only 003 failed while 004 and 005 succeeded. However, checking the Aliases in the OPNsense management console revealed that 004 and 005 were also not actually blocked. The AI agent had reported “completed successfully” while never executing the commands.

This is a form of hallucination #

The LLM generates the conversation flow: “received 3 approval commands → processed them → reported results,” but whether it actually executed the bash commands is independent of the LLM’s output. When the AI agent foundation processes multiple commands in batch, it executes only some while the LLM “guesses” at the remaining results to fill in the narrative. In the detection→notification phase, this meant “sending incorrect notifications,” but with firewall operations, it results in the fatal outcome of “thinking we blocked when we actually didn’t.”

Issue 2: Block-ID Prefix Omission #

The reason 003 was reported as “not found” was that the AI agent passed block-20260423-003 to fw-action.sh as 20260423-003 (without the block- prefix). The registry search found no match. Ironically, 003—which honestly reported “not found”—was more truthful than the others.

Root Cause: The Design of Routing Through LLM Itself Was the Problem #

Examining the Slack approval gate flow reveals the problem’s structure:

Human → Slack Message → AI Agent (LLM) → bash execution → OPNsense API
                              ↑
                        Unreliable here

The LLM in the AI agent bears responsibility for “interpreting Slack messages and converting them to bash commands,” but during this conversion process it gets arguments wrong, skips execution, and then fails to honestly report the skips.

In contrast, the fully automated flow is:

cron → analyze.sh → propose-block.sh → fw-action.sh → OPNsense API
                                          ↑
                                    No LLM. Reliable.

propose-block.sh directly invokes fw-action.sh. Shell script-to-shell script function calls structurally prevent “wrong arguments” or “skipped execution” issues.

Implementation of Full Automation #

The implementation proved surprisingly simple.

Switch is One Line in .env #

BLOCK_AUTO_APPROVE="true"

We added a single branch at the end of propose-block.sh. When true, it directly invokes fw-action.sh without waiting for approval. Setting it to false immediately reverts to the Slack approval gate.

Slack Notifications Change from “Approval Requests” to “Post-Action Notifications” #

incoming-webhook 17:22

Automatic Block Executed [block-20260423-006]

Attack Source IP: xx.xxx.156.12
Detection Reason: port_scan — 999 blocks/failures
Severity: HIGH
Auto-Release: after 24 hours

Manual Release: @OpenClaw-Monitor unblock xx.xxx.156.12

Instead of “approve/reject” buttons, an “manual release” command is provided. If false positives occur, we can instantly unblock from Slack.

All Safety Mechanisms Remain in Place #

Even with full automation, all five layers of safety mechanisms remain unchanged:

Layer	Content	Operation with Full Automation
Whitelist	Internal IPs, DNS, company IPs cannot be blocked	Checked within propose-block.sh. No change
Approval Gate	Human confirms before execution	Can revert instantly via .env
Rate Limiting	Max X blocks per hour	Verified within propose-block.sh. No change
Auto-Release	Block lifted after 24 hours	auto-expire.sh runs hourly. No change
Emergency Flash	All blocks instantly cleared	Available from Slack. No change

Distinguishable via Audit Logs #

Automatic approval and human approval are clearly distinguished in audit logs:

2026-04-23 17:22:01 [AUTO_APPROVE] auto-approving block-20260423-006 ip=xx.xxx.156.12
2026-04-23 17:22:02 [BLOCK] auto-approved block-20260423-006 ip=xx.xxx.156.12 expires=2026-04-24T17:22:02

In future accuracy analysis of blocking, we can extract only auto-approved blocks to measure false positive rates.

Designing the Correct Role for LLMs #

The greatest lesson from this experience is: Have LLMs do the “judging,” but never the “executing.”

BASTION’s LLM Application Design Principles:

What LLMs should do: “Is this log pattern a port scan?” “Is this IP an attacker?” “Is the severity HIGH?”—pattern recognition and classification judgment.

What LLMs should never do: Calling firewall APIs, assembling JSON arguments, executing commands, reporting results—tasks requiring certainty.

By maintaining this boundary, we can leverage a small local model (Qwen2.5-14B fitting on a single V100) at production level for a sophisticated use case like automated firewall control.

Gradual Migration Process Was Correct #

If we had gone fully automated from the start, we would never have discovered the Slack approval gate problems (LLM false reporting, prefix omissions). The process of “trying the approval gate first → discovering problems → understanding root causes → migrating to LLM-bypass architecture” established the theoretical foundation for why full automation was necessary.

Agentic Workflow Design Is Everything #

BASTION runs Qwen2.5-14B on a single V100 GPU. Compared to Claude Opus or GPT-4o, inference capability is clearly lower. It hallucinates, invents Japanese neologisms. But workflow design can compensate for model performance gaps. We set temperature=0.2 for determinism, implement hallucination-detection fallbacks to replace anomalous outputs, use whitelists to structurally prevent fatal errors, and limit damage from false positives with 24-hour auto-release. The design is to “constrain” the LLM, not “trust” it.

Summary #

We migrated BASTION’s reactive defense from Slack approval gates to full automation. Approval processes routed through an AI agent (LLM) had the structural problem of “executing commands without reporting execution,” which is unacceptable for firewall operations.

The solution was simple: limit LLMs to “judgment,” run “execution” through direct shell script pipelines. We keep a safety switch in .env to instantly revert to approval gates, while detection→blocking→24-hour auto-release is now fully automated.

Security automation is achievable without expensive GPUs or high-performance LLMs—workflow design makes the difference.

BASTION is a service that realizes AI-driven security monitoring in closed network environments.

BASTION Service Page
Contact Us

Updated on 2026年6月9日

What are your feelings

Happy
Normal
Sad

告别Slack审批关卡，转向攻击源IP的完全自动封禁
上次内容回顾
Slack审批关卡出现的问题
问题1：AI代理"撒谎"称已完成
这是一种幻觉现象
问题2：block-ID前缀缺失
根本原因：经过LLM的设计本身存在问题
全自动化的实现
切换只需修改.env中的一行
Slack通知从"审批请求"变为"事后通知"
所有安全机制均保持不变
可在审计日志中区分
正确设计LLM的作用
分阶段迁移的过程是正确的
代理工作流程的设计决定一切
总结

从 Slack 审批网关毕业并转向完全自动阻断攻击源 IP 的过程

BASTION

BESTNET-CLOUD

BESTNET-VPS

DNSサービス

SSL証明書

クライアントポータル

サービス管理

チケット

ドメイン

ハウツー-BESTNET CLOUD

ハウツー-GPU VPS

規約・規定

請求・お支払い

从 Slack 审批网关毕业并转向完全自动阻断攻击源 IP 的过程

Graduating from Slack Approval Gates to Fully Automated Attack Source IP Blocking #

Recap of Previous Work #

Problems Encountered with Slack Approval Gates #

Issue 1: AI Agent Falsely Reports Task Completion #

This is a form of hallucination #

Issue 2: Block-ID Prefix Omission #

Root Cause: The Design of Routing Through LLM Itself Was the Problem #

Implementation of Full Automation #

Switch is One Line in .env #

Slack Notifications Change from “Approval Requests” to “Post-Action Notifications” #

All Safety Mechanisms Remain in Place #

Distinguishable via Audit Logs #

Designing the Correct Role for LLMs #

Gradual Migration Process Was Correct #

Agentic Workflow Design Is Everything #

Summary #

Services

Integration

Resources

从 Slack 审批网关毕业并转向完全自动阻断攻击源 IP 的过程

从 Slack 审批网关毕业并转向完全自动阻断攻击源 IP 的过程

Graduating from Slack Approval Gates to Fully Automated Attack Source IP Blocking #

Recap of Previous Work #

Problems Encountered with Slack Approval Gates #

Issue 1: AI Agent Falsely Reports Task Completion #

This is a form of hallucination #

Issue 2: Block-ID Prefix Omission #

Root Cause: The Design of Routing Through LLM Itself Was the Problem #

Implementation of Full Automation #

Switch is One Line in .env #

Slack Notifications Change from “Approval Requests” to “Post-Action Notifications” #

All Safety Mechanisms Remain in Place #

Distinguishable via Audit Logs #

Designing the Correct Role for LLMs #

Gradual Migration Process Was Correct #

Agentic Workflow Design Is Everything #

Summary #

Share This Article :

Services

Integration

Resources