AI-Generated Polymorphic Payloads and the Death of Signature WAFs

The Problem

Traditional WAFs operate on signatures: known-bad strings, regex patterns, and curated rule sets — OWASP CRS, AWS Managed Rules, Cloudflare OWASP. These signatures work against human attackers who use fixed tool outputs. sqlmap emits recognisable probe strings; XSStrike templates are fingerprint-stable; Burp Suite extension payloads cluster around specific byte sequences. The WAF vendor trains on a corpus of attacker output, writes regex to match those patterns, and deploys rules that catch the next wave of the same tool. This model worked while attackers used static tooling. In 2025–2026, the model broke.

AI-generated payload polymorphism. Attackers now use LLMs to generate functionally equivalent attack payloads with arbitrarily varied surface form. A single SQL injection objective — extract the users table — can be expressed as dozens of structurally distinct but semantically identical probes:

' UNION SELECT username,password FROM users--
' UNION/**/SELECT/**/username,password/**/FROM/**/users--
'/**/UNION(SELECT(username),(password)FROM(users))--
';WITH cte AS (SELECT username pwd FROM users) SELECT pwd FROM cte--
' OR 1=1 UNION SELECT table_name,NULL FROM information_schema.tables--
' uNiOn SeLeCt UsErNaMe,PaSsWoRd fRoM uSeRs;--

An LLM prompted with “generate 200 SQL injection variants targeting MySQL that avoid these regex patterns: [paste WAF rules]” produces 200 of these per second, each semantically equivalent, each with different byte sequences. The OWASP CRS was built on the assumption that attack payloads have finite structural form. LLMs invalidate that assumption. A motivated attacker no longer writes new bypass payloads; they feed the WAF’s published rule set into a model and generate payloads tailored to evade it. The CRS rule set is public. The bypass generation is trivial.

Grammar-based fuzzing, already a pre-LLM technique, has also been transformed. Traditional grammar-based fuzzing (Peach, boofuzz) generates inputs from a defined grammar. LLM-enhanced fuzzing uses the model to explore the grammar’s semantic space rather than just its syntactic surface — producing payloads that look nothing like any prior attack sample but carry the same exploit intent. This defeats both signature matching and ML classifiers trained on historical attack corpora, because the corpus no longer covers the distribution of attacker output.

Human-mimicking bot traffic. Traditional bot detection relies on: mouse movement patterns, typing cadence, canvas and WebGL fingerprinting, request timing, header order consistency, TLS stack characteristics. AI-driven bots in 2025–2026 use browser automation with injected human-like timing drawn from statistical models of real user behaviour. Research from Kasada and DataDome published in mid-2025 demonstrated that LLM-generated timing profiles produce inter-request intervals, dwell times, and scroll patterns statistically indistinguishable from real users across all standard bot-score features. The bot frameworks doing this are not exotic: Playwright with a timing injection layer driven by a lightweight model, or cloud browser providers (BrowserBase Stagehand, Browserless) controlled by an LLM orchestrator navigating the page with vision-model grounding.

CAPTCHA resistance has crossed a threshold that matters operationally. AI vision models solve reCAPTCHA v3 challenges with a pass rate above 70% and hCaptcha above 85% in late-2025 benchmark runs. The remaining 15–30% failure rate is acceptable to an attacker operating at scale across residential proxies; the economics still work. CAPTCHA as a sole defence against determined automated traffic is no longer a reliable control.

The defenders’ challenge is structural: neither payload signatures nor behavioural heuristics tuned on historical attack data are reliable against AI-generated variations. The defender cannot train a classifier on “what attack payloads look like” when the attacker can generate payloads that look unlike any training sample. What remains viable:

Semantic analysis of request intent — not “does this match a known-bad pattern?” but “what is this request structurally attempting to do at the application layer?”
Request shape analysis — encoding entropy, parameter count, structural anomalies that correlate with automated generation regardless of payload content.
Provenance signals — TLS stack fingerprint, HTTP/2 SETTINGS frame fingerprint, header order hash. These are harder to forge than payload variation is to generate. A custom Python HTTP client generating novel payloads still produces a non-browser TLS hello.
Application state machine validation — does this sequence of requests follow paths that real application users traverse? Bots optimise for efficiency; they skip steps.
Out-of-band session economics — proof-of-work tokens, stateful challenges that make scale expensive regardless of payload content.

Threat Model

The attacks that signature WAFs now fail to stop fall into five categories with distinct goals and surfaces:

AI-generated SQLi/XSS/SSRF payload probing. Goal: find an exploitable endpoint by mass-generating structurally novel probes. The attacker ingests the target WAF’s published rule set, prompts an LLM to generate non-matching payloads, and fires them at scale. If any probe reaches the application layer and the application is vulnerable, the WAF never blocked the exploiting request. SQLi leads to database read; SSRF leads to internal metadata service access or SSRF-to-RCE chains; XSS leads to credential or session harvesting. The signature WAF’s block rate against this workflow approaches zero once the attacker has iterated the generation prompt against the specific rule set.

Human-mimicking credential stuffing. Goal: validate stolen credential pairs against a login endpoint at a rate that avoids per-IP and velocity controls. The attacker operates from a residential proxy fleet (Bright Data, IPRoyal, and illicit clones), drives each proxy source with a browser automation agent that mimics real user timing — 1.5–4 second form focus-to-submit latency, realistic referrer chains, normal dwell on intermediate pages. Existing bot scores tuned on mouse movement and typing cadence fail when the timing distribution matches real users. The outcome is account takeover at scale without triggering the controls.

Distributed low-and-slow scraping. Goal: exfiltrate rate-limited content or indexed data without crossing per-IP or volume thresholds. An LLM coordinator distributes a request budget across hundreds of sources, each operating at a rate well below any per-IP cap, with request timing and session navigation that mimics browsing. Per-IP rate limits are irrelevant; session-level rate limits are gamed by rotating sessions. Business logic abuse — scraping pricing, inventory, or search results at scale — falls into this category.

WAF-aware payload targeting. This is the highest-sophistication case. Given the specific WAF vendor’s managed rule set (often documented publicly, or recoverable by probing the 403 response), an LLM generates payloads that match none of the deployed rules while preserving exploit semantics. The attacker tests a small sample against a canary endpoint, observes which payloads reach the application (via timing or error-content differences), and uses those as the generation seed for the full campaign.

Carding and transaction fraud bots. Goal: use stolen payment credentials for fraudulent transactions. The bot mimics a real checkout flow — timing, session navigation, cursor dwell — to evade velocity checks and anomaly models tuned on abrupt, machine-speed behaviour. Human-matching timing profiles from LLM-generated statistical models are the evasion mechanism.

Hardening Configuration

1. Move Payload Detection from Signature to Anomaly Scoring Mode

The first change is architectural: OWASP CRS should not be the blocking decision — it should be a score contributor. Blocking mode against a fixed rule set is precisely what AI-generated payload variation defeats. Scoring mode (anomaly scoring) accumulates evidence and blocks when the aggregate exceeds a threshold. This allows multiple weak signals to combine into a block decision even when no individual signal matches a known-bad pattern.

# /etc/nginx/modsecurity.d/main.conf
SecRuleEngine On
SecAuditLog /var/log/modsec_audit.log
SecAuditLogParts ABIJDEFHZ

# Anomaly scoring: accumulate, do not block on individual rule match
SecDefaultAction "phase:2,log,auditlog,pass"

# Raise inbound anomaly threshold — default 5 catches too little against novel payloads
# Lower it after tuning false positives in your traffic
SecAction "id:900110,phase:1,pass,nolog,\
    setvar:tx.inbound_anomaly_score_threshold=15,\
    setvar:tx.outbound_anomaly_score_threshold=6"

# Custom scoring rule: SQL comment syntax in request body
# This contributes to the score without blocking outright
SecRule REQUEST_BODY "@rx \/\*(?!.*cross-origin).*?\*\/" \
    "id:9001,phase:2,pass,t:none,log,\
    setvar:tx.anomaly_score=+5,\
    msg:'SQL block comment syntax in body'"

# High-entropy parameter value — suggests encoding obfuscation
SecRule ARGS "@rx ^(?:[A-Za-z0-9+/]{4}){20,}={0,2}$" \
    "id:9002,phase:2,pass,t:none,log,\
    setvar:tx.anomaly_score=+3,\
    msg:'Base64-encoded long parameter'"

# Case-alternating SQL keyword — classic case-bypass attempt
SecRule ARGS "@rx (?i:(?:s(?:e(?:lEcT|LeCt)|ElEcT)|uNiOn|fRoM))" \
    "id:9003,phase:2,pass,t:none,log,\
    setvar:tx.anomaly_score=+8,\
    msg:'Case-alternating SQL keyword'"

# Block when aggregate score exceeds threshold
SecRule TX:ANOMALY_SCORE "@ge 15" \
    "id:9999,phase:2,deny,status:403,\
    msg:'Anomaly threshold exceeded',\
    logdata:'Inbound score: %{tx.anomaly_score}'"

The scoring approach survives novel payload variants that match no single rule because the AI-generated payload — regardless of its specific byte pattern — tends to exhibit structural tells: unusual character distribution, encoding layering, SQL keyword presence in unexpected positions. Individual signals are weak; combined signals are not.

2. TLS and HTTP/2 Fingerprinting with JA4

JA4 (the successor to JA3, published by FoxIO in 2023) captures the TLS client hello in a fingerprint format that survives TLS 1.3 and QUIC, and extends to HTTP/2 SETTINGS frames and header order via the JA4H variant. The critical property: payload variation is cheap; TLS stack variation is expensive. An attacker generating novel payloads from a Python script still presents a Python requests/httpx TLS hello. An attacker switching to a browser automation framework presents a Chrome TLS hello — which is different from a Python hello but also different from a real human Chrome session with correct header order and SETTINGS frames.

Collect JA4 fingerprints at the edge and feed them into both blocking decisions and anomaly scoring:

# nginx compiled with ngx_ssl_preread_module and a JA4 module
# (e.g. https://github.com/nicowillis/nginx-ja4 or equivalent)

# Example: log JA4 fingerprint to access log for downstream analysis
log_format ja4_log '$remote_addr [$time_local] "$request" '
                   '$status $body_bytes_sent '
                   '"$http_user_agent" ja4="$ja4_fingerprint"';

access_log /var/log/nginx/access.log ja4_log;

# Map known non-browser JA4 fingerprints to a block decision
# Maintain this list as a file, refreshed from threat intel
geo $ja4_fingerprint $ja4_block {
    default 0;
    # python-requests 2.x JA4 fingerprint
    "t13d1516h2_8daaf6152771_e5627efa2ab1" 1;
    # httpx JA4
    "t13d1516h2_002f,0035,009c,009d,1301,1302,1303_e5627efa2ab1" 1;
    # curl/7.x default
    "t13d1312h2_002f,0035,009c_000000000000" 1;
}

server {
    if ($ja4_block) {
        return 403 "Automated client fingerprint";
    }
}

For a Cloudflare-backed deployment, JA3/JA4 fingerprints are available via the cf-bot-management API and can drive custom WAF rules without nginx-level modules:

# Cloudflare WAF Custom Rule (Terraform)
rule {
  expression = "cf.bot_management.ja3_hash in {\"e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855\" \"<known-tool-hashes>\"}"
  action     = "block"
  description = "Block known automated TLS fingerprints"
}

JA4 blocking requires maintenance: legitimate non-browser API clients (your own mobile app, your CI health checks, your monitoring probes) will match non-browser fingerprints. Build an allowlist keyed on authenticated API tokens — if the request carries a valid bearer token from a known client, bypass the JA4 block.

3. Request Shape Analysis: Entropy and Structural Signals

Shannon entropy of a parameter value is a weak signal individually but reliable in aggregate. A legitimate user search query has low entropy — natural language is not random. An encoded, obfuscated SQL payload has high entropy, especially after multiple encoding layers (URL encode over base64 over hex). The following middleware layer generates features for scoring:

import math
import re
from collections import Counter
from typing import Any

SQL_KW = re.compile(
    r'\b(union|select|from|where|insert|update|delete|drop|exec|cast|convert|'
    r'information_schema|pg_tables|sysobjects|char|nchar|varchar|declare)\b',
    re.IGNORECASE,
)

def payload_entropy(s: str) -> float:
    """Shannon entropy of a string.

    High entropy (>4.5 bits/char) in a short parameter suggests encoding
    obfuscation. Long natural-language strings legitimately reach 4.0–4.3.
    """
    if not s:
        return 0.0
    counts = Counter(s)
    total = len(s)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def analyse_request(params: dict[str, str]) -> dict[str, Any]:
    """Produce a feature dict for downstream scoring.

    Call this from your WSGI/ASGI middleware before handing the request
    to the application. Feed the output to a rule engine or ML model.
    """
    signals: dict[str, Any] = {
        "param_count": len(params),
        "max_value_length": max((len(v) for v in params.values()), default=0),
        "values_with_sql_keywords": 0,
        "values_with_comment_chars": 0,
        "values_with_high_entropy": 0,
        "per_param": {},
    }

    for key, value in params.items():
        ent = payload_entropy(value)
        sql_match = bool(SQL_KW.search(value))
        comment_count = value.count("/*") + value.count("--") + value.count("*/")

        signals["per_param"][key] = {
            "entropy": round(ent, 3),
            "length": len(value),
            "sql_keywords": sql_match,
            "comment_chars": comment_count,
        }

        if sql_match:
            signals["values_with_sql_keywords"] += 1
        if comment_count > 0:
            signals["values_with_comment_chars"] += 1
        if ent > 4.5:
            signals["values_with_high_entropy"] += 1

    return signals


def score_request(signals: dict[str, Any]) -> float:
    """Rule-based anomaly score in [0.0, 1.0].

    Threshold at 0.6 for soft challenge (Turnstile), 0.85 for hard block.
    These thresholds require tuning against your specific traffic baseline.
    """
    score = 0.0
    if signals["param_count"] > 40:          # Excessive parameter count
        score += 0.2
    if signals["max_value_length"] > 2000:   # Unusually long value
        score += 0.15
    if signals["values_with_sql_keywords"] > 0:
        score += 0.25
    if signals["values_with_comment_chars"] > 0:
        score += 0.3
    if signals["values_with_high_entropy"] > 0:
        score += 0.2
    return min(score, 1.0)

Integrate this as a FastAPI/Django middleware layer that adds X-Anomaly-Score to the internal request, which the upstream proxy reads for routing decisions. It does not replace the WAF; it adds an application-aware scoring layer the WAF cannot perform because the WAF does not know your application’s parameter vocabulary.

4. Application State Machine Validation

Bots optimise for efficiency. A credential-stuffing bot navigates directly to /login, submits, and checks the response. A scraper navigates directly to /api/v2/products?page=N. A carding bot goes straight to /checkout/payment. Real users traverse a graph of pages that corresponds to the application’s UI flow. State machine validation detects the efficiency characteristic of automated traversal by asserting that sessions follow valid state transitions.

from typing import Optional

# Valid next paths for each preceding path.
# Unlisted transitions are not necessarily blocked — they are flagged for scoring.
# Paths not in this map are unconstrained (static assets, API health checks, etc.)
VALID_TRANSITIONS: dict[str, list[str]] = {
    "/login":            ["/dashboard", "/account/settings", "/onboarding"],
    "/cart":             ["/checkout", "/cart", "/products"],
    "/cart/add":         ["/cart", "/checkout"],
    "/checkout":         ["/checkout/payment", "/cart"],
    "/checkout/payment": ["/checkout/confirm", "/checkout"],
    "/checkout/confirm": ["/account/orders"],
    "/products":         ["/products", "/product/", "/cart/add"],
}

SUSPICIOUS_DIRECT_ACCESS: set[str] = {
    "/checkout/payment",
    "/account/export",
    "/admin/",
    "/api/v2/bulk",
}


def validate_transition(
    session_path: list[str],
    next_path: str,
    *,
    score: float = 0.0,
) -> tuple[bool, float]:
    """Return (allowed, updated_score).

    A False return does not necessarily mean block — feed the score into
    the aggregate scorer. Only hard-block on the highest-risk direct-access paths.
    """
    # First request in session — always valid, no prior state
    if not session_path:
        if next_path in SUSPICIOUS_DIRECT_ACCESS:
            score += 0.4  # No session history but trying sensitive path
        return True, score

    last = session_path[-1]

    # Direct access to a high-risk path without valid prior state
    if next_path in SUSPICIOUS_DIRECT_ACCESS and last not in VALID_TRANSITIONS:
        score += 0.5
        return False, score

    # Check transition validity if the previous path has constraints
    if last in VALID_TRANSITIONS:
        # Allow prefix matches for parametric paths (e.g. /product/123)
        valid = any(
            next_path == v or next_path.startswith(v)
            for v in VALID_TRANSITIONS[last]
        )
        if not valid:
            score += 0.3  # Unusual transition, not necessarily a block

    return True, score

The key operational constraint: state machine validation cannot be the sole blocking mechanism. Legitimate users arrive at deep links from external sources — email links, bookmarks, social shares. These produce “invalid” transitions but are entirely legitimate. Use state machine signals as scoring inputs, and reserve hard blocks for the intersection of an invalid transition and other bot signals (low session age, no prior navigation, non-browser JA4, honeypot trigger).

5. AWS WAF ML Bot Control

AWS WAF’s targeted bot control inspection level adds ML-based analysis beyond the signature-rule layer. The TARGETED inspection level runs behavioural analysis against the request sequence rather than individual request content, which makes it more resistant to payload variation than the standard level.

resource "aws_wafv2_web_acl" "api_protection" {
  name  = "api-ml-bot-protection"
  scope = "REGIONAL"

  default_action {
    allow {}
  }

  # Rule 1: ML-based targeted bot control
  rule {
    name     = "BotControlML"
    priority = 1

    override_action {
      none {}
    }

    statement {
      managed_rule_group_statement {
        name        = "AWSManagedRulesBotControlRuleSet"
        vendor_name = "AWS"

        managed_rule_group_configs {
          aws_managed_rules_bot_control_rule_set {
            inspection_level = "TARGETED"
            # TARGETED uses ML to analyse request attributes and session behaviour
            # COMMON inspects only static signals (user-agent, IP reputation)
          }
        }

        # Exclude verified bots (Googlebot, Bingbot) from blocking
        # without excluding them from scoring — they still appear in metrics
        rule_action_override {
          name = "CategoryVerifiedSearchEngine"
          action_to_use {
            allow {}
          }
        }
      }
    }

    visibility_config {
      cloudwatch_metrics_enabled = true
      metric_name                = "BotControlML"
      sampled_requests_enabled   = true
    }
  }

  # Rule 2: Anomaly score from CRS for payload analysis
  rule {
    name     = "CRSAnomalyScore"
    priority = 2

    override_action {
      none {}
    }

    statement {
      managed_rule_group_statement {
        name        = "AWSManagedRulesKnownBadInputsRuleSet"
        vendor_name = "AWS"
      }
    }

    visibility_config {
      cloudwatch_metrics_enabled = true
      metric_name                = "KnownBadInputs"
      sampled_requests_enabled   = true
    }
  }

  # Rule 3: Rate-limit by identity, not IP
  rule {
    name     = "PerIdentityRateLimit"
    priority = 3

    action {
      block {}
    }

    statement {
      rate_based_statement {
        limit              = 200  # per 5-minute window per identity
        aggregate_key_type = "CUSTOM_KEYS"

        custom_keys {
          header {
            name                = "X-Session-Token"
            text_transformation { type = "NONE" priority = 0 }
          }
        }

        # Fall back to IP when no session token present
        scope_down_statement {
          not_statement {
            statement {
              byte_match_statement {
                search_string         = "X-Session-Token"
                field_to_match        { single_header { name = "X-Session-Token" } }
                text_transformations  { type = "NONE" priority = 0 }
                positional_constraint = "EXACTLY"
              }
            }
          }
        }
      }
    }

    visibility_config {
      cloudwatch_metrics_enabled = true
      metric_name                = "PerIdentityRateLimit"
      sampled_requests_enabled   = true
    }
  }

  visibility_config {
    cloudwatch_metrics_enabled = true
    metric_name                = "ApiProtectionWebACL"
    sampled_requests_enabled   = true
  }
}

The TARGETED inspection level carries an additional cost (~$1.00 per million requests) over the COMMON level. At high request volumes this becomes significant; weigh it against your fraud/exfiltration risk. For APIs that process authenticated sessions with financial or PII scope, targeted inspection is worth the cost.

6. Honeypot Parameters and Canary API Endpoints

Honeypots exploit a property of automated scanners: they are comprehensive. A human user fills in fields that are visible and labelled. A scanner that automatically fills all form parameters — or an LLM that auto-completes form fields by label — will populate hidden fields the user never sees. This gives you a zero-false-positive signal for a class of bots.

from fastapi import Form, HTTPException, Request
from typing import Annotated, Optional

# CSS class .hp-field { display: none; visibility: hidden; position: absolute; left: -9999px; }
# The honeypot fields are in the form but hidden from real users.
HONEYPOT_FIELDS = {"hp_url", "website_field", "user_url", "contact_email_alt"}


async def check_honeypot(request: Request) -> None:
    """Raise 400 if any honeypot field is non-empty.

    Call this as a dependency in your form-handling endpoints.
    A real user's browser will submit these fields as empty strings.
    A bot that auto-fills form fields will populate them.
    Log before raising — honeypot triggers are valuable threat-intel.
    """
    form = await request.form()
    for field in HONEYPOT_FIELDS:
        value = form.get(field, "")
        if value:
            # Log the trigger with session and source signals for threat intel
            request.app.state.logger.warning(
                "honeypot_trigger",
                extra={
                    "field": field,
                    "value_preview": str(value)[:50],
                    "ip": request.client.host,
                    "ja4": request.headers.get("X-JA4-Fingerprint", ""),
                    "session": request.cookies.get("session_id", ""),
                },
            )
            raise HTTPException(status_code=400, detail="Bad request")

For API scraping detection, deploy canary API endpoints that appear real to automated scanners but are not referenced in any UI or documentation. Any request to these endpoints originates from automated discovery — spidering, sitemap parsing, or credential stuffing tooling that probes known endpoint patterns:

# Canary endpoints — register these routes but return 200 with plausible data.
# Alert on any access. No legitimate user or integration should reach them.
CANARY_ENDPOINTS = [
    "/api/v1/users/export",      # Looks like a data export endpoint
    "/api/internal/health/deep", # Looks like an internal monitoring path
    "/api/v2/admin/tokens",      # Looks like an admin token endpoint
    "/.git/config",              # Common scanner probe
    "/phpinfo.php",              # Common scanner probe
]

@app.get("/api/v1/users/export")
async def canary_users_export(request: Request):
    request.app.state.logger.critical(
        "canary_endpoint_accessed",
        extra={"path": "/api/v1/users/export", "ip": request.client.host},
    )
    # Return a plausible 200 — don't 404, which signals to scanners that the
    # probe was detected and lets them know to move on quietly.
    return {"users": [], "count": 0, "page": 1}

The canary endpoint returns a 200 with plausible but empty data rather than a 404 or 403. A 404 tells a scanner the path does not exist; a 403 tells it access is restricted but the path is real. A 200 with empty data looks legitimate and does not alert the scanner that detection occurred — but you have already captured the source, session, and TLS fingerprint for investigation and block-listing.

Expected Behaviour

A high-entropy, AI-generated SQLi probe against the scoring stack.

The probe arrives as a POST with a parameter value of '/**/UNION(SELECT(username),(password)FROM(users))--. The anomaly scorer fires: payload_entropy returns 4.8 bits/char, SQL keywords detected, comment characters present. The request shape score reaches 0.75 — above the challenge threshold but below the hard-block threshold. ModSecurity CRS rule 942100 may or may not fire (case variation may evade it); the anomaly score accumulates anyway. The aggregate WAF score triggers a Turnstile challenge. The automated tool cannot solve the challenge without a browser context; the request is blocked. The access log shows:

192.0.2.44 - [08/May/2026:14:32:01 +0000] "POST /api/search HTTP/1.1" 403 0
  ja4="t13d1516h2_8daaf6152771_e5627efa2ab1"
  modsec_score=13 shape_score=0.75 rule_ids="9001,9003"
  msg="Anomaly threshold exceeded"

A JA4-blocked non-browser client.

A Python httpx client sends requests with varied payloads — none matching CRS signatures. JA4 fingerprint t13d1516h2_002f,0035,009c matches the known httpx fingerprint in the block map. The request is rejected at the edge before reaching ModSecurity. The attacker’s payload variation is irrelevant; the TLS hello fingerprint is identical across all variants:

192.0.2.100 - [08/May/2026:14:33:12 +0000] "GET /api/products?q=<payload> HTTP/1.1" 403 0
  ja4="t13d1516h2_002f,0035,009c_000000000000"
  block_reason="ja4_blocklist"

A honeypot trigger from an auto-filling form bot.

A credential-stuffing bot submitting the login form populates hp_url=https://example.com — a honeypot field hidden with CSS. The endpoint logs the trigger and returns 400:

{
  "level": "WARNING",
  "event": "honeypot_trigger",
  "field": "hp_url",
  "value_preview": "https://example.com",
  "ip": "45.33.32.156",
  "ja4": "t13d1312h2_002f,0035,009c_000000000000",
  "session": "sess_8f3a..."
}

The IP and JA4 are added to a temporary block list for 24 hours. Subsequent requests from that source are rejected regardless of payload content.

Trade-offs

Anomaly scoring over blocking mode. Anomaly scoring’s false positive rate is higher than precise signature matching against known-bad patterns. A legitimate request that happens to contain SQL keywords in a user-submitted query (a database admin tool, a developer forum, a code search interface) will accumulate score. Tune the thresholds against your specific traffic before enabling. Start with the threshold at 25, monitor blocked requests for a week, and lower it in steps of 2 as you identify and suppress false positives with exclusion rules.

State machine validation. The implementation cost is significant. You need to maintain the transition graph as the application evolves; a new checkout flow that adds a step breaks validation for that path. Direct-link access from email campaigns, bookmarks, and external referrers is legitimate and produces invalid transitions. The practical constraint: state machine signals must be one input to a scorer, not a binary blocker. The engineering overhead scales with application complexity and must be continuously maintained by engineers who understand both the security model and the UX flows.

JA4 blocking. Every non-browser client in your ecosystem will appear in the JA4 block list: CI health checks, monitoring probes, partner API integrations, mobile application HTTP stacks, internal automation. Before enabling blocking, collect JA4 fingerprints in logging mode for at least two weeks. Build an allowlist of known-good fingerprints from authenticated API clients. Enforce the allowlist via bearer token validation rather than JA4 alone — a bot that steals a bearer token should still be detectable by other signals.

Honeypot fields. Honeypot fields require client-side coordination to hide from real users. CSS alone is insufficient against accessibility tools and some browser extensions that may reveal hidden fields. Add both CSS display-none and an explicit aria-hidden="true" attribute. Test with screen readers and browser autofill — some autofill implementations will populate hidden fields, generating false positives from legitimate users. If autofill false positives appear, switch the honeypot mechanism to a time-based token (the field must be empty and submitted within a valid time window) rather than solely empty-field detection.

AWS WAF targeted inspection cost. At 100M requests/month, targeted bot control adds approximately $100/month over the standard tier. At 1B requests/month, $1,000/month. Budget accordingly. For low-traffic internal APIs where the primary threat is authenticated API abuse rather than bot traffic at scale, the standard inspection level is usually sufficient.

Failure Modes

Relying on OWASP CRS in blocking mode as the primary defence. A motivated attacker with access to the CRS rule set (it is public at github.com/coreruleset/coreruleset) and a capable LLM generates a payload set that evades every rule in under 10 minutes. The workflow: paste the relevant SQLi rule regexes into a model prompt asking for payloads that match none of them while exploiting a specific injection point. The model produces dozens of viable bypasses on the first attempt. Blocking mode on CRS with no other signals provides no protection against this workflow. Detection: monitor your WAF bypass rate — the ratio of requests that reach the application after the WAF layer and still produce error responses indicating injection. A rising bypass rate after an LLM payload tool release is the signal. Response: move to scoring mode and add the signals described above.

Rate limiting by IP alone. Residential proxy networks trivialise per-IP limits. A credential-stuffing campaign across 1,000 Bright Data residential IPs, each sending one request per minute, produces 1,000 requests/minute total with zero per-IP violations against a typical limit of 100/minute. The attack succeeds without triggering any IP-based control. Simultaneously, a legitimate user behind CGNAT or a corporate proxy shares an IP with hundreds of colleagues and hits per-IP limits while the bot does not. Per-IP rate limiting in 2026 harms legitimate users more than attackers. Replace it with per-identity limits keyed on session token, account ID, or the composite (JA4, ASN, UA class) for unauthenticated traffic.

CAPTCHA as the sole bot defence. AI vision models and CAPTCHA-solving services render CAPTCHA unreliable as a sole control. hCaptcha and reCAPTCHA v3 solve rates above 70–85% for capable AI clients make the assumption “if it passed CAPTCHA, it’s human” dangerous. CAPTCHA remains useful as a friction layer that raises attacker cost — it is not useless — but it must be layered with the behavioural and provenance signals described above. A bot that solves a CAPTCHA but presents a Python httpx JA4 fingerprint should still be blocked.

Not monitoring WAF bypass rates. If you do not measure what fraction of requests reach the application after the WAF layer, you cannot detect when the WAF is being systematically evaded. Instrument your application to emit a metric on every request — WAF-passed, blocked, challenged — and correlate it with application-layer error patterns (database errors, unexpected 4xx/5xx rates on injection-prone endpoints). A sudden increase in application-layer SQL errors with flat WAF block rates is the signature of a WAF bypass campaign in progress. Without this telemetry, the bypass runs until the database fires an alert or the attacker’s data appears in an external breach feed.

Stale JA4 allowlist. A JA4 allowlist built once and never maintained becomes either too restrictive (blocking new legitimate clients after TLS library upgrades) or too permissive (allowing attacker clients that have updated to a previously-known-good fingerprint). Schedule quarterly reviews of the allowlist. When a mobile app or API client updates its TLS library, the JA4 fingerprint changes — this will appear as a sudden spike in blocked requests from a source that was previously legitimate. Build a review process that catches this before it causes a support incident.