SENTINEL/RX
Cyber Recovery OS · v4.2
SYS NOMINAL
AI-Powered · Zero-Trust · Telco-Grade

Recovery is not
an option. It's a protocol.

SENTINEL/RX is a mission-control platform that orchestrates cyber recovery for telco workloads — BSS, OSS, 5G core, billing, IMS — with AI-powered RTO and RPO.

RTO
< 4 min
guaranteed
RPO
≤ 30 sec
continuous
Audit
100%
immutable
Threat Surface · LIVE
2 ANOMALIES
NODES
1,284
VAULTS
37
SHIELDS
ACTIVE
Workloads Protected
12,408
+128 / 24h
Immutable Snapshots
8.2M
air-gapped
AI Decisions / day
640K
auto-orchestrated
Recovery Success
99.998%
12-mo rolling
// 01 · Mission Control

One pane. Every workload. Every threat.

Operators see the entire telco recovery posture in real time — and the AI assist suggests the next safe move.

Threat Telemetry
● LIVE
  • 12:04:21Ransomware signature on BSS-CRM-04
  • 12:03:58Anomalous IAM token velocity · region EU-W
  • 12:03:30Vault-09 integrity verified · SHA-512 OK
  • 12:02:11Lateral movement attempt blocked
  • 12:01:45Snapshot 8,201,994 sealed (immutable)
  • 12:00:02AI rerouted 5G-CORE-N12 to clean enclave
  • 11:58:39Recovery drill completed · RTO 03:42
  • 11:57:10Drift detected on IMS-SBC-02 config
  • 12:04:21Ransomware signature on BSS-CRM-04
  • 12:03:58Anomalous IAM token velocity · region EU-W
  • 12:03:30Vault-09 integrity verified · SHA-512 OK
  • 12:02:11Lateral movement attempt blocked
  • 12:01:45Snapshot 8,201,994 sealed (immutable)
  • 12:00:02AI rerouted 5G-CORE-N12 to clean enclave
  • 11:58:39Recovery drill completed · RTO 03:42
  • 11:57:10Drift detected on IMS-SBC-02 config
AI Assist
REASONING
Incident · INC-44219

Suspected ransomware on BSS-CRM-04. Encryption entropy +812%.

Recommended Protocol
  1. 1Isolate workload · sever east-west traffic
  2. 2Mount snapshot 8,201,994 · clean enclave
  3. 3Restore subscriber data · verify hash chain
  4. 4Replay billing delta · 00:14 window
Confidence · 98.4%
Workload Fabric
12 / 12,408 shown
5G-CORE-N12
BSS-CRM-04
IMS-SBC-02
OSS-PROV-01
BILL-RTG-09
HSS-EU-W3
CHARGING-07
DPI-EDGE-22
VAULT-CORE
PCRF-N04
SMSC-LEG-1
MEC-NODE-18
SAFE
9
WARN
2
THREAT
1
// 02 · Operating Doctrine

Five principles. No exceptions.

The non-negotiable rules that govern every decision the platform — and its operators — are allowed to make.

P/01

Nothing moves unless it's safe

Cyber gating is sacred. Every restore, failover, and replay passes integrity, provenance, and behavioral checks before a single byte is committed.

No bypass without trace.
P/02

AI advises, humans decide

Every recommendation ships with its reasoning, evidence chain, and confidence band. Operators can override, amend, or escalate — always.

Explainable. Overridable. Accountable.
P/03

Time is everything

RTO and RPO are not slides — they are live counters. Every panel surfaces the cost of waiting and the budget remaining.

Always visible. Always updating.
P/04

Start small, recover fast

Minimum Viable Service first: restore the dial-tone path, then expand outward in deterministic, reviewable waves.

MVS-first recovery philosophy.
P/05

Every action leaves a trail

AI suggestions, operator clicks, system effects — all sealed in a tamper-evident ledger. Auditability is the substrate, not a report.

First-class. Not afterthought.
// 03 · Capabilities

Six pillars. One guarantee.

Built specifically for the operational reality of telco infrastructure under nation-state pressure.

Zero-Trust Recovery Plane

Every restore action signed, verified, and isolated. No implicit trust between vault, network, or operator.

AI Decision Engine

Models trained on telco kill-chains recommend least-impact recovery paths in seconds, not hours.

Immutable Vaults

Air-gapped, WORM-locked snapshots with cryptographic chain-of-custody for every byte.

Telco Workload Aware

Native understanding of BSS, OSS, 5G core, IMS, charging, and HSS dependencies.

Forensic-Grade Audit

Every operator click, AI suggestion, and system action sealed in tamper-evident ledger.

Continuous Recovery Drills

Synthetic attacks rehearse RTO/RPO daily — your recovery posture is proven, not promised.

// 04 · Recovery Protocol

From detonation to dial-tone in under four minutes.

A deterministic playbook, executed by humans and machines as one team.

  1. T+0sSTEP 01

    Detect

    AI signal fusion across IDS, EDR, NDR, and behavioral telemetry.

  2. T+12sSTEP 02

    Isolate

    Zero-trust segmentation severs blast radius automatically.

  3. T+45sSTEP 03

    Decide

    Assist proposes recovery protocol with confidence score.

  4. T+90sSTEP 04

    Restore

    Mount immutable snapshot in clean enclave; verify hash chain.

  5. T+3mSTEP 05

    Validate

    Synthetic transactions confirm BSS/OSS/5G integrity.

  6. T+4mSTEP 06

    Resume

    Subscriber traffic re-cut. RTO met. Audit sealed.

// 05 · System Module

Recovery Orchestrator with Cyber Gates.

A visual runbook pipeline. Six stages, four gates each. Nothing advances unless every gate is green — and every block has evidence.

RecoveryFlowEngine
Module · cyber-gated runbook pipeline
2 PASSED1 PENDING1 BLOCKED
Data· gates
1 blocked
Progression rule. The pipeline cannot advance to Network until every gate on this stage reads PASSED. No bypass without a witnessed override.
Why blocked?
BLOCKED
Integrity Check

Hash chain mismatch on shard 14 of subscriber DB. Expected 9af3…c7e1, got 11b2…8ddf. Suspect tampering — escalate to forensics before mount.

Stage
S/04 · Data
Scanner
cyber-gate.v9.2
Witness
ledger://INC-44219
Hash
9af3…c7e1
// 06 · System Module

AI RTO/RPO Prediction Engine.

ResilienceAI fuses backup lag, replication health, infra contention, change rate, and restore throughput into a live forecast — and tells you exactly how to stay inside the SLA.

ResilienceAI
Module · RTO/RPO prediction engine
BREACH RISK
RTO target
4:00
RTO predicted● LIVE
6:08
Drift
+53%
Target PredictedForecast breaches SLA in ~17h
−24h−18h−12h−6hnow
Live Signal Inputs
14s
Backup Lag
92%
Replication Health
78%
Infra Contention
1.4k/m
Change Rate
6.8GB/s
Restore Throughput
AI Recommendations
4 actions
  • Increase replication bandwidth by 20%

    −42s RTO·conf 96%
    high
  • Reduce snapshot interval from 30m → 10m

    −18s RPO·conf 94%
    high
  • Pre-warm clean enclave on EU-W3

    −27s RTO·conf 88%
    med
  • Throttle non-critical batch jobs (06:00–08:00 UTC)

    −12s RTO·conf 81%
    med
Predictions update every 15s. All actions are operator-approved.
// 07 · System Module

Clean Restore Point Intelligence.

Not every snapshot is safe. RecoveryPointAdvisor ranks every restore point by confidence — fusing EDR/XDR, file integrity, behavior, and timeline correlation — so you mount the past, not the attacker.

RecoveryPointAdvisor
Module · clean restore point intelligence
4 CLEAN1 RISK1 COMPROMISED
−3h attack T+0 · 12:04:21Znow
SAFE CUT
Timestamp
Confidence
Status
Size
Why this is clean
CLEAN
EDR / XDR Alerts

0 alerts across all sensors · last scan 11:18:42Z

File Integrity

All hashes match signed manifest · SHA-512 OK

Behavioral Anomalies

Baseline behavior · no outliers in 24h window

Timeline Correlation

45m pre-attack · before earliest IoC observed

Snapshot · T−45m · conf 92%
// 08 · System Module

Containment & Recovery Zoning.

Every restored workload lands in Quarantine. Promotion to Staging and Production is gated by validation, approval, and zero-trust posture — enforced, not requested.

ZoneManager
Module · containment & recovery zoning
3 QUARANTINE2 STAGING4 LIVE
default restore validate approve production
Quarantine
isolated · default landing
3
All restored workloads land here. Zero egress. Forensic taps active.
  • BSS-CRM-04
    BSS
    VALAPR
    Stagingneeds approval
  • IMS-SBC-02
    IMS
    VALAPR
    Stagingneeds validation
  • BILL-RTG-09
    Billing
    VALAPR
    Staging
Staging
validated · pre-promotion
2
Synthetic traffic + integrity replay. East-west traffic to prod is blocked.
  • OSS-PROV-01
    OSS
    VALAPR
    Production
  • HSS-EU-W3
    5G Core
    VALAPR
    Productionneeds approval
Production
live · subscriber traffic
4
Promotion requires validation pass AND human approval (FIDO2-witnessed).
  • CHARGING-07
    Billing
    VALAPR
    serving traffic
  • PCRF-N04
    5G Core
    VALAPR
    serving traffic
  • MEC-NODE-18
    Edge
    VALAPR
    serving traffic
  • DPI-EDGE-22
    Edge
    VALAPR
    serving traffic
Zone Controls · zero-trust enforcement
Microsegmentation
East-west traffic blocked between zones except on whitelisted service paths.
● enforced
Zero-Trust Policies
Every call is authenticated, authorized, and signed. No implicit trust between hops.
● enforced
Identity Restrictions
Only scoped workload identities (SPIFFE) and FIDO2-attested operators may act.
● enforced
// 09 · System Module

Telco Dependency Intelligence.

DependencyGraphAI maps 5G Core (AMF/SMF/UPF), IMS, OSS/BSS, and DNS/DHCP into a live topology — auto-generating restore order and surfacing dependency conflicts before they detonate.

DependencyGraphAI
Module · telco dependency intelligence
5G COREIMSOSS/BSSNET FOUND.
Topology
click a node to inspect dependencies
dependency path conflict idle edge
Selected
Business SupportOSS/BSS

Depends on 7 upstream services. They must be healthy before this node can be restored.

Restore Sequence Preview
6 waves
  1. 1
    Wave 01
  2. 2
    Wave 02
  3. 3
    Wave 03
  4. 4
    Wave 04
  5. 5
    Wave 05
  6. 6
    Wave 06
Detected Conflicts
1
  • BSSIMS

    Circular promotion risk: BSS declares dependency on IMS which is restored in a later wave. AI proposes deferring BSS startup probe by 30s.

auto-generated · graph-aware
// 10 · System Module

Minimum Viable Service Mode.

When the network is on fire, restore the dial-tone first. MVSOrchestrator brings up only what's critical, then expands in waves as KPIs prove green.

MVSOrchestrator
Module · minimum viable service mode
MVS · ACTIVE
Restore Plan · Emergency Services Only
ETA 6m / 27m
Live KPIs · phase P1
Attach Success Rate96.4%
0%target 95%100%
Call Setup Success Rate99.1%
0%target 97%100%
Authentication Success97.2%
0%target 98%100%
Subscribers Servedlive
18%target 100%
Profile Guarantees
  • 112 / 911 routing operational
  • Public-safety priority queuing
  • Location services to PSAP active
start small · expand on green KPIs
// 11 · System Module

Recovery is not done until it's proven.

ServiceValidator runs config-compliance checks and synthetic Attach / Call / SMS / Data transactions. The runbook stays open until every probe is green.

Automated Validation Engine
● Idle
Config Compliance3 checks
Policy Compliance
NIS2 · DORA · ISO 27031 baselines
Config Drift
Golden manifest vs live state
Certificate Chain
mTLS + SBC trust anchors
Synthetic Transactions4 checks
Attach
UE → AMF → SMF → UPF
Call
VoLTE setup · IMS · 200 OK
SMS
MO/MT round-trip via SMSC
Data
PDU session · 50 Mbps probe
Recovery is NOT complete until validation passes
0 passed · 0 failed · 7 pending
// 12 · System Module

Humans hold the trigger. Always.

ApprovalSystem enforces role-based, separation-of-duties approvals for production promotions and AI overrides — every decision FIDO2-witnessed and written to the audit trail.

ApprovalSystem
Module · human-in-the-loop · separation of duties
Acting as
Promote to Prod MED RISK
Approve Promotion to Production
CR-77412 · HSS-EU-W3 · 5G Core · raised 12:08:14 via ZoneManager · Staging
FIDO2-witnessed
Proposed Action

Promote workload from Staging → Production. Validation passed (7/7). Microsegmentation enforced.

Approval Chain · quorum 3-of-3
  1. SecOps Leadstep 1 approved · a.morel @ 12:09:02

    Forensic taps clean. Integrity hash matches vault-seal.

  2. Compliance Officerstep 2 awaiting
  3. Duty Executivestep 3 awaiting
Your decision · Compliance Officer
witness: you
// 13 · System Module

Every AI decision, fully explainable.

ExplainabilityPanel surfaces the top contributing signals, the analysis window, the model's confidence and the supporting telemetry — for every action the AI takes.

ExplainabilityPanel
Module · why the AI decided · auditable
rpa-v3.4.1 · ensemble(xgb+iforest)
RecoveryPointAdvisorAID-90412
Recommend restore point T-45min (snapshot 8,201,949)
✅ CLEAN · 92% confidence
Window
90 min
Confidence
92%
Raised
12:08:03
Confidence92%
Rationale

No EDR/XDR alerts within window. File-integrity hash chain unbroken across 184 monitored paths. Behavioral baseline within ±1.2σ. Snapshot predates first observed encryption-entropy spike by 14 min.

Top contributing signals
Σ 100%
  • EDR/XDR alerts in window
    34%
    obs 0 · base ≤ 2CrowdStrike · Defender
  • File integrity hash drift
    27%
    obs 0 / 184 · base 0 / 184Tripwire · vault-seal
  • Behavioral anomaly z-score
    18%
    obs +0.4σ · base ±1.5σUEBA · prod-cluster
  • Encryption entropy on disk
    12%
    obs 3.1 bits · base ≤ 4.0 bitsDPI-EDGE-22
  • Lateral movement attempts
    9%
    obs 0 · base ≤ 3Zeek · east-west tap
Supporting telemetry
last 90m
  1. 12:07:58vault-seal
    Hash chain verified
    SHA-512 OK · 184/184
  2. 12:06:11CrowdStrike
    EDR sweep · BSS-CRM-04
    0 alerts, 0 quarantined
  3. 12:05:44UEBA
    Behavioral baseline tick
    z = +0.4σ
  4. 12:03:30vault-seal
    Vault-09 integrity
    SHA-512 OK
  5. 11:58:42snapshot-svc
    Snapshot sealed
    id 8,201,949 · immutable
decision sealed · written to immutable audit ledger reproducible from rpa-v3.4.1 · ensemble(xgb+iforest)
// 14 · System Module

The platform survives what it protects against.

ControlPlane runs multi-site HA with cross-region quorum, gracefully degrades when sites fall, and ships signed offline runbooks so operators stay in control even when the cloud isn't there.

ControlPlane
Module · platform resilience · multi-site HA
Quorum 3/4 Normal
2 active 1 degraded rpo ≤ 30s · cross-site

All sites healthy. Quorum 4/4. AI fully online, full telemetry pipelines.

Frankfurt-Aprimary
EU-West · eu-w-fra
ACTIVE
rtt
12ms
load
62%
wkld
4,821
hb 12:14:02 serving
Stockholm-B
EU-North · eu-n-sto
STANDBY
rtt
28ms
load
18%
wkld
0
hb 12:14:01
Madrid-C
EU-South · eu-s-mad
DEGRADED
rtt
71ms
load
84%
wkld
1,207
hb 12:13:58 not eligible
Ashburn-D
US-East · us-e-iad
ACTIVE
rtt
48ms
load
41%
wkld
3,194
hb 12:14:02
Resilience capabilities
Multi-site HA
Synchronous quorum across 4 sites. Sub-second leader election. RPO ≤ 30s cross-region.
engaged
Degraded mode
Sheds non-critical telemetry, runs cached AI models, retains full recovery & approval flows.
engaged
Offline runbooks
Signed, versioned runbooks executable from local vault. No cloud, no orchestrator required.
engaged
Offline runbooks
local vault
  • RB-014core
    5G Core cold-start without orchestrator
    12 steps · ~18mopen
    signed · secops · 2026-04-22
  • RB-021billing
    Billing day-close with charging in MVS
    9 steps · ~11mopen
    signed · finops · 2026-05-01
  • RB-007ims
    IMS SBC failover via offline DNS
    14 steps · ~22mopen
    signed · secops · 2026-03-30
  • RB-031edge
    MEC node lights-out recovery
    7 steps · ~9mopen
    signed · netops · 2026-04-15
// 15 · AI Behaviour Charter

The assist has a personality — and rules.

SENTINEL/RX is engineered to be precise, transparent, and advisory. It never issues commands; it surfaces reasoned recommendations with confidence scores so a human always owns the call.

AI Personality Charter

How SENTINEL/RX speaks under pressure

Never vague

Every statement is grounded in a measurable signal, a named source, or a numeric threshold.

✗ Anti-pattern

“Looks suspicious. Maybe revert?”

✓ Exemplar

“Snapshot SS-218 (T-42m) shows entropy 7.91 bits — exceeds 4.0 threshold on 3 of 14 volumes (vol-eu-07, vol-eu-09, vol-na-02).”

Voice calibration · say this / not that
“Confidence 0.91 — recommended.”
“Trust me on this.”
“3 of 4 gates passed. 1 pending: enclave attestation.”
“Almost ready.”
“Suggest holding promotion until UEBA σ falls under 1.0.”
“Do not promote.”
“Evidence: 14 telemetry sources, 2 disagree (see ledger #4821).”
“Most signals agree.”
Enforced at inference time · prompt-layer + post-hoc linteraudit-tag: persona.v1.4
// 16 · Security & Compliance Behaviour

Designed for the auditor in the room.

Every operator click and every AI inference lands in a tamper-evident ledger. Bundles export pre-mapped to NIS2, DORA, ISO 27001/27031 and GDPR — no spreadsheet archaeology required.

Security & Compliance Behaviour

Every action witnessed. Every log sealed. Every record export-ready.

Stream · INC-44219 · last 5 entries of 1,284append-only · object-lock · region-replicated
Seq
Timestamp
Actor
Action / target
Hash
#4821
09:14:02Z
ai.sentinel
decision.recommend · snapshot SS-218
9f3a…b81c
#4822
09:14:47Z
n.ferreira
approval.granted · RB-014 / step 3
2d77…a019
#4823
09:15:11Z
control-plane
enclave.attestation · tee-eu-west-2
4b9e…c33f
#4824
09:16:08Z
k.osei
approval.granted · RB-014 / step 3
8a01…7d62
#4825
09:16:33Z
ai.sentinel
promote.shadow · vol-eu-07
1c54…ff2a
100%
actions captured
<1ms
ledger latency p95
3 regions
synchronous replicas
Behaviour enforced by ControlPlane · evidence sealed at write-timepolicy: compliance.v2.1
// 17 · End-to-end Scenario

A live ransomware incident, second by second.

Watch the platform, the AI assist, and the human operators play their parts — from first IoC to a sealed evidence bundle.

Example user flow · INC-44219

From ransomware detection to sealed evidence — in 5h 58m.

SYSTEMT+00:00· step 1 of 9

Incident detected — ransomware

EDR + entropy spike + DNS exfil pattern correlated across 14 nodes. INC-44219 opened, blast-radius scoping started.

EDR alerts
47
Entropy
7.91 bits
Affected nodes
14
artifact: INC-44219 · severity P1
// 18 · Reference Architecture

The stack we'd ship to a tier-1 telco tomorrow.

Opinionated choices, not laundry lists. Each layer earns its place by what it survives — process death, region loss, silent data corruption — not by what it ships in a brochure.

Suggested tech stack

Five layers, chosen for survivability and explainability.

AI Layer

Time-series + anomaly detection + LLM explainability

Detection needs deterministic numerical models; explanation needs natural language. We split the brain in two and chain them — fast classifiers feed a small LLM that narrates the why.

Building blocks
Prophet / N-BEATS
RTO drift forecast, capacity prediction
Isolation Forest + UEBA
anomaly scoring on telemetry & logs
Vector store (pgvector)
runbook retrieval, similar-incident recall
LLM gateway (Lovable AI)
explainability layer, action suggestions
alternatives considered:Custom transformers,AWS Lookout for Metrics
layer · ai
Cross-cutting · spans every layer
Zero-trust
mTLS, SPIFFE IDs
Observability
OTel everywhere
Multi-region
active/active quorum
Offline-capable
signed runbooks
// 15 · Compliance

Audit-grade by design.

Every action is cryptographically witnessed. Regulators don't ask twice.

  • NIS2 · DORA · GDPR aligned
  • ISO 27001 / 27031 control mapping
  • ENISA telco resilience guidance
  • Tamper-evident WORM ledger (SHA-512 chain)
  • Operator action attestation with FIDO2 keys
Audit Ledger · sample
SEALED
{
  "incident": "INC-44219",
  "ts":       "2026-05-08T12:04:21Z",
  "actor":    "ai.assist::v4.2",
  "action":   "RESTORE_FROM_SNAPSHOT",
  "target":   "BSS-CRM-04",
  "snapshot": "vault-09::8201994",
  "operator": "k.lindqvist@telco.eu",
  "fido2":    "ok",
  "hash":     "9af3...c7e1",
  "prev":     "1d0b...44a9",
  "rto_sec":  214,
  "rpo_sec":  18
}
// 19 · Charter

The brief, in one breath.

Pin this to the wall. Everything in SENTINEL/RX is judged against it.

// Final Instruction · ratified
doc-id: DIR-001 · classification: internal · sig: hsm-eu-west

Build a real-time, AI-assisted cyber recovery control plane with strong visual orchestration, explainable AI decisions, strict security gating, and telco-aware dependency intelligence.

The experience must feel like a mission-critical NOC + AI assist, optimized for speed, trust, and auditability during high-pressure incidents.

Real-time
sub-second telemetry, live recovery graph
AI-assisted
assist, never auto-pilot
Visual orchestration
the plan is the picture
Explainable
every score, every signal, sourced
Strictly gated
two-person rule, attested enclaves
Telco-aware
HSS, charging, MVNO blast-radius
Speed under pressure
designed for the worst Tuesday
Trust & audit
WORM ledger, regulator-ready
Speed

Every screen answers the operator's first question in under a second. No spinners on critical paths. Defaults are the safe action.

Trust

The AI explains itself or stays silent. Confidence scores are visible. Humans always own the irreversible step.

Auditability

Every click, every signal, every decision lands in a tamper-evident ledger — pre-mapped to the frameworks regulators will ask about.

charter active · enforced at every layer · violations break the build
signed: SecOps · NetOps · SREhash: 9f3a…b81c
Briefing available · NDA

When the next attack lands, you'll already be recovering.

Schedule a 30-minute mission-control walkthrough with our telco resilience architects.