Element34
Automated RCA · Root cause AI

Stop chasing flakes. Find the cause.

Automated RCA clusters failed sessions, summarizes the root cause, and proposes the fix. The per-session debug analyzer surfaces network, console, and command logs for any single failure. The cross-session RCA spots the pattern when the same flake hits seven specs at once.

  • Per-session debug analyzer
  • Cross-session cluster RCA
Triage time
Morning → 15 min
Activation
1 flag
Categories
Selenium · SBOX · Infra
Selenium Box · Root Cause AI
Failure cluster · live
Specs 7 Confidence 94% Fixes 3
FAILURE STREAM
[09:14] build release-2026.05.29
✕ checkout.smoke · TimeoutError
✕ auth.flow.sso · TimeoutError
✕ billing.api.handshake · TimeoutError
✕ search.spec.locale-de · TimeoutError
✕ search.spec.locale-en · TimeoutError
✕ profile.edit · TimeoutError
✕ cart.add.line-item · TimeoutError
↳ clustering 7 failures...
✓ cluster identified · confidence 94%
RCA SUMMARY
Cluster #042 Confidence 94%
Pattern TimeoutError on dashboard-greeting across 7 specs
Suggested cause Backend p95 rose 340ms → 9.4s after deploy 7c9a2b1
Proposed fixes
  • Selenium Bump retry budget to 3 on dashboard-greeting wait
  • SBOX Increase per_test_timeout_ms to 800000
  • Infra Add p95 alert on /api/dashboard
Selenium · framework-level fix SBOX · config-level fix Infrastructure · platform-level fix
Banking & Financial Services
Triage time drops

Cluster instead of triage. One RCA summary replaces a morning of session-by-session debugging across the release pipeline.

Release-day proof
Insurance & Healthcare
Audit-ready fix trail

Every RCA writes the pattern, the suggested cause, and the proposed fix to the session record. Defensible to QA leadership and compliance.

Audit trail
Government & Public Sector
AI runs on your model

Cluster RCA reasoning uses your bring-your-own LLM. Failure data and logs stay inside your perimeter.

Private inference
Mid-market & MedTech
Fix path, not a hint

Recommendations bucket into Selenium, SBOX, or Infrastructure. The engineer gets a fix path, not a hint.

Actionable
The triage tax

Seven sessions failed last night. They failed for the same reason.

Every long-lived test suite carries the same hidden cost. When a deploy ships a regression, the same root cause fails dozens of sessions at once. Engineering opens seven debug tickets, reads seven traces, finds one cause. Automated RCA collapses that into one summary, attached to the pattern, with a proposed fix.

Pain pattern 01

Deploy-day regression

Symptom. A backend service deploys, p95 latency climbs from 340ms to 9.4s, and seven E2E specs time out on dashboard load. Each session shows TimeoutError. Engineering reads each trace individually.

Automated RCA response Clusters the seven failures. Surfaces the p95 spike and the offending commit hash. Proposes a retry-budget change plus a wait extension on dashboard-load.
Pain pattern 02

Infrastructure starvation

Symptom. Kubernetes pods take 30 minutes to scale. Queue wait time grows. Tests fail with start-time exhaustion. The session log just shows the timeout, not the cause.

Automated RCA response Categorizes the recommendation as Infrastructure. Suggests a scaling strategy for your private grid. The engineer escalates to the platform team with a specific ask, not a vague complaint.
Pain pattern 03

Slow-command drift

Symptom. A command that normally takes 200ms starts taking 700ms after a feature flag flip. Tests don't fail, they get slower. Nobody notices until p95 SLOs break.

Automated RCA response Per-session debug analyzer flags slow commands against the historical baseline (over 500ms is the default threshold). Surfaces the flagged commands before they cause failures.
How it diagnoses

Two tiers. Per-session debug. Cross-session cluster.

Automated RCA is two capabilities under one name. The per-session debug analyzer answers 'why did this single session fail'. The cross-session cluster RCA answers 'why are these seven sessions failing together'. Engineers get both, automatically, on every failed run.

Tier 1 · default

Per-session debug analyzer

Surfaces network logs, console logs, and command logs for the failed session. Flags slow commands against historical baseline. Categorizes recommended fixes into three buckets.

Slow cmd flagged:
driver.findElement('#dashboard-greeting')
Took: 7.2s (baseline: 340ms)
Category: Infrastructure
Recommendation: Investigate /api/dashboard p95
Deterministic. No model. On by default.
Tier 2 · activate per build

Cross-session cluster RCA

Clusters failed sessions across the build by signature. Identifies common root cause across the cluster. Attributes the cluster to a specific commit or change. Proposes a concrete fix.

Cluster: 7 specs · confidence 94%
Pattern: TimeoutError on dashboard-greeting
Cause: Backend p95 340ms → 9.4s
After deploy: 7c9a2b1
Fixes: Selenium · SBOX · Infrastructure
Runs on your model. BYO-LLM.
Same failure set. Two response modes.

Manual triage takes a morning. Automated RCA takes a minute.

The same seven failed sessions land in the queue. Without RCA, engineering opens each one individually. With RCA on, the system identifies the cluster, attributes the cause to a deploy or config change, and queues the fix for review.

Without Automated RCA

Seven failures. Seven separate investigations.

09:14checkout.smokeTicket
09:14auth.flow.ssoTicket
09:15billing.api.handshakeTicket
09:15search.spec.locale-deTicket
09:16search.spec.locale-enTicket
09:16profile.editTicket
09:17cart.add.line-itemTicket
Result: 7 tickets in the queue. 7 engineers triaging the same root cause. ~3 hours of duplicated work before someone connects the dots.
With Automated RCA

Seven failures. One cluster. One fix queue.

RCA Summary · Cluster #042 · 94% confidence
Cluster 7 specs grouped by failure signature
Pattern TimeoutError on dashboard-greeting
Cause Backend p95 rose 340ms → 9.4s after deploy 7c9a2b1
Proposed fixes
Selenium Bump retry budget to 3 attempts on dashboard-greeting wait
SBOX Increase per_test_timeout_ms to 800000 for affected suite
Infra Add p95 alert on /api/dashboard, review commit 7c9a2b1
Result: 1 RCA summary. 1 fix queued for review. ~15 minutes from failure to assigned owner.

Same failure set in both runs. Only the e34:rcaCluster capability changed.

Activation

Two flags. Per-session debug + cluster RCA.

Per-session debug analysis is on by default for every SBOX session. Cross-session cluster RCA activates with a single capability flag at the build level. Existing Selenium specs do not change.

Add to your capabilities BETA

wdio.conf.js
1capabilities: { 2 'wdio:enforceWebDriverClassic': true, 3 browserName: 'chrome', 4 'e34:per_test_timeout_ms': 600000, 5 'e34:testName': 'Checkout smoke', 6 'e34:buildName': 'release-2026.05.29', 7 'e34:autoHeal': true, 8 'e34:debugAnalyzer': true, // per-session, on by default 9 'e34:rcaCluster': true, // ← THE FLAG for cross-session RCA 10 'e34:token': 'jenkinshardcap12', 11 acceptInsecureCerts: true, 12}
Same capabilities block you already use. Two RCA flags clearly named. The cluster flag activates cross-session RCA at the build level.
Three fix categories

Selenium. SBOX. Infrastructure.

Generic "something failed" messaging is useless to a busy engineer. Automated RCA bins every recommendation into one of three categories so the right person owns the fix without escalation hops.

Category 01

Selenium framework

When the cause is at the framework or test level. Fix stale locators, tune wait strategies, add missing assertion guards.

Example recommendation Test uses implicit wait. Switch to explicit WebDriverWait on dashboard-greeting for predictable behaviour.
Category 02

SBOX configuration

When the cause is in the test infrastructure configuration. Loosen timeout settings, align capability declarations, add missing browser flags.

Example recommendation Per-test timeout (600s) is tight relative to current p95 (8.4s). Increase per_test_timeout_ms to 800000 for this suite.
Category 03

Infrastructure

When the cause is below the test layer. Identify backend latency spikes, fix pod scaling delays, reduce network jitter, resolve dependency outages.

Example recommendation Backend p95 on /api/dashboard rose 340ms → 9.4s after deploy 7c9a2b1. Add alert on backend p95. Review commit 7c9a2b1.
Private inference

Cluster reasoning runs on your model.

Automated RCA's cluster summarization uses your bring-your-own LLM. Per-session debug analysis runs on deterministic heuristics with no model invocation. Failure data and logs stay inside the customer environment.

How Automated RCA handles AI without exposing failure data

Most heals and most diagnostics never invoke a model. Cluster reasoning uses the model the customer already trusts, configured in the SBOX config.

BYO-LLM (cluster tier)

Use your OpenAI, Anthropic, or Azure OpenAI subscription. Or a self-hosted model running inside your environment.

No LLM for the common case

Per-session debug analyzer uses deterministic rules and historical baselines. No model call. No token spend.

Failure data stays inside your perimeter

Session logs, network captures, and stack traces never leave your environment. Element34 has no visibility into your failures.

Automated RCA in context

Studio writes. Auto Heal heals. Automated RCA explains.

The five AI native modules inside SBOX cover the full lifecycle. Automated RCA is the diagnostic layer.

AI authoring

Studio

Plain-English to production-grade Selenium. Authoring entry point.

Explore Studio
Self-healing

Auto Heal

Resilient locators. Heals what would have caused a flake before RCA sees it.

Explore Auto Heal
Quality intelligence

Pulse Report

Site health view. Aggregates RCA findings over time into prioritized opportunities.

Explore Pulse Report
Mobile testing

Real Device Cloud

Real iOS and Android devices. RCA applies the same way as desktop browser sessions.

Explore Real Device Cloud
Automated RCA FAQ

Questions enterprise teams ask about Automated RCA.

What is Automated RCA in Element34 SBOX?
Automated RCA is the diagnostic module inside Element34 SBOX. It works in two tiers: a per-session debug analyzer that surfaces network, console, and command logs for any failed session, and a cross-session cluster RCA that groups related failures, attributes them to a root cause, and proposes a fix.
How is the per-session debug analyzer different from cross-session RCA?
The per-session debug analyzer answers why a single session failed using deterministic rules and historical baselines, with no model invocation. The cross-session cluster RCA answers why a group of sessions are failing together using AI reasoning over the cluster signature and the surrounding context like deploys, config changes, and infrastructure metrics.
What categories of fixes does Automated RCA propose?
Three categories. Selenium framework guidance for stale locators, wait-strategy issues, or missing assertions. SBOX configuration for timeout tuning, capability adjustments, or build-level settings. Infrastructure for backend latency, Kubernetes scaling, network jitter, or dependency outages.
Does Automated RCA send my session logs to a public AI service?
No. Cluster RCA uses bring-your-own LLM. The customer configures Automated RCA to point at their existing OpenAI, Anthropic, or Azure OpenAI subscription, or a self-hosted model. Per-session debug analysis runs on deterministic rules with no model invocation. Failure data and logs stay inside the customer perimeter.
How do I activate Automated RCA?
Per-session debug analysis is on by default for every SBOX session. Cross-session cluster RCA activates with one capability flag at the build level: e34:rcaCluster: true in your WDIO config. Existing Selenium specs do not change.
What does an RCA summary include?
A failure cluster identifier, confidence score, the pattern observed across the cluster, the suggested cause attributed to a deploy or config change, and one or more proposed fixes categorized into Selenium framework, SBOX configuration, or Infrastructure. Every RCA writes to the session record for engineering audit.

Send us your last red build. We send back the RCA.

Share the failed cluster from your most recent red build. We run Automated RCA on your private grid and return the cluster summary, attributed cause, and proposed fixes across Selenium / SBOX / Infrastructure categories. Annual licensing, predictable across the contract term. Talk to sales for a scoped quote.