What does Automated RCA do?

Automated RCA clusters failed test sessions by shared cause, attributes the root cause in plain English, and proposes a specific fix.

How does Automated RCA cluster failures?

RCA analyzes failure signatures across stack traces, error messages, DOM snapshots, network behavior, and timing patterns.

Does Automated RCA use AI?

Yes. RCA uses the customer's chosen LLM to translate technical failure signatures into plain-English explanations and to propose code-level fixes.

Which failure types does RCA cover?

Locator failures, timing failures, network failures, environment failures, and assertion failures.

How accurate is the proposed fix?

Each proposed fix includes a confidence score from the LLM and a recommended code patch. Engineers review and apply the fix manually.

Automated RCA · Root cause AI

Stop chasing flakes. Find the cause.

Automated RCA clusters failed sessions, summarizes the root cause, and proposes the fix. The per-session debug analyzer surfaces network, console, and command logs for any single failure. The cross-session RCA spots the pattern when the same flake hits seven specs at once.

Book a demo Get a quote

Per-session debug analyzer
Cross-session cluster RCA

Triage time

Morning → 15 min

Activation

1 flag

Seven sessions failed last night. They failed for the same reason.

Every long-lived test suite carries the same hidden cost. When a deploy ships a regression, the same root cause fails dozens of sessions at once. Engineering opens seven debug tickets, reads seven traces, finds one cause. Automated RCA collapses that into one summary, attached to the pattern, with a proposed fix.

Pain pattern 01

Deploy-day regression

Symptom. A backend service deploys, p95 latency climbs from 340ms to 9.4s, and seven E2E specs time out on dashboard load. Each session shows TimeoutError. Engineering reads each trace individually.

Automated RCA response Clusters the seven failures. Surfaces the p95 spike and the offending commit hash. Proposes a retry-budget change plus a wait extension on dashboard-load.

Pain pattern 02

Infrastructure starvation

Symptom. Kubernetes pods take 30 minutes to scale. Queue wait time grows. Tests fail with start-time exhaustion. The session log just shows the timeout, not the cause.

Automated RCA response Categorizes the recommendation as Infrastructure. Suggests a scaling strategy for your private grid. The engineer escalates to the platform team with a specific ask, not a vague complaint.

Pain pattern 03

Slow-command drift

Symptom. A command that normally takes 200ms starts taking 700ms after a feature flag flip. Tests don't fail, they get slower. Nobody notices until p95 SLOs break.

Automated RCA response Per-session debug analyzer flags slow commands against the historical baseline (over 500ms is the default threshold). Surfaces the flagged commands before they cause failures.

How it diagnoses

Two tiers. Per-session debug. Cross-session cluster.

Automated RCA is two capabilities under one name. The per-session debug analyzer answers 'why did this single session fail'. The cross-session cluster RCA answers 'why are these seven sessions failing together'. Engineers get both, automatically, on every failed run.

Tier 1 · default

Per-session debug analyzer

Surfaces network logs, console logs, and command logs for the failed session. Flags slow commands against historical baseline. Categorizes recommended fixes into three buckets.

            Slow cmd flagged:

            driver.findElement('#dashboard-greeting')

            Took: 7.2s (baseline: 340ms)

            Category: Infrastructure

            Recommendation: Investigate /api/dashboard p95

Deterministic. No model. On by default.

Tier 2 · activate per build

Cross-session cluster RCA

Clusters failed sessions across the build by signature. Identifies common root cause across the cluster. Attributes the cluster to a specific commit or change. Proposes a concrete fix.

            Cluster: 7 specs · confidence 94%

            Pattern: TimeoutError on dashboard-greeting

            Cause: Backend p95 340ms → 9.4s

            After deploy: 7c9a2b1

            Fixes: Selenium · SBOX · Infrastructure

Runs on your model. BYO-LLM.

Same failure set. Two response modes.

Manual triage takes a morning. Automated RCA takes a minute.

The same seven failed sessions land in the queue. Without RCA, engineering opens each one individually. With RCA on, the system identifies the cluster, attributes the cause to a deploy or config change, and queues the fix for review.

✕ Without Automated RCA

Seven failures. Seven separate investigations.

14checkout.smokeTicket
14auth.flow.ssoTicket
15billing.api.handshakeTicket
15search.spec.locale-deTicket
16search.spec.locale-enTicket
16profile.editTicket
17cart.add.line-itemTicket

Result: 7 tickets in the queue. 7 engineers triaging the same root cause. ~3 hours of duplicated work before someone connects the dots.

✓ With Automated RCA

Seven failures. One cluster. One fix queue.

RCA Summary · Cluster #042 · 94% confidence

Cluster 7 specs grouped by failure signature

Pattern TimeoutError on dashboard-greeting

Cause Backend p95 rose 340ms → 9.4s after deploy 7c9a2b1

Proposed fixes

Selenium Bump retry budget to 3 attempts on dashboard-greeting wait

SBOX Increase per_test_timeout_ms to 800000 for affected suite

Infra Add p95 alert on /api/dashboard, review commit 7c9a2b1

Result: 1 RCA summary. 1 fix queued for review. ~15 minutes from failure to assigned owner.

Same failure set in both runs. Only the e34:rcaCluster capability changed.

Activation

Two flags. Per-session debug + cluster RCA.

Per-session debug analysis is on by default for every SBOX session. Cross-session cluster RCA activates with a single capability flag at the build level. Existing Selenium specs do not change.

Add to your capabilities BETA

wdio.conf.js

1capabilities: {
'wdio:enforceWebDriverClassic': true,
browserName: 'chrome',
'e34:per_test_timeout_ms': 600000,
'e34:testName': 'Checkout smoke',
'e34:buildName': 'release-2026.05.29',
'e34:autoHeal': true,
'e34:debugAnalyzer': true,        // per-session, on by default
'e34:rcaCluster': true,           // ← THE FLAG for cross-session RCA
'e34:token': 'jenkinshardcap12',
acceptInsecureCerts: true,
12}
      

✓ Same capabilities block you already use. Two RCA flags clearly named. The cluster flag activates cross-session RCA at the build level.

Three fix categories

Selenium. SBOX. Infrastructure.

Generic "something failed" messaging is useless to a busy engineer. Automated RCA bins every recommendation into one of three categories so the right person owns the fix without escalation hops.

Category 01

Selenium framework

When the cause is at the framework or test level. Fix stale locators, tune wait strategies, add missing assertion guards.

Example recommendation Test uses implicit wait. Switch to explicit WebDriverWait on dashboard-greeting for predictable behaviour.

Category 02

SBOX configuration

When the cause is in the test infrastructure configuration. Loosen timeout settings, align capability declarations, add missing browser flags.

Example recommendation Per-test timeout (600s) is tight relative to current p95 (8.4s). Increase per_test_timeout_ms to 800000 for this suite.

Category 03

Infrastructure

When the cause is below the test layer. Identify backend latency spikes, fix pod scaling delays, reduce network jitter, resolve dependency outages.

Example recommendation Backend p95 on /api/dashboard rose 340ms → 9.4s after deploy 7c9a2b1. Add alert on backend p95. Review commit 7c9a2b1.

Private inference

Cluster reasoning runs on your model.

Automated RCA's cluster summarization uses your bring-your-own LLM. Per-session debug analysis runs on deterministic heuristics with no model invocation. Failure data and logs stay inside the customer environment.

How Automated RCA handles AI without exposing failure data

Most heals and most diagnostics never invoke a model. Cluster reasoning uses the model the customer already trusts, configured in the SBOX config.

BYO-LLM (cluster tier)

Use your OpenAI, Anthropic, or Azure OpenAI subscription. Or a self-hosted model running inside your environment.

No LLM for the common case

Per-session debug analyzer uses deterministic rules and historical baselines. No model call. No token spend.

Failure data stays inside your perimeter

Session logs, network captures, and stack traces never leave your environment. Element34 has no visibility into your failures.

Automated RCA in context

Studio writes. Auto Heal heals. Automated RCA explains.

The five AI native modules inside SBOX cover the full lifecycle. Automated RCA is the diagnostic layer.

AI authoring

Studio

Plain-English to production-grade Selenium. Authoring entry point.

Explore Studio →

Self-healing

Auto Heal

Resilient locators. Heals what would have caused a flake before RCA sees it.

Explore Auto Heal →

Quality intelligence

Pulse Report

Site health view. Aggregates RCA findings over time into prioritized opportunities.

Explore Pulse Report →

Mobile testing

Real Device Cloud

Real iOS and Android devices. RCA applies the same way as desktop browser sessions.

Explore Real Device Cloud →

Automated RCA FAQ

Questions enterprise teams ask about Automated RCA.

What is Automated RCA in Element34 SBOX?

Automated RCA is the diagnostic module inside Element34 SBOX. It works in two tiers: a per-session debug analyzer that surfaces network, console, and command logs for any failed session, and a cross-session cluster RCA that groups related failures, attributes them to a root cause, and proposes a fix.

How is the per-session debug analyzer different from cross-session RCA?

The per-session debug analyzer answers why a single session failed using deterministic rules and historical baselines, with no model invocation. The cross-session cluster RCA answers why a group of sessions are failing together using AI reasoning over the cluster signature and the surrounding context like deploys, config changes, and infrastructure metrics.

What categories of fixes does Automated RCA propose?

Three categories. Selenium framework guidance for stale locators, wait-strategy issues, or missing assertions. SBOX configuration for timeout tuning, capability adjustments, or build-level settings. Infrastructure for backend latency, Kubernetes scaling, network jitter, or dependency outages.

Does Automated RCA send my session logs to a public AI service?

No. Cluster RCA uses bring-your-own LLM. The customer configures Automated RCA to point at their existing OpenAI, Anthropic, or Azure OpenAI subscription, or a self-hosted model. Per-session debug analysis runs on deterministic rules with no model invocation. Failure data and logs stay inside the customer perimeter.

How do I activate Automated RCA?

Per-session debug analysis is on by default for every SBOX session. Cross-session cluster RCA activates with one capability flag at the build level: e34:rcaCluster: true in your WDIO config. Existing Selenium specs do not change.

What does an RCA summary include?

A failure cluster identifier, confidence score, the pattern observed across the cluster, the suggested cause attributed to a deploy or config change, and one or more proposed fixes categorized into Selenium framework, SBOX configuration, or Infrastructure. Every RCA writes to the session record for engineering audit.

Send us your last red build. We send back the RCA.

Share the failed cluster from your most recent red build. We run Automated RCA on your private grid and return the cluster summary, attributed cause, and proposed fixes across Selenium / SBOX / Infrastructure categories. Annual licensing, predictable across the contract term. Talk to sales for a scoped quote.

Book a demo Get a quote