Braze
Three silent send failures on Braze.
Braze Canvases are flexible, and the flexibility creates a long tail of silent failure modes. Three silent stops the community reports most often on Braze.
The detection gap
What changes when a flow is instrumented.
A silent send failure follows the same six steps on every ESP. The platform handles the first two well. The next four are where the lag lives, and where the wallet card flips the timeline.
Step
Your ESP today
With Telltide instrumented
1. Trigger fires
Logged
Identical, the ESP handles this
2. Send leaves platform
Counted
Identical, the ESP handles this
3. Send fails silently
No surface
Wallet card flips amber
4. You are alerted
Not until a customer complains
Lock-screen banner within minutes
5. Triage starts
Hours later, on the next dashboard scan
From the wallet card, on the same screen
6. Fix lands
After revenue impact is visible
Before revenue impact is visible
The lag is not a tooling absence. It is the difference between a per-event log and a status surface a human checks without thinking. The wallet card is that surface.
Pattern 1 of 3
Canvas control group expands to consume the active audience
What it looks like
A Canvas has a holdout group set to a small percentage. A filter or audience-segment change widens the holdout above its intended size, so a meaningful slice of the audience is enrolled into the control rather than the treatment. The treatment send numbers drop without a Canvas-level error.
Why detection lags
Braze reports the Canvas as running normally. The holdout is a design choice, not a fault, so there is no alarm. The drop only shows up when revenue attribution comes in below plan a few weeks later.
The holdout is a design choice, not a fault, so there is no alarm.
Telltide alert email, illustrative
Subject: Monitor missed: Cart abandon
MISSED WINDOW
No email received: Cart abandon
The Same-day recovery window closed without a matching email.
Investigate monitor →
What to instrument
Seed addresses outside the holdout segment receive a working Canvas. If the seed stops landing while the Canvas still reports as Live and Enrolled, the holdout has expanded.
Pattern 2 of 3
Liquid templating error in subject line drops the message silently
What it looks like
A subject line or block uses a Liquid expression that references a custom attribute that has been renamed or dropped upstream. Braze evaluates the Liquid per recipient at send time, so the expression errors for the slice of the audience whose attribute is missing and those recipients silently do not receive the message.
Why detection lags
Per-recipient Liquid failures are logged in the message activity log, one error per affected user, with no Canvas-level rollup. The Canvas continues to report Live and the slice of the audience whose attributes are populated still receives the send, so the dashboard reads normal while a meaningful cohort is dropped.
Per-recipient Liquid failures stay in the per-user log, with no Canvas-level rollup.
What to instrument
A seed address with the attribute populated lands the message. A seed without it does not. Diverging seed outcomes across attribute states catch templating drift before it affects the live audience.
Pattern 3 of 3
Multivariate variant sends to zero after an audience criteria change
What it looks like
A multivariate test variant has a custom audience filter. An upstream property change makes the filter match nobody. The variant reports zero sends. The other variants absorb the audience and the test reports as running.
Why detection lags
Braze accepts zero-audience variants as a design choice. There is no warning that one arm is silently inactive. The test reads as healthy at the aggregate level.
Braze accepts zero-audience variants as a design choice.
What to instrument
A seed address that matches the variant filter validates the variant. If the seed lands one variant only, the others are silently dead.
Self-audit
Ten ticks. Each one you cannot mark is exposure.
Run this against your own programme. If you finish with three or more unticked, the case for instrumenting one flow this week is the rest of this document.
-
Trigger metrics for your top three flows are reviewed monthly for rename and event-shape drift.
-
Send-skip reasons (send-time optimisation, frequency caps, send classifications, suppression hits) are aggregated to a per-flow report you maintain, since the platform does not roll them up.
-
Webhooks the flow depends on, inbound triggers and outbound callouts both, are pinged on a separate health check that fires independently of the flow.
-
Catalogue or product feed fields used in flow filters have a daily canary record that matches the filter clause and fails the build if the match returns zero.
-
Audience filters on each live flow alert if the matching profile count drops by more than a configured share week-over-week.
-
A pre-send validator catches broken template syntax and unresolved merge tags on every send, not only the compose-time linter.
-
Suppression lists driven by upstream integrations are monitored for runaway growth above a daily baseline.
-
Subscription type, consent, or preference centre changes are versioned and diffed against the previous published state before publish, so silent removals are caught.
-
Lists, segments, or upstream data feeds that power live flows have a freshness SLA and an alert if the SLA breaks.
-
Every flow critical to revenue has a seed address subscribed to it that lands within the flow's normal arrival window.
Detection Guarantee
The Detection Guarantee covers your first 90 days on Starter. If we miss a silent send failure on a monitored flow inside that window, we credit up to three months of the affected tier. One page of terms, plain English, no carve-outs.
2:12
Detail
Welcome 1 missed window
Updated
18 May 2026, 2:12 pm
Your Telltide Card. Green when every monitor is healthy. Flips amber and pushes to your lock screen the moment a flow stops sending.
Start free, instrument one flow. The wallet card unlocks on Starter, with a fourteen-day free trial when you add it. Roughly ten minutes end to end.