- Compare two or more campaign concepts head-to-head (for example, a 10% discount vs. free shipping).
- Validate a hypothesis about how a specific mechanic changes customer behavior.
- Establish a clean control group that doesn’t receive the campaign at all, so you can isolate net incremental revenue.
How it works
Running an A/B test by segments takes three steps:- Create an A/B test. Define the participant segment, the number of variants, how traffic is split between them, and the metrics you want to track (conversion rate, average order value, or ARPU).
- Wire each variant into your campaigns. Use a filter in every campaign mechanic so that only customers from the matching test variant receive that version.
- Wait for results. Maestra Platform monitors statistical significance and notifies you when the test reaches a conclusion — either confirming the hypothesis or rejecting it.
Each customer is assigned to exactly one variant for the lifetime of the test. That means a customer in variant A will never accidentally receive variant B, no matter how many campaigns the test feeds.
Step 1. Create the A/B test
Open the A/B tests section and create a new test. You’ll be asked to configure the following fields.Hypothesis
A short statement of what you expect the test to prove or disprove — for example, “Customers who receive a 10% discount email will place 15% more orders than customers who receive a free shipping email.” A clearly written hypothesis keeps the team aligned and makes it easier to interpret the report later.Participants
The segment of customers eligible for the test. Maestra Platform will randomly assign every customer in this segment to one of the variants. A few rules of thumb:- The smaller the segment, the longer the test will take to reach significance.
- The segment should be large enough that each variant ends up with a representative sample.
- Avoid changing the participant segment after the test has launched — it invalidates the random assignment.
Number of variants and traffic distribution
Choose how many variants you want to test and how to split traffic between them. Shares are entered as fractions and automatically recalculated into percentages. A typical setup:| Variant | Share | Purpose |
|---|---|---|
| 0 | 0.25 | Control group — receives nothing |
| 1 | 0.25 | Discount email |
| 2 | 0.25 | Free shipping email |
| 3 | 0.25 | Loyalty points bonus |
Control group
If you want to measure the net incremental effect of a campaign — that is, how much extra revenue it actually generates compared with doing nothing — designate variant 0 as your control group and don’t assign any campaign mechanic to it. The report will then compare every other variant against variant 0 as the baseline.Metrics
Pick a primary metric and, optionally, a secondary metric:- Conversion rate — the share of customers in each variant who completed the target action (a purchase, a registration, etc.).
- Average order value (AOV) — average revenue per order placed by customers in the variant.
- ARPU — average revenue per customer in the variant, regardless of whether they ordered.
Advanced settings
These come pre-filled with sensible defaults, but you can adjust them if you know what you’re doing:- Expected lift — the minimum improvement you want to be able to detect. Smaller expected lifts require larger samples.
- Statistical power — the probability that the test will detect a real effect if one exists. Default is 80%.
- Confidence level — the probability that the detected effect is not due to random chance. Default is 95%.
Step 2. Add each variant to your campaigns
Once the test is saved, each variant becomes a filter you can use anywhere on Maestra Platform. For every campaign mechanic that’s part of the test:- Open the campaign (email, SMS, push, pop-up, promo, loyalty rule, etc.).
- In the Recipients or Audience filter, add the condition: Customer is in A/B test variant X.
- Save and launch the campaign.
Step 3. Monitor and stop the test
While the test is running, Maestra Platform tracks the chosen metrics for each variant and updates the report at least once every 24 hours. The report shows:- The number of customers in each variant.
- The value of each metric per variant.
- The percentage difference vs. the baseline (variant 0 or the lowest-numbered variant if there’s no control).
- A statistical significance indicator that turns green once the result is reliable.
Examples
Compare two offer mechanics
You want to know whether a flat 10% discount or free shipping drives more revenue.- Participants: all customers who opened at least one email in the last 30 days.
- Variants: 0 (control, 25%), 1 (10% discount email, 37.5%), 2 (free shipping email, 37.5%).
- Primary metric: ARPU.
- Hypothesis: free shipping will outperform the 10% discount by at least $1 ARPU.
Validate a new welcome series
You want to know whether your redesigned welcome series actually lifts first-order conversion.- Participants: new subscribers from the last 60 days.
- Variants: 0 (control, 50%, no series), 1 (new welcome series, 50%).
- Primary metric: first-order conversion rate.
Best practices
- Write the hypothesis first. If you can’t state what you expect to learn, you’re not ready to run the test.
- Don’t change the test mid-flight. Adding variants, shifting traffic, or editing the participant segment after launch breaks the math.
- Run one test at a time per audience. Overlapping tests on the same customers contaminate each other’s results.
- Give it enough time. Most retail tests need at least one full purchase cycle (typically 2–4 weeks) before results stabilize.
- Always keep a control group when you care about incremental revenue. Comparing two campaigns tells you which one is better; comparing both against a control tells you whether either one is worth running at all.