What Shopify Rollouts Actually Does
For years, running an A/B test on a Shopify store meant installing a third-party app like Shoplift, Intelligems, or plugging VWO into your theme. Shopify's Winter '26 "RenAIssance" Edition changed that in one quiet line: Rollouts is now a native feature inside every Shopify admin, with no app install and no extra monthly fee.
Rollouts is Shopify's name for scheduled theme releases with built-in traffic splitting. You take your currently published theme, spin up a variant, change whatever you want in the theme editor, and send a percentage of visitors to the new version. When the test ends, you either publish the variant or discard it. If you already read our guide on split testing on Shopify, this is the feature that finally lets you skip a lot of the plumbing.
This guide walks through exactly how to use Shopify's native A/B testing: what it does well, how to run your first test, how to handle statistical significance (Shopify doesn't calculate it for you), what to test first, and when you should still reach for a paid tool like Intelligems or Shoplift instead.
How Native A/B Testing Works in Shopify
Rollouts lives in the admin under Online Store > Themes. Next to your published theme, you'll see a new Rollouts button. Clicking it opens a simple flow: name the rollout, set a traffic percentage for the variant, open the theme editor, and make your changes. Anything you don't touch stays synchronized with the live theme, so a rollout is a fork, not a full clone.
That's the headline mechanic. Underneath, Shopify is doing three useful things for you:
- Traffic splitting at the edge — visitors are deterministically bucketed into control or variant, so a returning shopper sees the same version twice
- Scheduling — you can set a start and end date for each rollout, which is handy for seasonal drops, BFCM hero swaps, or a timed pricing test
- Market targeting — on Advanced and Plus plans, you can scope a rollout to a specific Shopify Market, so a rebrand can go live in the UK without touching your US storefront
According to Shopify's own changelog post, Rollouts is available to all plans for scheduled releases, but the A/B split feature (the "Treatment percentage") requires Advanced or Plus. On Basic, you still get scheduled theme releases — just without the traffic split.
What you can and can't change inside a rollout
Because a rollout is scoped to the theme editor, anything you can drag, click, or type in there is fair game. That includes sections, blocks, text, colors, section order, and settings. What's off-limits:
- Liquid template edits — rollouts don't capture changes to
.liquidfiles - Vintage themes — only Online Store 2.0 themes are supported
- Backend logic — you can't split on cart rules, discount codes, or server-side personalization
Agencies like Conspire have pointed out that this keeps Rollouts firmly in the "visual A/B test" lane — perfect for a new hero, a swapped CTA, or a redesigned product page, but not for testing a different checkout flow or dynamic pricing rule.
Setting Up Your First Rollout: Step by Step

If you want to try it on a throwaway test before running something real, here's the minimum viable flow. Total time is about ten minutes, assuming you already know what you want to change.
- Open Online Store > Themes in your Shopify admin
- Click Rollouts next to your published theme
- Click Create rollout and give it a clear, dated name — e.g.,
2026-04 PDP sticky CTA test - Set the Treatment percentage — 50% is standard for a clean A/B, but you can start at 10–20% if you want a safer canary
- Click Customize to open the theme editor scoped to the rollout
- Make your changes — swap a section, rewrite a headline, reorder blocks
- Set a start and end date — at minimum, plan for two full weeks
- Save and activate
That's it. Shopify immediately starts splitting traffic between your live theme and the rollout variant. Metrics appear in the Rollouts dashboard: sessions, conversion rate, revenue, and orders per variant.
Naming conventions that save your future self
You will forget what Test 2 was about three weeks from now. A disciplined naming convention is the single highest-ROI thing you can do before you run any tests. Use: `[YYYY-MM] [page] [element] [hypothesis]` — for example, 2026-04 PDP sticky-ATC mobile-only. This makes it trivial to filter completed tests, remember which variant won, and document your learnings in a shared spreadsheet.
If you're building a broader experimentation practice, pair this with the advice in our Shopify conversion rate optimization tips guide. Tests without a system to track them compound to zero.
Statistical Significance: The Part Shopify Doesn't Do For You
Here's the single most important limitation of native Rollouts in 2026: Shopify does not calculate statistical significance for you. The dashboard shows conversion rate and revenue per variant, but there's no confidence interval, no p-value, and no "your test is ready to call" flag. That means it's on you to avoid the most common A/B testing mistake — calling a winner too early.
The minimum traffic you actually need
Shopify's own guide to A/B testing recommends at least 1,000 conversions per variant to reach a meaningful result at 95% confidence. For a store with a 2.5% conversion rate, that translates to roughly 40,000 sessions per variant, or 80,000 sessions across the test. Most independent Shopify merchants don't hit that in two weeks on a single page.
Here's a realistic decision matrix based on your daily sessions:
| Daily sessions | Test duration | Can you A/B test? |
|---|---|---|
| Under 500 | 4+ weeks | Only big, bold swings — reserve native tests for 50%+ design changes |
| 500–2,000 | 2–3 weeks | Yes, but focus on high-leverage pages (PDP, homepage) |
| 2,000–10,000 | 2 weeks | Yes — you can realistically ship a winner every sprint |
| 10,000+ | 1–2 weeks | Run 2–3 tests concurrently on different page types |
Agencies including Charle and Neat Digital recommend a hard minimum of 14 days regardless of traffic, to cover two full weekly cycles. Shopper behavior shifts wildly between weekdays and weekends; a seven-day test oversamples whichever side of the week you started on.
How to calculate significance without a plugin
Paste your results into any free calculator — Easy Apps' A/B test calculator or VWO's statistical significance tool work fine. You need four numbers: visitors and conversions for both the control and the variant. Aim for 95% confidence or higher before you declare a winner. Anything below 90% is noise dressed up as a result.
What to Test First on Your Shopify Store

The temptation when you get a new toy is to test button colors. Don't. The highest-leverage first tests on any Shopify store are the ones that touch purchase intent directly — the product detail page, the homepage hero, and the primary CTA.
Product detail page (PDP)
The PDP is where most buying decisions happen and, according to shopper journey data, accounts for the largest share of drop-offs before add-to-cart. High-impact PDP tests:
- Sticky add-to-cart bar on mobile — apparel and accessories stores often see 5–12% ATC lift
- Image gallery layout — grid vs. carousel vs. stacked
- Review placement — above the fold vs. below the buy box
- Bundle/upsell widget — present vs. absent vs. collapsed
- Shipping and returns badge — trust signals near the CTA
Our guide on Shopify product page layouts that convert has the layout patterns that most commonly win these tests.
Homepage hero
Shopify's own data shows that nearly 29% of shoppers move from homepage to collection before purchasing. The homepage isn't where people buy — it's where they decide whether to keep looking. Test:
- Hero image or video — lifestyle shot vs. product-focused vs. typography-led
- Headline copy — benefit-led vs. product-led vs. promotional
- Primary CTA text —
Shop the collectionvs.Start herevs.See the full range - Number of above-the-fold content blocks — one focused CTA vs. multiple entry points
CTA copy and placement
Button copy is the classic A/B test for good reason — it's cheap to change, and small lifts compound across every page view. Replo's A/B test example library has documented cases where swapping Buy now for Add to cart drove a 47% conversion rate increase on a PDP. Don't take that as a law — test it on your own store, because context and audience matter.
Pricing and promotions
If you're on Shopify Plus and comfortable with pricing tests, Rollouts can carry a theme-level pricing display change (strikethrough style, discount badge copy, urgency messaging). For actual variant price tests across SKUs, you still need a dedicated tool like Intelligems.
Native A/B Testing vs. Third-Party Apps: When to Switch

Native Rollouts is a genuinely useful tool, but it's not a replacement for a purpose-built experimentation platform for every merchant. Here's the honest trade-off, drawing on comparisons from VWO's Shopify split testing roundup and the Intelligems vs. Shoplift comparison.
| Need | Native Rollouts | Shoplift / Intelligems | VWO / enterprise |
|---|---|---|---|
| Visual theme A/B tests | Yes | Yes | Yes |
| Statistical significance built in | No | Yes | Yes |
| Segment by new vs. returning | No | Yes | Yes |
| Price, shipping, discount tests | Limited | Yes (Intelligems) | Yes |
| Heatmaps and session recordings | No | No | Yes |
| Multi-variate testing (MVT) | No | Partial | Yes |
| Cost | Included | ~$74/mo+ | $$$$ |
| Liquid/template-level tests | No | Yes | Yes |
Use native Rollouts when
- You're on Advanced or Plus with modest test volume (one to three tests per month)
- Your tests are visual — section swaps, copy changes, hero variants
- You don't want another monthly SaaS bill and you're comfortable running significance math externally
- You want to schedule seasonal theme launches cleanly, even if A/B isn't the point
Switch to Shoplift or Intelligems when
- You need built-in significance calculations and winner detection
- You want segment targeting (first-time vs. returning, mobile vs. desktop, market vs. market)
- You're running pricing, shipping-fee, or discount experiments (Intelligems specifically)
- You need Liquid-level control over what you're testing
Switch to VWO or Convert when
- You need heatmaps, session recordings, and funnel analytics alongside A/B
- You're running a mature experimentation program with 5+ concurrent tests
- You have a CRO analyst or agency interpreting results weekly
For most stores doing under $5M/year, the right path is: start on native Rollouts, graduate to Shoplift or Intelligems when you outgrow it. Don't buy enterprise tooling you won't use. If you're not sure where you are on that curve, a Shopify conversion specialist can review your current setup and recommend the right tier.
Analyzing Results Without Fooling Yourself

A test that looks like a winner on day four is almost never a real winner. The graveyard of ecommerce A/B testing is full of stores that shipped the "winning" variant, watched revenue tank, and couldn't figure out why.
A few rules that will save you from yourself:
- Let every test run at least 14 days even if it looks significant on day six
- Check mobile and desktop separately — a variant that wins overall can lose badly on mobile and drag average order value down
- Look at revenue per visitor, not just conversion rate — a test that lifts CR by 10% but drops AOV by 12% is a net loss
- Segment by traffic source — if paid traffic loves the variant but organic hates it, the "winner" depends entirely on your traffic mix
- Document every test in a shared doc — hypothesis, dates, results, and decision, even for losers
Convert's analysis of the Winter '26 release makes a specific point here: because Rollouts doesn't export raw session data, you have to build your reporting habit manually. Take a weekly screenshot of the dashboard, log the underlying numbers in a spreadsheet, and run your significance calculator yourself. This is the workflow tax of using the native tool.
If you want to go deeper on the measurement side, our conversion optimization category collects the benchmarks and playbooks you should use to judge whether a result is meaningful for your category.
Common Mistakes to Avoid
Every merchant new to A/B testing makes at least three of these in their first month. Knowing them ahead of time saves you a quarter of wasted tests.
- Calling winners early — the single biggest source of bad decisions; wait out the full 14 days minimum
- Testing too many things at once in one variant — if you changed the hero image, the headline, and the CTA, you don't know which one moved the metric
- Ignoring external events — a test running through a flash sale, holiday, or paid media spike is contaminated data; pause or discard it
- Testing trivial changes on low-traffic stores — button color on 300 visitors/day is noise forever; spend that statistical budget on bold changes
- Not documenting the losers — the losing tests teach you more about your audience than the winners, and you'll repeat them in six months if you don't log them
- Running concurrent tests on the same page — two tests on the PDP simultaneously contaminate each other; stagger them
- Skipping a hypothesis — "let's see what happens" is not a hypothesis; write down what you expect and why before you ship
- Forgetting mobile — 70%+ of Shopify traffic is mobile; a desktop-only win isn't a win
The teams that compound gains year over year treat experimentation as a habit, not a feature. Two tests a month, documented and shipped with discipline, will beat ten frantic tests per quarter with no process.
Rollouts on Different Shopify Plans
One thing worth being explicit about: the full A/B feature set is gated by plan tier. Here's what you actually get at each level, based on the Shopify Help Center documentation:
- Basic and Shopify plans — scheduled theme releases only. You can stage a new homepage for a Black Friday launch, but you cannot split traffic.
- Advanced — full A/B testing, treatment percentage control, and market targeting. This is the sweet spot for most merchants ready to test.
- Plus — everything on Advanced, with higher concurrency limits, more granular market segmentation, and priority support for experimentation use cases.
If you're on Basic and can't justify a plan upgrade just for Rollouts, Shoplift on Shopify's App Store starts around $74/month and gives you full A/B plus significance calculations. Depending on your volume, that may be cheaper than moving to Advanced. For a broader look at optimization apps, browse our theme design resources.
Putting It All Together
Native A/B testing is the single biggest conversion-optimization upgrade Shopify has shipped in years. It removes the friction of installing another app, eliminates a monthly subscription, and makes experimentation a first-class citizen in the admin. For most stores under $5M/year, it's enough — assuming you respect the trade-offs: no built-in significance, no segmentation, no pricing tests.
Start with one test on your highest-traffic page (usually the PDP or homepage). Set a 50/50 split, run it for 14 days, and resist the urge to peek at day five. Calculate significance externally with a free tool, ship the winner, and document everything. Do that twice a month and you'll outperform 80% of Shopify stores, who either test nothing or test everything without process.
When you outgrow it — when you need segmentation, pricing tests, or heatmaps — graduate to Shoplift, Intelligems, or VWO. But don't skip ahead. The discipline you build running native tests is the same discipline that makes paid tools worth their price later.
Want to compare approaches or get feedback on a test you're planning? Join the Talk Shop community and share what you're working on. More practical walk-throughs are on our blog.
What's the first test you're going to run on Rollouts? Drop it in the community and we'll help you tighten the hypothesis before you ship.

About Talk Shop
The Talk Shop team — insights from our community of Shopify developers, merchants, and experts.
