Guide

Sill vs SplitSignal: Two Bayesian Approaches to Different Problems

SplitSignal and Sill both use Bayesian statistical methods, but they apply them to fundamentally different problems. SplitSignal, built inside the Semrush ecosystem, uses Google's CausalImpact methodology to forecast what organic traffic would have been without an on-site change. Sill models the probability of AI citation shifts across six platforms after a content intervention. SplitSignal has no AI or GEO testing capability. Semrush's separate AI Search monitoring tab ($99/mo add-on) tracks AI mentions but is not connected to SplitSignal's experimentation engine. This comparison breaks down the methodology, requirements, and fit for each.

A construction crane rising above a cityscape, a visual metaphor for building experimentation infrastructure on top of the existing monitoring layer.

What Each Tool Does

SplitSignal tests SEO changes on organic traffic using CausalImpact inside Semrush; Sill tests GEO changes on AI citations across six platforms.

SplitSignal is Semrush's SEO A/B testing product. It uses Google's CausalImpact methodology: a Bayesian structural time-series model that uses 100 days of historical click data to construct a synthetic control group and forecast expected performance. This is a well-validated statistical approach published by Google Research and widely used in econometrics. The implementation uses client-side JavaScript to apply changes, which is simpler to set up than server-side approaches but less robust for tests that depend on how crawlers render pages. Minimum requirements: 300+ pages and 100,000+ clicks over the prior 100 days. Pricing is custom and enterprise-tier, separate from standard Semrush subscriptions.

Sill is a GEO experimentation platform that measures how content changes affect AI citations across ChatGPT, Perplexity, Gemini, Google AI Overviews, Claude, and Grok. Instead of forecasting organic traffic, Sill models citation probability shifts using a Bayesian estimation approach with statistical controls. Prompts affected by a content change serve as the treatment group; unaffected prompts serve as controls. The system fits models independently per AI platform and uses calibration to establish empirical false positive rates. Minimum requirement: 25 prompts. Pricing: Free (5 prompts), $90/mo Basic (120 prompts), $225/mo Pro (360 prompts), Enterprise (custom). All plans include all 6 AI platforms.

The core distinction: SplitSignal answers "did this page change affect organic traffic?" Sill answers "following this content change, did AI platforms mention our brand more or less frequently?" These are different questions directed at different channels.

Two Bayesian Methods, Different Problems

SplitSignal forecasts organic traffic on page groups using CausalImpact time-series; Sill models citation probability across AI platforms with statistical controls.

Both tools apply Bayesian inference, but the structure of the problems they solve is different. SplitSignal uses CausalImpact to build a time-series forecast: given 100 days of historical click data, what would traffic on these pages have been without the intervention? The synthetic control is constructed from the behavior of untreated page groups during the same period. This works well when you have large, stable traffic volumes on groups of structurally similar pages.

Sill faces a different statistical challenge. AI outputs change 63% of the time between consecutive days, and 91.6% of cited URLs appear on only one platform. The unit of measurement is not page traffic but entity citation: whether and how prominently a brand is mentioned in an AI response. Sill models this per platform rather than averaging across them, because a change that moves ChatGPT may leave Gemini unchanged. Built-in calibration establishes the empirical false positive rate so confidence levels have a measured basis.

SplitSignal has no AI or GEO visibility testing capability. Semrush has added an "AI Search" monitoring tab ($99/mo add-on) that tracks brand mentions across AI engines, but this is observation-only and is not connected to SplitSignal's experimentation functionality. The two products exist in parallel without integration. A team using both would need to manually correlate SplitSignal test results with Semrush AI Search observations, reintroducing the before-and-after problem that experimentation is designed to solve.

Feature and Methodology Comparison

SplitSignal requires 100K+ clicks and 300+ pages for SEO testing; Sill requires 25+ prompts for GEO testing across 6 AI platforms.

Dimension	SplitSignal	Sill
Target	Organic traffic on page groups	AI citations across 6 platforms
Bayesian approach	CausalImpact: Bayesian structural time-series forecast	Bayesian estimation with statistical controls
Implementation	Client-side JavaScript on your site	No site changes required
Minimum requirements	100K+ clicks over 100 days; 300+ pages	25+ prompts
AI platforms	None (no AI/GEO capability)	ChatGPT, Perplexity, Gemini, AI Overviews, Claude, Grok
Ecosystem	Semrush (keyword, competitive, backlink data)	Standalone; GEO monitoring, recommendations, Brand Watchdog
Pricing	Custom enterprise (separate from Semrush subscription)	Free (5 prompts) / $90/mo / $225/mo / Enterprise
AI monitoring add-on	Semrush AI Search ($99/mo); not connected to SplitSignal	Included in all plans; integrated with experimentation

When to Use Each

SplitSignal fits large-traffic SEO testing inside Semrush; Sill fits direct AI visibility experimentation for brands of any size.

Use SplitSignal if you are already invested in the Semrush ecosystem and need to test on-site SEO changes on a site with 100,000+ clicks and 300+ template pages. SplitSignal's strength is that it layers causal SEO testing into an existing workflow without adopting a separate platform. The CausalImpact methodology is well-validated and the integration with Semrush's keyword and competitive data provides context that standalone tools lack.

Use Sill if you need to understand whether content changes are affecting how AI platforms mention and recommend your brand. Sill's minimum of 25 prompts (compared to SplitSignal's 100K+ clicks) makes it accessible to brands that lack the traffic volume for traditional SEO split testing. The per-platform models surface divergence: a change that lifts ChatGPT citations may leave Gemini unchanged, and Sill reports that difference rather than averaging it away.

These tools are not substitutes. SplitSignal tests SEO; Sill tests GEO. A team running SplitSignal to measure the organic traffic impact of title tag changes could run Sill in parallel to measure whether the same changes affected AI citations. The results address different layers of the discovery funnel.

For a full comparison across all testing and monitoring tools in the space, see our guide to GEO testing and experimentation tools in 2026.

Test AI Visibility Directly

Sill measures AI citations across six platforms with statistical controls and per-platform models. Free tier includes monitoring, GEO recommendations, and Brand Watchdog. Experimentation starts at $90/mo.

See Pricing Talk to Us

References

Semrush. "SplitSignal: SEO A/B Testing." 2026. semrush.com
Google. "CausalImpact: Inferring Causal Effects Using Bayesian Structural Time-Series Models." google.github.io
Sill internal data. "Daily AI Monitoring: Volatility and Platform Divergence Across 139 Brands." 2026.
Sill. "Best GEO Testing and Experimentation Tools in 2026." 2026. trysill.com

Get a Demo

Tell us about your brand and we'll be in touch to walk you through Sill.