← Reference Deep-Dives Reference Deep-Dive · 7 min read

How to Test New Flavors and Product Variants with Consumers

By Kevin

Testing a new flavor or product variant is not a taste test — it is a portfolio strategy decision that requires understanding whether the variant will bring new consumers into the brand, expand purchase occasions for existing consumers, or simply redistribute existing volume across more SKUs. The last outcome is the most common and the most expensive, yet most CPG variant testing is not designed to detect it.

The economics are unforgiving. Launching a new variant in CPG costs $5-15M when you factor in production setup, packaging, slotting fees, trade promotion, and marketing support. A variant that cannibalizes existing sales rather than generating incremental volume is not just a failed product — it is an active drain on portfolio profitability. The research question is not “do consumers like this?” but “will this variant make the portfolio stronger?”

The Variant Proliferation Trap

CPG companies launch variants because they are one of the fastest paths to visible growth. A new flavor generates trade excitement, earns incremental shelf space (temporarily), and produces a sales spike from trial. The problem is that most of this activity is not incremental — it is redistributive.

The data is sobering. Analysis by Catalina and the Food Marketing Institute found that the average CPG brand could eliminate 30-40% of its SKUs with minimal impact on total revenue. The variants that would be cut are not failing — they are cannibalizing each other, splitting volume across more SKUs without growing the category or the brand’s total consumption.

The trap operates through a predictable cycle. Brand team launches a variant. Initial sales look promising because they include trial volume from existing brand buyers. Six months later, the original SKUs have lost volume roughly equal to the new variant’s steady-state sales. Total brand volume is flat, but the brand now carries more SKUs (higher supply chain cost), has weaker velocity per SKU (weaker shelf position), and has used trade budget on a launch that produced no incremental growth.

Breaking this cycle requires variant testing that explicitly measures incrementality — not just preference. For CPG brands, this means asking different questions and using different research designs than the standard concept test.

Monadic vs. Sequential Testing for Variants

The choice between monadic and sequential testing design has significant implications for variant research, and most teams default to the wrong one.

Monadic testing assigns each consumer to evaluate a single variant. Its advantage is ecological validity — consumers do not compare four flavors side by side at the grocery store. They encounter one product and decide whether it is worth trying. Monadic testing produces uncontaminated assessments of absolute appeal and purchase intent. Its limitation is that it requires larger samples (one cell per variant) and cannot directly compare variants within the same consumer.

Sequential monadic testing has each consumer evaluate multiple variants in a randomized order. Its advantage is efficiency — fewer total participants are needed — and it produces direct comparative data. Its limitation is that order effects (the first variant evaluated gets different treatment than the fourth) and contrast effects (a good variant makes an average one look worse) distort absolute assessment.

For CPG variant decisions, the recommendation is to use monadic design for the critical go/no-go question (“does this variant meet our launch threshold?”) and sequential design for the portfolio question (“how do these variants rank against each other and against the existing line?”). AI-moderated platforms can run both designs efficiently — the concept testing methodology adapts the conversation flow based on whether the participant is in a monadic or sequential cell.

The more important design decision is sample composition. Variant testing must include three distinct consumer segments: current brand loyalists (will they switch from existing variants?), competitive brand buyers (will this variant attract them?), and light or lapsed category buyers (will this variant create new purchase occasions?). The ratio of each segment in the results determines whether the variant is cannibalizing, converting, or expanding.

Sensory Language Analysis

When consumers describe flavors and product experiences, their word choice reveals more than their preference rankings. Sensory language analysis — systematically examining the vocabulary consumers use to describe variants — uncovers the emotional and functional associations that predict market performance.

Consider the difference between a consumer who describes a new snack flavor as “interesting” versus one who says “comforting.” Both might rate it 4 out of 5 on a liking scale, but their language signals different relationships with the product. “Interesting” predicts trial but not repeat — the consumer is curious but not attached. “Comforting” predicts repeat purchase and potential loyalty — the consumer has formed an emotional connection.

Other language patterns carry diagnostic value. “Clean” and “fresh” signal health associations that expand the usage occasion set. “Heavy” and “rich” signal indulgence associations that limit consumption to specific moments. “Familiar” suggests the variant is too close to existing options (cannibalization risk). “Different” needs context — different-good drives trial; different-weird kills it.

AI-moderated interviews are uniquely suited to sensory language analysis because they generate extensive verbatim at scale. When 200 consumers describe a new variant in their own words, language patterns become statistically observable. You can see that 65% of consumers use warmth-associated language (“cozy,” “comforting,” “homey”) for Variant A but only 20% use it for Variant B, which instead triggers freshness language (“bright,” “light,” “crisp”). These patterns map directly to usage occasion and repeat potential.

The consumer insights for CPG guide discusses how language mining from AI-moderated interviews feeds into positioning and packaging development.

Cannibalization Research

Cannibalization is the variant decision’s critical variable, and it is the one most poorly measured by traditional methods. Standard concept tests ask “would you buy this?” without asking “what would you buy instead?” This omission makes every new variant look incremental because the substitution effect is invisible.

Effective cannibalization research makes substitution explicit. After establishing interest in the new variant, the conversation explores: “If you started buying this, what would you buy less of?” “Would this replace something you currently buy, or would it be in addition to what you already buy?” “Think about the last time you bought in this category — would this variant have changed what you picked up?”

The responses fall into four categories, each with different strategic implications.

Competitive displacement: The consumer would switch from a competitor’s product to the new variant. This is the best outcome — genuine share gain. If 40%+ of interested consumers would displace a competitor, the variant is strongly incremental.

Category expansion: The consumer would buy the variant in addition to their current purchases, on occasions when they currently do not buy from the category at all. This is the second-best outcome — growing total category consumption. New flavors that unlock new usage occasions (e.g., a savory snack brand launching a sweet variant that captures afternoon snacking from a different category) fall here.

Portfolio cannibalization: The consumer would substitute the new variant for an existing SKU in the same brand. This is the trap. Unless the new variant has a higher margin than the cannibalized SKU, the net effect is negative.

Pantry loading without consumption increase: The consumer would buy the new variant and keep buying existing products, but total consumption would not increase. They simply stockpile. This creates a temporary sales spike that disappears after the initial pantry-fill period.

Deep AI-moderated interviews can distinguish these scenarios because they probe the specific decision logic, not just the stated preference. A consumer saying “I’d probably try this instead of my usual” needs further exploration: Instead of your usual what — your usual product in this brand, or your usual product in a different brand? Understanding the substitution target changes the strategic calculus entirely.

From Test to Shelf: Variant Launch Criteria

The final stage of variant research is establishing clear launch criteria that prevent enthusiasm from overriding evidence. Effective launch gates include the following.

Incrementality threshold: At least 35-40% of interested consumers should indicate competitive displacement or category expansion as the source of their volume. Below this threshold, the variant is primarily redistributive.

Sensory language profile: The variant should generate a distinct language profile from existing SKUs. If consumers describe it using the same vocabulary as the current lead variant, it occupies the same mental territory and will cannibalize. Distinct language indicates distinct positioning.

Segment appeal distribution: The variant should attract a different consumer profile than the existing line. If it appeals primarily to the same heavy brand buyers, it is splitting loyalty rather than building breadth.

Decision simplicity: Adding the variant should not create choice paralysis at shelf. If research reveals that consumers in the category already struggle to choose among existing options, adding another option degrades the entire shopping experience. Sometimes product innovation means fewer, better variants rather than more.

Repeat intent language: Consumers should describe the variant in terms that predict repeat purchase (comfort, satisfaction, craving) rather than only trial interest (curiosity, novelty, surprise). A variant with high trial intent but low repeat indicators will produce a launch spike followed by rapid decline.

These criteria should be established before testing begins, not after results are in. Post-hoc rationalization — adjusting the bar to match the data — is the most common way variant launches that should be killed survive to market. The research exists to make portfolio decisions better, but only if the decision framework is set before the evidence arrives.

Frequently Asked Questions

Monadic testing (each consumer evaluates one variant) is better for absolute assessment — does this flavor meet the threshold for launch? Sequential testing (consumers evaluate multiple variants) is better for relative ranking but introduces order and contrast effects. For CPG variants, use monadic design for go/no-go decisions and sequential design for portfolio prioritization. AI-moderated interviews can run both designs efficiently at scale.
Ask consumers who express interest in the new variant which existing product they would replace or reduce purchasing. If 70%+ would substitute an existing SKU in your portfolio rather than a competitor's product or a net-new purchase occasion, the variant is cannibalizing rather than growing. Deep interviews reveal the specific substitution logic that surveys cannot capture.
Test 4-6 concepts initially, narrow to 2-3 for refinement testing, and validate 1-2 for launch. Each stage requires 60-100 consumer conversations with verified category purchasers. The total investment of 200-300 conversations across stages is a fraction of the cost of launching a variant that fails — which averages $5-10M in wasted production, distribution, and marketing spend for mid-size CPG brands.
Sensory language analysis examines the words consumers use to describe flavors, textures, and experiences — words like 'bright,' 'heavy,' 'clean,' or 'comforting.' These descriptors reveal emotional and functional associations that drive repeat purchase. A variant described as 'interesting' performs differently than one described as 'comforting' — the first drives trial, the second drives loyalty.
When it primarily attracts existing brand buyers rather than new consumers, when sensory language suggests trial interest but not repeat potential, when it creates decision complexity that slows the entire shelf set, or when it requires a price point that undermines the brand's value positioning. The variant launch decision is a portfolio decision, not just a product decision.
Get Started

Put This Research Into Action

Run your first 3 AI-moderated customer interviews free — no credit card, no sales call.

Self-serve

3 interviews free. No credit card required.

Enterprise

See a real study built live in 30 minutes.

No contract · No retainers · Results in 72 hours