← Reference Deep-Dives Reference Deep-Dive · Updated · 11 min read

Market Sizing for a New CPG Category: Research-Backed Approaches

By Kevin, Founder & CEO

Sizing a genuinely new CPG category is one of the hardest analytical challenges in consumer goods. When you are creating something that does not yet have a scanner data footprint, a Nielsen category code, or a dedicated shelf set, the standard top-down sizing approaches produce unreliable estimates because they extrapolate from categories that may share little with your actual opportunity. The cost of getting this wrong runs into the hundreds of millions: oversizing leads to over-investment in manufacturing, distribution, and marketing that the demand cannot absorb; undersizing leaves capital on the table and lets a faster mover capture the category. The solution is demand-side sizing — building estimates from the consumer up rather than the category down — anchored in a structured market intelligence program. This guide covers the four-input framework, the research design that populates each input, and how to translate findings into a defensible TAM, SAM, and SOM model that investors, retail partners, and internal stakeholders can act on.

Why does supply-side sizing fail for new categories?


Traditional market sizing for CPG starts with syndicated data. Identify the category, pull annual sales from Nielsen or IRI, apply growth rates, and project forward. This works well for established categories where the market is well-defined and historical data is abundant — the model is essentially calibrating to known reality.

For genuinely new categories, the approach fails at the first step: there is no category to measure. The standard workaround — sizing by analogy to an existing category — introduces assumptions that can be wildly inaccurate. When early kombucha brands estimated their market based on the juice category, they dramatically overestimated near-term demand because juice consumption was a poor proxy for the functional-beverage occasion kombucha actually served. When early protein bar manufacturers sized against the candy bar category, they captured volume potential but missed the entirely different price architecture their category would support, leading to under-investment in premium tiers that turned out to anchor profitability.

Supply-side sizing also suffers from a structural blind spot: it measures what consumers can buy, not what they want to buy. The existence of shelf space does not imply demand, and the absence of shelf space does not imply its absence. Many genuinely large categories were invisible to supply-side analysis right up until the moment they appeared on shelf — plant-based dairy, hard seltzer, ready-to-drink cocktails, functional mushroom beverages, dental aligners. The signal was in consumer behavior and frustration long before it showed up in scanner data.

Consumer research addresses this directly by measuring demand independent of current supply. The output is a TAM grounded in demonstrated need, articulated willingness to pay, and identified source of volume — not in adjacency assumptions that may or may not hold.

What is the demand-side sizing framework?


Demand-side market sizing builds from four consumer research inputs, each answering a specific question that maps to a specific cell of the TAM calculation.

Input 1: Need-state prevalence

How many consumers experience the need your product addresses? This is the broadest question and the most important to get right, because every subsequent calculation multiplies from this base. A 10-point error in need-state prevalence becomes a billion-dollar error in TAM for a population-level category.

AI-moderated interviews excel at need-state identification because the conversational format surfaces needs consumers may not have articulated before. When you ask someone about a category that does not yet exist, you cannot ask directly — they will fabricate or default to socially acceptable answers. Instead, explore the situations, frustrations, and workarounds that surround the need.

For a new sleep-aid food product, do not ask whether consumers would buy sleep-aid food. Explore their evening routines, their relationship with sleep quality, what they currently eat or drink before bed, whether they have ever modified their evening consumption to improve sleep, and what they wish were different about their current approach. The prevalence of sleep-related consumption behavior — measured behaviorally, not aspirationally — becomes your need-state prevalence estimate.

Conduct 200-300 interviews across a broad consumer base using a 4M+ verified panel to ensure demographic and geographic representativeness. The depth of 30+ minute AI-moderated conversations, with 5-7 level laddering, reveals need-states that survey screening questions would miss entirely.

Input 2: Purchase frequency estimation

Among those with the need, how often would they purchase a solution? Frequency estimation requires understanding the occasions that drive the need-state, not hypothetical purchase intent questions that consistently overstate frequency by 2-4x.

Through conversational research, map the specific occasions when the need arises. How often does a consumer struggle with the problem your product solves? How regularly do those situations occur? What is the current spending pattern on imperfect substitutes? Map current behavior — not stated intent — onto the proposed product’s purchase rhythm.

Current behavior is a more reliable frequency predictor than stated intent. A consumer who already buys chamomile tea three times a week for sleep provides a credible frequency anchor for a new sleep-aid product. A consumer who says they “would probably buy” a sleep-aid product “a few times a month” provides almost no signal.

Input 3: Willingness to pay

What price point do consumers consider reasonable for a purpose-built solution? Pricing research for new categories requires particular care because consumers have no reference frame for appropriate pricing — they cannot tell you, in the abstract, what they would pay for something they have never seen.

Explore pricing through comparison rather than direct questioning. What do consumers currently pay for their imperfect substitutes? What premium would they pay for a product that solved their problem more effectively? How does this product sit in their mental pricing architecture relative to other categories they purchase regularly? The Van Westendorp price sensitivity meter is helpful, but the conversational layer that explains why a given price feels right or wrong matters more for new categories where the reference frame is being constructed by the research itself.

For a detailed exploration of how this pricing methodology integrates with broader CPG consumer insights, see the pillar guide.

Input 4: Source of volume

Where will purchases come from? In CPG, new products rarely create entirely new spending. They displace volume from adjacent categories, substitute for non-category solutions, or convert unspent occasions where the consumer currently buys nothing. Each pathway has different competitive and channel implications.

Understanding source of volume requires exploring current substitution behavior. What do consumers currently use to address the need? How much do they spend on it? Would they reduce spending in existing categories to fund the new product, or would it represent incremental spending? Source of volume analysis distinguishes the “new spending” category (incremental volume, less competitive resistance) from the “share-shift” category (replaces existing spending, triggers competitive response).

This input is critical for investors and retailers who need to understand category competitive dynamics. If your product primarily displaces spending from yogurt, that has different shelf, category management, and competitive response implications than if it displaces spending from supplements.

Input mapping

InputResearch questionMethodOutput cell of TAM
Need-state prevalence% of population experiencing the need200-300 conversational interviewsPopulation multiplier
Purchase frequencyAnnual occasions among those with the needOccasion mapping + current-behavior anchoringAnnual purchase rate
Willingness to payPrice point with strong intentComparative pricing + Van WestendorpPrice per unit
Source of volumeWhich existing spending gets displacedSubstitution behavior probingCannibalization adjustment

How do you build the TAM, SAM, and SOM model?


With four research inputs, the market size calculation flows in three stages.

TAM (Total Addressable Market) = Need-state prevalence × Population × Annual purchase frequency × Price per unit

This is the theoretical ceiling — every consumer with the need, buying at their stated frequency, at the validated price point. Consumer research provides the inputs directly.

SAM (Serviceable Addressable Market) = TAM adjusted for distribution reach, channel access, and geographic availability

Layer onto TAM the realistic distribution footprint. A new product launching in natural and specialty channels first cannot access mass shoppers until distribution expands. SAM reflects which TAM consumers your go-to-market plan can actually reach.

SOM (Serviceable Obtainable Market) = SAM adjusted for competitive share, awareness build rate, and trial-to-repeat conversion

SOM reflects realistic share capture within the served addressable market over a 3-5 year horizon. Trial-to-repeat conversion is typically the binding constraint — the innovation pipeline screening framework covers how to model this with consumer evidence rather than assumption.

The critical advantage of research-backed TAM is that the largest and most uncertain component of the calculation (total demand) rests on consumer evidence rather than analogy. SAM and SOM still require business judgment, but the foundation is no longer hypothetical.

How do you validate sizing assumptions and stress-test the model?


Market sizing for new categories involves inherent uncertainty. The goal is not a single precise number but a credible range, plus identification of which assumptions drive the most variance — so additional research can be targeted at the highest-leverage inputs.

Sensitivity analysis. Test each research input independently. If need-state prevalence is 15% instead of 25%, how does TAM change? If purchase frequency is monthly instead of biweekly, what happens to annual revenue projection? If price point is $5 instead of $7, what is the impact on TAM and margin? Sensitivity output identifies which assumptions matter most and where additional research reduces uncertainty most efficiently per dollar spent.

Targeted validation studies. When initial sizing produces a wide range, commission focused 100-interview follow-up studies on the highest-sensitivity assumption. At $20 per interview, a targeted validation runs $2,000 and returns in 24-48 hours. Compared to the cost of committing to production based on unvalidated estimates — manufacturing investment, slotting fees, marketing launch budget — the validation cost is negligible.

Adjacent-category triangulation. Use existing category data as a sanity check, not as the foundation. If TAM analysis projects $2B but the closest analogous category is $500M, the analyst owes the audience an explanation of which mechanism produces the gap. If the explanation is compelling (new occasion, new demographic, new channel), the gap is defensible. If the explanation is hand-waving, the model needs revision.

Investor and retailer pressure-testing. Walk the model through a series of skeptical-investor questions before fundraising or retail meetings. Where does the TAM come from? What is the prevalence assumption based on? How was frequency estimated? What evidence supports the price point? Source of volume? The exercise surfaces weak links before external counterparts surface them.

How does User Intuition support research-backed market sizing?


The four-input demand-side framework only works if the research behind each input can be run, inspected, and re-run as the concept evolves. Traditionally it could not — a 200-interview need-state study cost $40,000-$80,000 and took 8-12 weeks, so iterative validation against the highest-sensitivity assumption was a months-long commitment, and the sizing was stale by the time it was defensible. User Intuition removes that barrier. A single AI-moderated conversational interview costs $20 and turns around in 24-48 hours, so a complete 250-interview demand-side study finishes in a week, and a focused 100-interview follow-up to stress-test a single assumption adds days, not a research cycle.

The capability that matters specifically for new-category sizing is the conversational depth that surfaces need-states a survey cannot reach. Because consumers have no reference frame for a category that does not yet exist, the model depends on probing evening routines, current workarounds, and substitution behavior rather than asking about purchase intent directly — and the 30-plus-minute interview with 5-7 level laddering is built for exactly that exploration. The 4M+ panel supports targeted recruitment of the consumer profiles that represent the addressable market, and 50+ language coverage enables multi-market sizing in one program. The output is a TAM model with evidence trails: every number traces to a consumer quote rather than an adjacency assumption. Innovation teams can review the market intelligence solution for how this research feeds a sizing program, or book a demo to design a demand-side study.

How do you translate sizing into the launch decision?


Market sizing is an input to the launch decision, not the decision itself. A large TAM with weak need-state conviction can represent a worse opportunity than a smaller TAM with intense, clearly articulated demand. Conviction signal — the depth and consistency of need expressed across hundreds of consumer conversations — is what separates a launchable opportunity from a model that looks good on a slide and dies in market.

When an innovation team can point to 200+ consumer conversations where real people described a genuine, recurring need, articulated willingness to pay, and identified the specific products they would reduce purchasing to fund the new product, the launch decision rests on a foundation that no spreadsheet model can replicate. The decision is no longer “do the numbers support a launch?” — it is “do the numbers and the consumer voices both support a launch?” When they do, conviction is high. When they diverge, the conviction gap is itself diagnostic — typically signaling that the model is leaning on assumptions consumer evidence does not support.

What are the most common new-category sizing mistakes?


Even teams that commit to demand-side sizing produce models that mislead in predictable ways. The errors cluster around six patterns the strongest innovation teams design against from the start.

Confusing intent with behavior. Stated purchase intent overstates actual purchase rates by 2-4x in every category that has been measured. A model anchored on “would buy” answers without behavioral validation routinely produces TAM that is double or triple the real opportunity. Always validate intent against current behavior — what the consumer is doing now, not what they say they would do.

Sizing by analogy to a single adjacent category. The kombucha-by-juice mistake repeats every product cycle. A new category rarely maps cleanly to a single existing category; it draws spend from several and creates new occasions. Triangulate across multiple adjacencies and explicitly model source-of-volume rather than defaulting to a single comparator.

Failing to model channel access into SAM. TAM is the theoretical ceiling. SAM is what the go-to-market plan can actually reach in the first 3 years. Teams that present TAM as if it were SAM oversize the near-term opportunity and create misalignment between sizing and operating plans.

Ignoring competitive response in SOM. Once a category is visible, incumbents and adjacent players respond. SOM modeling that assumes static competitive behavior typically overstates achievable share by 30-50%. Build competitive response scenarios explicitly into the model — what happens when the category leader launches a similar product 18 months in.

Skipping sensitivity analysis. Single-point estimates create false confidence. Sensitivity analysis exposes which inputs drive the most variance and where additional research has the highest return. Every sizing model should include high/base/low scenarios for each input and a clear identification of the highest-leverage assumption.

Underinvesting in iterative validation. Sizing studies done once at the start of innovation development decay as the product concept evolves. Run targeted validation as the concept changes — when pricing shifts, when positioning evolves, when packaging changes. At $20 per interview, the cost of iterative validation is trivial compared to the cost of committing to production on a stale sizing assumption.

What does a strong innovation sizing practice look like?

The CPG innovation teams running the strongest sizing programs share five traits. They anchor TAM on consumer behavior, not consumer intent. They triangulate across multiple adjacent categories rather than relying on a single analog. They model SAM and SOM with explicit channel and competitive assumptions, not as percentages of TAM. They run sensitivity analysis on every input and use the sensitivity output to direct follow-up research. And they refresh the sizing model as the concept evolves rather than treating the initial size estimate as permanent. Programs that do this consistently make better innovation bets because they distinguish the large-market-weak-signal opportunities from the smaller-market-strong-signal opportunities — and over multiple launches, the discipline compounds into a structural advantage versus competitors still presenting TAM slides with one big number and no methodology.

CPG organizations building this capability into their innovation process make better bets more consistently — not because they have eliminated uncertainty, but because they have grounded their estimates in the most reliable signal available: the voices of the consumers they intend to serve. The compounding effect across multiple launches is what eventually separates the innovation-strong CPG from the innovation-weak one.

Note from the User Intuition Team

Your research informs million-dollar decisions — we built User Intuition so you never have to choose between rigor and affordability. We price at $20/interview not because the research is worth less, but because we want to enable you to run studies continuously, not once a year. Ongoing research compounds into a competitive moat that episodic studies can never build.

Don't take our word for it — see an actual study output before you spend a dollar. No other platform in this industry lets you evaluate the work before you buy it. Already convinced? Sign up and try today with 3 free interviews.

Frequently Asked Questions

Supply-side sizing extrapolates from existing category data — competitive sales figures, retailer scan data, syndicated tracking — none of which exists for categories that haven't yet launched. More fundamentally, supply-side approaches measure what consumers are currently buying, not what they would buy if a new option were available at the right price and in the right channel. New categories are defined by demand that isn't yet expressed in purchasing behavior, which means the relevant data must be gathered through consumer research rather than extracted from market databases.
The demand-side framework quantifies unmet demand through a combination of consumer research methods: frequency and context of need-states the new category would serve, willingness to pay at specific price points, competitive substitutes currently being used to address the need, and household penetration potential based on who experiences the need and how often. Each of these inputs produces a sizing assumption that can be stress-tested against consumer evidence rather than defended as an extrapolation from adjacent category data.
User Intuition can run rapid consumer interviews to populate the demand-side assumptions in a new category sizing model — probing need-state frequency, competitive substitutes, price sensitivity, and purchase occasion across targeted consumer profiles. At $20 per interview and 24-48 hour turnaround, brands and investors can validate sizing assumptions iteratively as the model develops rather than committing to a single research phase at the start of the project. The 4M+ panel enables targeted recruitment of the consumer profiles most likely to represent the new category's addressable market.
Direct purchase-intent questions overstate frequency by 2-4x because respondents anchor on aspirational rather than behavioral answers. The reliable approach is to map current behavior in adjacent categories: how often does the consumer engage in the occasions your product would serve, and what do they currently use to address the need? A consumer who already buys chamomile tea three times a week as a sleep ritual provides a credible frequency anchor for a new sleep-aid food; a consumer who says they would 'probably buy' something 'a few times a month' provides almost no signal. Grounding frequency estimates in documented current behavior rather than stated intent is the single most important discipline in demand-side sizing.
Get Started

Put This Research Into Action

Run your first 3 AI-moderated customer interviews free — no credit card, no sales call.

Self-serve

3 interviews free. No credit card required.

See it First

Explore a real study output — no sales call needed.

No contract · No retainers · Results in 72 hours