Customer discovery interviews are the foundation of evidence-based product decisions for SaaS teams. Whether you are running idea validation for a new concept or pressure-testing an existing roadmap, the questions you ask determine whether you get polite agreement or genuine insight. This guide provides a practical question bank organized by research goal, with guidance on the adaptive follow-up techniques that separate productive interviews from scripted exercises. It draws on the patterns User Intuition has observed across thousands of AI-moderated discovery sessions for SaaS product teams running continuous discovery inside sprint cycles.
What three principles must come before any discovery question?
Before reaching for specific questions, internalize three principles that govern effective discovery. The principles matter more than the questions themselves. A perfectly worded question asked without these principles still produces noisy data; an imperfectly worded question asked with them still produces useful signal.
Ask about behavior, not opinions. “How do you currently handle X?” produces actionable data. “What do you think about X?” produces speculation. Past behavior is the single best predictor of future behavior. Opinions predict almost nothing — and worse, the act of asking for an opinion implicitly invites the participant to perform a certain kind of response. The behavioral framing removes the invitation. Participants who narrate what they actually did last Tuesday are not constructing an idealized self-image; they are recalling specifics.
Follow the energy. When a participant’s tone shifts — they become more animated, more frustrated, or more detailed — that is where the real insight lives. Abandon your script momentarily and follow the thread. The most valuable finding in a discovery interview is almost never the answer to a question you prepared. It is the unprompted aside the participant volunteered when you stayed quiet long enough for them to keep talking. Discovery interviewers who chase script completion at the expense of energy consistently produce thinner findings than interviewers who follow the participant’s lead.
Pursue specificity. Every generalization hides useful detail. When a participant says “we usually do it this way,” ask about the last specific time. “Tell me about the most recent time that happened. When was it? What exactly did you do?” Specificity prevents participants from constructing an idealized narrative and forces recall of actual events. The specific event is the unit of data; the generalization is a story the participant tells about the events, and stories are systematically biased toward coherence rather than accuracy.
What is the question bank by research goal?
Problem discovery
Use these when you are exploring a problem space before committing to a solution direction.
- “Walk me through a typical day when you are working on [relevant task]. What tools do you use and in what order?”
- “What is the most tedious or frustrating part of [workflow area]? Can you give me a specific example from the last week?”
- “When was the last time [problem area] caused you to miss a deadline, waste time, or get frustrated? What happened?”
- “If you could eliminate one step from your current process, which would it be and why?”
- “What have you tried to solve this problem? What worked and what did not?”
The last question is particularly powerful. Users who have tried to solve a problem — building spreadsheets, writing scripts, hiring contractors — have already validated the problem’s intensity through their own investment. Users who have never attempted a solution may not care enough to adopt yours. Every existing workaround is encoded evidence of unmet need; the questions that surface workarounds are doing most of the diagnostic work.
Solution validation
Use these after you have validated the problem and need to evaluate a specific solution concept.
- “I want to describe something we have been thinking about. [Describe concept in 2-3 sentences.] Based on what you just told me about your workflow, where would this fit in?”
- “What would need to be true about this for you to try it? What would make you hesitate?”
- “If this existed today, what would you stop doing? What tools or processes would it replace?”
- “Who else on your team would need to be involved for this to work? What would their concerns be?”
- “On a scale of your current workaround to a perfect solution, where does this concept land? What is missing?”
The commitment probes — asking what they would stop doing, who else would need to be involved — separate genuine interest from polite enthusiasm. A user who cannot describe what they would change is expressing a preference, not demonstrating intent. A user who can name the tool they would deprecate, the colleague they would have to convince, and the budget line they would re-allocate is describing actual purchase behavior.
Competitive landscape
Use these to understand how users evaluate and compare solutions.
- “What tools have you evaluated in this space? What stood out about each one?”
- “If you had to switch away from [current tool] tomorrow, what would you move to and why?”
- “What does [current tool] do well that you would not want to lose? What frustrates you most about it?”
- “When you last evaluated a new tool in this category, what were the top three criteria? How did you weight them?”
Competitive landscape questions surface the alternatives users actually consider, not the alternatives your competitive intelligence team tracks. The two lists frequently diverge. Tools you consider rivals may not appear in any user’s evaluation set; tools you consider irrelevant may appear in every user’s shortlist. The mismatch is one of the highest-value findings discovery produces.
Workflow and context
Use these to build a complete picture of the user’s environment before proposing solutions.
- “Walk me through how a [relevant deliverable] gets from initial request to final delivery. Who is involved at each step?”
- “Where do things typically break down or slow down in this process?”
- “How do you currently measure success for [relevant outcome]? What metrics or signals do you track?”
- “What has changed about how you do this in the last 12 months? What drove those changes?”
Workflow questions are foundational. Many discovery failures trace to a misunderstanding of the environment the feature would live in. A feature that looks brilliant in a slide deck dies on contact with a workflow involving four stakeholders, three other tools, and one approval gate that nobody mentioned because they assumed you already knew. The “What has changed in the last 12 months?” question is particularly diagnostic — it surfaces the recent disruptions to the workflow that signal where the participant’s attention currently is, and where adoption energy might exist.
Switching triggers and decision authority
Use these to understand who decides what gets adopted and what events cause re-evaluation.
- “When you last brought a new tool into this workflow, what triggered the search? Who approved the purchase?”
- “What would have to be true for you to consider switching away from [current tool]?”
- “Walk me through the last conversation you had with your manager about tooling for [workflow area]. What did they say mattered most?”
- “If a competitor showed up with a free trial tomorrow, what is the first thing you would do to evaluate it?”
These questions are critical for feature prioritization and competitive positioning. They reveal the actual mechanics of how purchase decisions get made in the participant’s organization — which is rarely what the vendor’s go-to-market team assumes.
How do you master the art of follow-up?
Prepared questions get you to the surface. Follow-up gets you to insight. The 5-7 level laddering technique works by treating each answer as a prompt for deeper exploration.
Level 1 probe: Clarify. “When you say ‘it takes too long,’ what does that mean specifically? How long is too long?”
Level 2 probe: Contextualize. “What are you trying to accomplish during that time? What depends on this being faster?”
Level 3 probe: Quantify. “How often does this happen? Is it every day, every week, or less frequent?”
Level 4 probe: Explore consequences. “What happens when this delay occurs? Who is affected beyond you?”
Level 5 probe: Uncover the root. “If you could redesign this from scratch, knowing what you know now, what would it look like?”
Each level moves from description to diagnosis to prescription. By level 5, you are no longer discussing the original symptom — you are discussing the underlying need that the symptom revealed. Human interviewers commonly stop at level 2 or 3 because the social pressure of repeated “why” questions feels intrusive. Participants rarely mind; they appreciate being listened to with depth. The discomfort is the interviewer’s, not the participant’s.
Manual interviewing vs. AI-moderated discovery
| Dimension | Manual interviewing | AI-moderated discovery |
|---|---|---|
| Interviewer capacity | 3-4 sessions/day before quality degrades | Hundreds simultaneously |
| Cost per interview | $200-$500 (researcher time) | $25 (User Intuition) |
| Laddering consistency | Variable; interviewer-dependent | Uniform 5-7 levels every session |
| Time to 20 transcripts | 2-3 weeks | 24 hours |
| Scheduling friction | High; multi-stakeholder calendars | Asynchronous; participants self-schedule |
| Language coverage | Limited by interviewer fluency | 50+ languages |
| Recruitment | Manual outreach | 4M+ panel access |
| Searchable archive | Requires separate tooling | Native transcript repository |
| Best fit | Sensitive enterprise interviews, novel domains | Sprint-cycle discovery, continuous practice |
The two methods are complementary, not competitive. Senior enterprise discovery sometimes benefits from a human researcher; routine sprint discovery benefits from the AI cadence. Most growth-stage SaaS teams now run an 80/20 split with AI-moderated as the default.
How can you scale discovery without losing depth?
The traditional constraint on discovery research is interviewer bandwidth. A skilled researcher can conduct 3-4 interviews per day before quality degrades. At that rate, a 20-person study takes a full week just for data collection, not counting recruitment and analysis. The interviewer becomes the rate limit on the entire product-decision process.
AI-moderated interviews remove this bottleneck. User Intuition runs dozens of conversations simultaneously, each with adaptive 5-7 level follow-up, producing the same depth as human-moderated interviews at a fraction of the time. A 20-person discovery study that would take 2-3 weeks with traditional methods completes in 24 hours through User Intuition’s 4M+ panel across 50+ languages, with every conversation recorded, transcribed, and searchable. Participant satisfaction sits at 98%, and studies start at $150.
For SaaS teams operating in two-week sprints, this timeline means discovery research can inform the current sprint’s decisions rather than the sprint after next. Product managers get evidence while the decision window is still open. The shift in what the team is allowed to know — at the moment the knowledge is still actionable — is structural rather than incremental. It is the same difference as the shift from monthly to weekly deploys.
The cost structure supports continuous practice rather than episodic studies. At $25 per interview, running 10 discovery conversations per week costs $800 per month — roughly the cost of a single team lunch. That steady cadence of customer contact keeps the team’s understanding of user needs current as the product and market evolve. Studies that previously required quarterly budget approval become a line item in the sprint plan.
Running this question bank with User Intuition
The three principles this guide opens with — behavioral framing, following the energy, pursuing specificity — are easy to state and hard to execute consistently across 40 conversations, because the laddering they depend on is exactly where human interviewers tire and drift. User Intuition’s AI moderator holds that discipline uniformly: it applies the non-leading phrasings in the question bank, runs the five-to-seven-level probe sequence on every session, and chases each generalization down to the specific recent event rather than settling for “we usually do it this way.” For idea validation work specifically, the capability that changes the practice is cadence at sprint speed — a 20-person discovery study lands in 24 hours, so the evidence informs the current sprint’s roadmap call rather than the one after next, and the commitment probes that separate genuine intent from polite enthusiasm get asked the same way for every participant. The platform absorbs the operational work a research-ops function used to own: panel recruitment, screening, scheduling, transcription, and theme synthesis with verbatim citations. Every conversation lands in a searchable repository, so “what do our users think about reporting?” becomes a query across six months of transcripts instead of a new study. SaaS teams moving from quarterly to weekly discovery can book a demo to see a sprint-cycle study fielded end to end.
How do you sequence questions inside a single 30-minute session?
The bank above contains far more questions than fit in a single interview. A 30-minute session typically covers 8-12 prepared questions with extensive follow-up; the discipline is selection, not exhaustion. The default sequence that works for most discovery sessions: open with two workflow/context questions to ground the conversation in actual behavior, move into three to four problem-discovery questions to map pain and workarounds, transition to solution validation only after pain is established, and close with one or two competitive landscape or switching trigger questions.
Avoid front-loading solution validation. Showing the concept early biases everything downstream — participants who have seen your proposed solution stop describing their actual workflow and start describing how the solution would fit. Hold the concept back until at least 15 minutes into the session, ideally longer. The participant who has spent 20 minutes talking about their problem gives substantially more honest feedback on the concept than the one who saw the concept at minute three.
The other sequencing discipline is exit cleanly. Reserve the last 3-5 minutes for the “is there anything I should have asked?” prompt and the “who else on your team should I talk to?” referral question. Both questions consistently produce findings — the first surfaces the priority the participant wishes you had explored, the second extends recruitment into the buying committee at zero marginal cost.
How do you build a durable interview practice for product teams?
Discovery interviews produce the most value when they are a habit rather than an event. Teams that run 5-10 interviews per week develop an intuitive understanding of their user base that no amount of analytics or survey data can replicate. They recognize patterns faster, spot emerging needs earlier, and make roadmap decisions with confidence. The compounding effect is visible in retros — teams that have run continuous discovery for two quarters argue less about user needs because they share an evidence-based picture of who the user is and what the user does.
The key infrastructure is a searchable repository where conversation insights accumulate over time. When a PM asks “what do our users think about reporting?” they should be able to search across six months of discovery conversations and find every relevant mention — traced to specific verbatim quotes, not filtered through someone’s memory or a stale research report. This institutional memory survives team transitions and strategy shifts, compounding in value with every conversation added. Without the searchable archive, every new PM relitigates the same questions the previous PM already answered, and the compounding value disappears.
The final discipline is making discovery findings visible to the broader organization. Post a weekly summary in a public Slack channel. Cite verbatim quotes in design reviews. Make customer evidence a mandatory section of every PRD. When the organization sees regular customer insight flowing through every product decision, the practice becomes self-reinforcing — and the next leadership change cannot easily kill it, because the muscle is now distributed across the team rather than concentrated in one PM’s notebook.