Interviewing Users in the Age of AI: What's Better, Synthetic Data or Real Conversations?
- Cher Taylor
- Jan 9
- 4 min read
We're living through a fascinating tension in user research. On one side, AI can now generate synthetic user interviews in minutes: complete with personas, responses, and insights that look remarkably convincing. On the other side, seasoned researchers are doubling down on the irreplaceable value of sitting across from a real human being, watching their facial expressions, and hearing the hesitation in their voice.
So which approach actually delivers better insights? The answer isn't as straightforward as you might think.
The Synthetic Data Revolution
Let's be honest: synthetic user research is impressive. I've watched teams generate hundreds of "user interviews" in the time it would take to schedule three real ones. The technology has reached a point where AI feedback aligns with human feedback over 95% of the time, particularly for broad attitudinal questions.

Speed is the obvious win. You can test multiple scenarios, validate assumptions, and explore different user segments without the logistics nightmare of recruiting, scheduling, and conducting real interviews. For teams working under tight deadlines or limited budgets, this feels like a superpower.
Privacy protection is another major advantage. When you're dealing with sensitive topics: healthcare, financial stress, personal relationships: synthetic data lets you explore user perspectives without putting real people in uncomfortable situations or risking data breaches.
Scale becomes effortless. Want to understand how 50 different user types might respond to your new feature? Synthetic data can generate those perspectives in hours, not months.
The transcripts are clean, well-formatted, and ready to share with stakeholders without the usual cleanup that comes with real interview recordings. For design teams that need to move fast and demonstrate user-centricity to leadership, synthetic research offers a compelling path forward.
Where Real Conversations Still Reign
But here's what I've learned after years of user research: the magic often lives in the moments AI can't replicate.
Real humans go off-script. They use your product in ways you never intended. They reveal needs they didn't even know they had. A synthetic user will stick to logical, predictable responses based on training data, but a real person might say something that completely transforms your understanding of the problem.

Emotional nuance is everything. When someone pauses before answering, when their voice changes when discussing a particular feature, when they light up talking about an unexpected use case: these subtle cues carry meaning that no algorithm can fabricate. I've seen single moments of genuine emotion redirect entire product strategies.
Cultural context matters deeply. Synthetic data reflects averaged patterns, but real users bring their specific cultural backgrounds, life experiences, and unique perspectives that training data simply can't capture. This is especially critical when designing for diverse or underserved communities.
Empathy builds better products. There's something about sitting with a frustrated user, seeing their struggle firsthand, that creates a level of team empathy no synthetic report can match. When developers hear a customer say, "This makes me feel stupid," it hits differently than reading it in a generated transcript.
When to Choose Which Method
The most effective teams I've worked with use both approaches strategically, not as an either-or decision.
Use synthetic data for:
Early-stage ideation and assumption testing
Rapid exploration of multiple scenarios
Sensitive topic research where privacy is paramount
High-volume testing when you need to validate concepts quickly
Initial persona development and journey mapping
Choose real conversations for:
Understanding emotional drivers and motivations
Uncovering unexpected use cases and edge scenarios
Building team empathy and buy-in
Validating final design decisions before launch
Exploring complex, nuanced problems that require context

Avoid synthetic data when:
You're designing for specific cultural communities
Emotional insight is critical to your product's success
You're entering completely new problem spaces
Stakeholders need to hear directly from real customers
The Risks of Going All-In on Either Approach
I've seen teams get burned by over-relying on synthetic data. The insights feel comprehensive and convincing, but they miss the messy, contradictory nature of real human behavior. Products built entirely on synthetic insights often feel polished but somehow... hollow.
On the flip side, teams that reject AI-assisted research entirely are handicapping themselves. They're spending weeks gathering insights that could be validated in hours, missing opportunities to explore broader question sets, and often working with sample sizes too small to be truly representative.
The biggest risk with synthetic data? Confirmation bias at scale. If your AI is generating responses that confirm your existing assumptions, you might mistake validation for insight. Real users have a wonderful way of proving our assumptions wrong.
The biggest risk with real-only research? Moving too slowly in fast-changing markets. By the time you've conducted thorough user research the traditional way, your competitors might have shipped three iterations.
The Hybrid Approach That Actually Works
The teams getting the best results combine both methods strategically. Here's the framework I recommend:
Start synthetic, go deep with real. Use AI to rapidly test multiple directions and identify the most promising areas for investigation. Then invest your limited research budget in deep, qualitative conversations with real people on those focused topics.
Let AI identify the "what," then use humans to uncover the "why." Synthetic data excels at surfacing patterns and preferences, but real conversations reveal the underlying motivations and contexts that drive those patterns.

Use synthetic data to improve your real research. Generate potential interview questions, explore edge cases you might miss, and test different ways of framing problems before you sit down with actual users.
Validate synthetic insights with real voices. When synthetic data suggests something important, always confirm it with real user conversations before making major product decisions.
Practical Recommendations for Your Team
If you're ready to experiment with this hybrid approach, start small:

The Bottom Line
The question isn't whether synthetic data or real conversations are better: it's about using each method where it provides the most value. Synthetic data gives us speed and scale for exploration, while real conversations provide the emotional depth and unexpected insights that drive breakthrough products.
The teams that will win in this AI-augmented world are those that master both approaches, using synthetic data to ask better questions and real user research to find answers that actually matter.
Your users are complex, contradictory, and wonderfully human. No AI can fully replicate that complexity, but it can help us understand where to focus our human research efforts for maximum impact.
The future of user research isn't choosing between artificial and authentic: it's combining both to create insights that are faster, deeper, and more actionable than either approach alone.
Comments