I build AI products with a simple conviction: disciplined experimentation beats intuition. Over the years, I’ve refined a practical playbook that helps my teams learn faster, reduce risk, and turn every release into a smarter next step.
Product experimentation isn’t luck; it’s a method. Learn how top AI product managers test, measure, and grow smarter with every release.
I begin every effort with a crisp hypothesis, an expected user or business outcome, and unambiguous success criteria tied to outcomes vs output OKRs. Before writing a line of code, I define primary metrics and guardrails so we know what “good” looks like—and what to stop.
When the change affects UX, pricing, or activation flows, I favor A/B testing with the statistical rigor to back decisions. We calculate the minimum detectable effect (MDE), choose appropriate randomization units, and pre-register the analysis plan to avoid p-hacking. This gives the team the confidence to scale wins and sunset underperformers quickly.
AI features demand a tailored approach, so I run eval-driven development before any user sees a variant. We curate golden datasets, score candidate prompts and models, and stress-test failure modes. This is where LLMs for product managers matters: prompt templates, context window management, and a retrieval-first pipeline are all evaluated for quality, latency, and cost-to-serve. I treat “hallucination rate,” safety violations, and bias as first-class metrics under AI risk management.
To de-risk launches, we ship behind feature flags with CI/CD, monitor DORA metrics, and roll out in stages. Product trios own problem framing to solution delivery, which shortens feedback loops and preserves accountability. If early signals drift from our hypotheses, we pause, adjust, and re-run—no sunk-cost thinking.
Measurement is non-negotiable. I instrument user journeys end-to-end with Amplitude analytics, track activation and retention analysis, and map behavior to learning objectives. We consolidate logs and events into a unified analytics platform so qualitative insights from customer research pair cleanly with quantitative trends.
Continuous discovery keeps the engine running. Weekly customer conversations, in-product feedback, and lightweight prototypes ensure we validate needs, not just solutions. The output flows into product discovery, product roadmapping and sprint planning, and a reusable AI product toolbox that scales across teams.
Finally, I protect the culture that makes experimentation work: we celebrate invalidated hypotheses, document decisions, and optimize for outcomes over output. That’s how empowered product teams sustain product-led growth—even as complexity grows.
If you’re building AI features today, adopt this playbook to maximize learning velocity, minimize risk, and compound advantage. The method is straightforward: form strong hypotheses, test with rigor, measure what matters, and let evidence—not HiPPOs—guide the roadmap.
Quantitative metrics tell the story in numbers; qualitative ones whisper why it matters. Both shape how products grow. Here’s what you need to know.
In my day-to-day, I rely on quantitative metrics to surface what’s changing in the business and where we need to focus. Activation rate, conversion through the onboarding funnel, feature adoption, retention analysis, and LTV/CAC give me a precise read on performance. I also keep an eye on DORA metrics to understand delivery health and deployment frequency, but I never mistake those for customer outcomes. Numbers spotlight signal—but they rarely explain causality on their own.
That’s where qualitative analysis earns its keep. Customer interviews, usability studies, win/loss debriefs, support transcripts, and community feedback give me the context behind the charts. Tools like Pendo help me layer in in-app guides and micro-surveys to capture intent and friction in the flow. This combination turns raw data into decisions that actually move the product strategy forward.
My operating cadence is simple: weekly dashboards to monitor quantitative metrics, ongoing continuous discovery to collect qualitative insight, and a monthly synthesis to reconcile both with our outcomes vs output OKRs. The aim is to move from opinions to evidence, and from anecdotes to patterns. When quant and qual agree, we execute confidently; when they diverge, we design the smallest experiment to learn fast.
I use a three-question decision tree to choose the method. First, are we exploring or validating? Exploration leans qualitative; validation leans quantitative. Second, do we have enough volume for statistical power? If yes, I’ll run A/B testing with a clear minimum detectable effect (MDE) to avoid false positives. If not, I’ll rely on targeted qualitative discovery until we can instrument a meaningful test. Third, will this decision meaningfully impact our product-led growth or user activation goals? If it will, we invest in both measurement and discovery to reduce decision risk.
Here’s a concrete example. We once saw a sudden drop in user activation. The quantitative dashboard flagged a step-function change at onboarding step three, but it couldn’t explain why. A quick round of qualitative interviews revealed that our tooltip design buried a critical permission request. We shipped a Pendo-powered in-app guide variant and ran an A/B test to validate the fix. Activation rebounded within a week, and 30-day retention followed suit.
There are common pitfalls I actively avoid. Chasing vanity metrics that don’t ladder up to outcomes. Conflating shipping speed with customer value by over-indexing on DORA metrics. Overfitting with A/B testing when the MDE is unrealistic for our traffic. And on the qualitative side, mistaking a compelling anecdote for a representative sample without triangulating evidence.
If you’re looking to tighten your practice, start with a lightweight playbook: instrument core events in Amplitude analytics; define a small set of outcomes vs output OKRs; schedule recurring customer conversations as part of continuous discovery; tag qualitative insights so patterns surface over time; and pair every material UX change with either a well-powered experiment or a clear qualitative learning goal. This creates a unified analytics and discovery loop that compounds.
Ultimately, quantitative metrics help me prioritize with clarity, while qualitative analysis helps me decide with confidence. When you weave them together, you not only ship faster—you ship the right thing, for the right reason, at the right time.
I rely on product benchmarks to align teams, sharpen strategy, and accelerate outcomes—especially in healthcare, where stakes are high and complexity is real. Over the years, I’ve learned that the right metrics create clarity across product, engineering, compliance, and go-to-market, enabling faster, safer decisions that translate into measurable impact.
Discover exclusive data and strategies from our Product Benchmark Report. Compare the healthcare technology industry’s performance across key product metrics.
When I evaluate a healthcare product’s health, I focus on a few essentials: activation rate and time-to-value for new users, weekly active usage and feature adoption for clinicians and admins, and cohort-based retention analysis to understand whether value compounds over time. I also look at funnel friction (onboarding drop-off, failed setup steps), support load per account, and reliability signals that influence trust—because in healthcare, trust fuels growth.
Benchmarks turn those metrics into context. They help me answer, “Are we good, or just lucky?” By comparing our numbers to industry peers, I can prioritize the few bets that matter, set outcomes vs output OKRs, and guide empowered product teams to focus on the highest-leverage improvements.
Operationally, I instrument products with a unified analytics platform and tools like Amplitude analytics and Pendo to track user activation, feature adoption, and in-product journeys. Pairing that with continuous discovery keeps insights fresh, while A/B testing and clear minimum detectable effect (MDE) thresholds ensure we ship with statistical confidence.
In practice, my playbook for healthcare product-led growth is straightforward: simplify onboarding with targeted product tours and in-app guides, tighten the first-win loop to reduce time-to-value, and eliminate blockers surfaced by behavioral analytics. Then, reinforce the loop with lifecycle messaging, role-specific education, and clear value propositions for clinicians, operations teams, and executives.
Of course, none of this works without strong governance. Data governance and regulatory compliance aren’t just guardrails; they’re growth enablers. Clear audit trails, privacy-by-design, and reliable incident management build the trust that keeps adoption high and churn low.
If you’re ready to benchmark your roadmap against the market, this report gives you the clarity to spot gaps, the language to align stakeholders, and the metrics to execute with precision. Use it to calibrate your product strategy, guide your next set of experiments, and confidently scale what works across the healthcare technology ecosystem.
Inspired by this post on Amplitude – Perspectives.
The most valuable lesson I’ve learned leading product organizations is that portfolio choices make or break outcomes. In an era of infinite requests and finite teams, the question isn’t what we could build—it’s what we must build next. That’s why I’m codifying a pragmatic, AI-driven playbook to optimize the product portfolio while staying true to outcomes, not output.
AI-powered product portfolio optimization is here. Explore strategies and tools helping product leaders manage complexity and boost ROI.
My starting point is a data backbone that connects strategy to reality. I aggregate product usage, revenue by segment, cost-to-serve, retention cohorts, and support signals into a unified analytics platform, then layer a retrieval-first pipeline so LLMs can reason over clean context. Instrumentation matters: Amplitude analytics, Pendo, and in-app guides provide the behavioral and activation signals that make prioritization measurable.
From there, I translate strategy into an objective decision system. I express outcomes vs output OKRs, align initiatives to value proposition and competitive differentiation, and classify opportunities with the Kano Model. LLMs for product managers help cluster voice-of-customer at scale; with thoughtful prompt engineering and AI workflows, I can map themes to jobs-to-be-done, quantify demand, and de-duplicate asks across stakeholders.
Execution hinges on evidence. I run A/B testing with a clear minimum detectable effect (MDE), pair it with eval-driven development for AI features, and ship through CI/CD while tracking DORA metrics. This closes the loop between product roadmapping and sprint planning and real-world performance—activation, retention analysis, and Web Vitals inform the next set of portfolio bets.
Trust is a feature, so governance is built-in. Privacy-by-design, data governance, and AI risk management guide how we store, prompt, and evaluate models. I apply guardrails to sensitive workflows and define success metrics that balance short-term ROI with long-term resilience and regulatory compliance.
The operating model matters as much as the models themselves. Product trios and empowered product teams run continuous discovery, pressure-test assumptions in QBRs vs OKRs, and make trade-offs visible. Stakeholder management becomes easier when the portfolio narrative is anchored in transparent scenarios and shared metrics.
If you’re getting started, here’s my flow: unify data, define outcomes, segment opportunities, simulate scenarios, and test fast. Use LLMs to synthesize signals you’d never humanly read, then make one focused bet per team that moves a measurable KPI. Rinse, learn, and reallocate—portfolio optimization is a living system, not an annual meeting.
Ultimately, the promise of this new playbook is simple: less noise, sharper focus, and compounding ROI. By pairing AI Strategy with disciplined product management leadership, we can manage complexity with clarity—and consistently build what matters most.
Benchmarks are my reality check. In the fast-moving media and entertainment space, I rely on concrete product metrics to align strategy, prioritize roadmaps, and drive product-led growth with confidence. When my team and I calibrate against industry benchmarks, we turn opinions into outcomes and ensure our bets are tied to measurable impact.
Discover exclusive data and strategies from our Product Benchmark Report. Compare the media and entertainment industry’s performance across key product metrics.
Here’s how I think about what matters most in this report: user activation and time-to-value to understand onboarding effectiveness, retention analysis to quantify staying power, feature adoption to validate value delivery, and engagement depth to see whether we’re building habit loops—not just generating clicks. I also look at experimentation maturity (A/B testing volume and velocity), release cadence, and how we structure outcomes vs output OKRs to keep teams accountable to real customer impact.
Benchmarks aren’t scorecards—they’re decision accelerators. I use them to run a gap analysis, set clear targets, and focus the roadmap on the few bets most likely to move our leading indicators. For example, if activation lags, we invest in clearer in-app guides, product tours, and progressive onboarding; if retention stalls, we refine the value proposition and instrument cohorts to isolate which segments respond best.
Operationally, I instrument a unified analytics platform with Amplitude analytics for cohorting and funnel analysis, and Pendo for in-app guidance and feature adoption insight. Weekly product health reviews keep the team oriented around activation, retention, and engagement. When we A/B test, we set a minimum detectable effect (MDE) up front and tie experiments to specific OKRs, so decisions aren’t swayed by noise. This discipline helps empowered product teams ship faster without sacrificing rigor.
If you’re building in media and entertainment, use these benchmarks to define what “good” looks like for your model, then localize targets to your audience and content format. Start by instrumenting the essentials, align leaders on the few metrics that matter, and iterate with high-velocity experiments. The right benchmarks will sharpen your product strategy, improve stakeholder confidence, and turn your roadmap into a reliable engine for growth.
Inspired by this post on Amplitude – Perspectives.
I’ve been systematically exploring how the product model shows up inside iconic companies. After studying “The Product Model at Spotify” and “The Product Model at Amazon,” I’m turning my lens to Google—specifically, how the product operating model, product culture, and product strategy manifest in practice and what we can pragmatically take back to our own organizations.
When I talk about the product model, I’m looking at the machinery that connects strategy to outcomes: empowered product teams, clear decision rights, tight product trios, continuous discovery, data-informed bets, and an operating cadence that enables learning at speed. My goal here is to unpack how those elements come together at Google and translate them into repeatable patterns you can adopt.
At a high level, I focus on how teams are empowered to solve problems rather than ship outputs, how outcomes vs output OKRs clarify what matters, and how experimentation (from rapid prototyping to A/B testing) de-risks decisions before they scale. I also examine how engineering and product partner to balance platform scalability with customer value, and how stakeholder management reinforces alignment without slowing teams down.
Why does this matter? Because the product model is a lever for resilience and speed. When product strategy is explicit and the operating model is built for learning, organizations multiply the impact of talented people. That’s how small, focused teams repeatedly deliver outsized results—even in complex, regulated, or high-scale environments like Google.
In the sections that follow, I’ll synthesize what I see as the core patterns behind Google’s approach and distill them into actionable guidance: how to structure product trios, how to run continuous discovery alongside delivery, how to set and calibrate OKRs for outcomes, and how to evolve your product culture so empowered product teams can do their best work. My aim is not to idolize a model, but to extract what’s portable and help you adapt it to your context.
Every week I review dozens of applications for PM roles, and in under 30 seconds I decide whether to keep reading. In 2026, the bar is higher than ever: clarity, outcomes, and customer insight beat buzzwords every time.
Learn how to write a standout product manager cover letter with steps, examples, templates, and smart AI workflows to make your application stand out.
I start with a crisp opening that communicates my value proposition in one sentence: the product problem I love solving, the customer I serve, and the measurable outcomes I drive. Then I connect my experience to the role’s core responsibilities—product discovery, product positioning, go-to-market strategy, and stakeholder management—without rehashing my resume.
A strong PM cover letter follows a simple structure: a hook with context, one paragraph proving product management leadership through outcomes vs output OKRs, a paragraph on how I partner with empowered product teams and engineering to ship, and a closing line that shows I understand the company’s roadmap and where I can help now.
To make this concrete, I include brief examples that show decisions, not duties: how I translated ambiguous customer signals into a roadmap, how I balanced platform scalability with speed, and how I measured success with activation, retention, and adoption—not vanity metrics.
Templates help me move fast, but I always tailor. I mirror the job’s language, highlight the few experiences that map 1:1, and cut everything else. I quantify impact where possible, link outcomes to business value, and keep it to 200–300 words so hiring managers can scan.
I also use smart AI workflows to accelerate the craft without sacrificing authenticity. My LLMs for product managers playbook: extract the role’s competencies, generate a draft outline, compare multiple versions with light A/B testing, and refine tone and clarity. Tools should augment judgment; the final voice is mine.
If you’re applying now, assemble your core template, slot in two role-specific examples, and close with a confident ask for next steps. With the right structure, clear outcomes, and a little AI leverage, your product manager cover letter will stand out in any stack.
I’ve learned that in financial services, intuition isn’t enough—rigorous product benchmarks are what separate signal from noise. When my team and I evaluate portfolio performance, we anchor our decisions to the metrics that correlate with customer trust, compliant growth, and durable revenue.
Discover exclusive data and strategies from our Product Benchmark Report. Compare the financial services industry’s performance across key product metrics.
Here’s how I use a benchmark report in practice: I calibrate our baseline against peers, identify the few levers that disproportionately drive outcomes, translate those findings into outcomes vs output OKRs, and align stakeholders across product, risk, operations, and go-to-market. Benchmarks turn debate into data and surface the opportunity cost of not fixing broken journeys.
The product metrics I zero in on typically include user activation rate, time-to-first-value, onboarding completion, funnel conversion (for example, from signup to funded account or application to approval), cohort-based retention analysis (D7/D30/D90), depth of feature adoption, weekly-to-monthly active ratios, support contact rate, and cost-to-serve. In financial services, these signals tell a clear story about trust, reliability, and product-market fit.
To operationalize these insights, I combine Amplitude analytics with Pendo in-app guides to instrument end-to-end journeys, segment by customer profile, and run disciplined A/B testing with clear guardrails. This lets us move from anecdotes to statistically defensible changes and iterate confidently on onboarding, product tours, and moments that drive activation and engagement.
Because the trust and regulatory bar is higher in financial services, I also watch for friction in verification flows, error states that erode confidence, and any gaps between intent and completion. When benchmarks show we’re lagging, I pair discovery with rapid experiments to improve the experience while maintaining privacy-by-design and strong governance.
Use this benchmark report to pinpoint where you outperform and where you lag, prioritize roadmap bets, and focus your product-led growth motion. When teams rally around a shared set of product benchmarks, execution speeds up, trade-offs become clearer, and the value proposition sharpens for both customers and the business.
Inspired by this post on Amplitude – Perspectives.
I spend my days shaping core analytics product experiences that help teams see their business with greater clarity. When I design an analytics workflow, my goal is simple: make it effortless to ask better questions, uncover meaningful patterns, and turn insight into action. In this brief reflection, I’ll share how I approach product discovery, experimentation, and roadmapping to create analytics tools that truly move the needle.
Everything starts with outcomes. I anchor roadmaps to a clear north star and use outcomes vs output OKRs to align problem statements with measurable impact. That means instrumenting a precise event taxonomy and building guardrails for data quality so retention analysis and user activation metrics are trustworthy. When the foundation is sound, product-led growth becomes repeatable because we can connect feature usage to value creation without guesswork.
Experimentation is where conviction meets evidence. I rely on A/B testing with a disciplined view of minimum detectable effect (MDE) so we size experiments responsibly and ship with confidence. Self-serve analysis—and, when appropriate, tools like Amplitude analytics within a unified analytics platform—lets teams quickly validate hypotheses, monitor cohorts, and understand lift. The result is faster learning cycles without sacrificing statistical rigor.
On the delivery side, I practice continuous discovery and translate insights into product roadmapping and sprint planning that teams can execute. I work closely with design and engineering to reduce cognitive load in the UI, standardize tooltips and in-app guides, and ensure every chart, filter, and segment supports a clear decision. This collaboration empowers the team, shortens feedback loops, and keeps us oriented toward customer outcomes rather than feature checklists.
Great analytics products give people confidence. By aligning on outcomes, instrumenting clean data, testing with discipline, and shipping thoughtfully, I’ve seen teams unlock deeper understanding and sustained growth. If you care about building products that illuminate the path forward, start with the questions customers need to answer—and let your analytics experience make those answers obvious.
Inspired by this post on Amplitude – Best Practices.
Capacity planning has always been a high-stakes exercise in customer service, and when you miss, the signal shows up fast in backlogs and SLAs. I’ve lived that pressure across multiple cycles, and 2026 will reward teams that plan differently.
AI fundamentally changes capacity planning because it changes the work. It resolves the bulk of your volume, speeds up execution, and elevates the complexity and value of what humans handle. The consequence is simple: planning models must evolve.
This is the final installment in my 2026 customer service planning series, and I’m focusing on the tension every leader feels right now—be ambitious about automation, but avoid the trap of understaffing if your assumptions don’t hold.
My goal is to share how AI changes the logic of capacity planning, what I’ve learned implementing these practices with my team and with customers, and the common traps to avoid.
Traditional planning rests on relatively stable assumptions: volume grows predictably, work types stay consistent, handle times don’t swing dramatically, and productivity improves slowly with better tools and training. In an AI-first model, none of that is guaranteed, and the fundamentals flip.
The mix of work changes as AI absorbs a growing share of simpler conversations, leaving humans with deeper, more time-consuming issues that demand human-to-human connection. Demand can actually increase when you remove friction, so AI can both resolve more and attract more volume. Human time splits differently as teammates solve customer problems and also review AI behavior, give feedback, improve content, and support system-level work. Performance becomes dynamic, not fixed—automation rate isn’t a one-time number; it can rise with care and fall with neglect.
If you plan for 2026 using a pre-AI model—assuming similar productivity, similar work mix, and a linear relationship between volume and headcount—you will underestimate what it now takes to run a high-performing support organization.
There are many metrics you can track, but the one to put at the center is automation rate (AI Agent involvement rate × AI Agent resolution rate). This single construct tells me what share of total volume AI actually resolves, how much work remains for humans, how much additional demand humans can absorb, and how ambitious I can be with headcount.
Early in the journey, I prioritize raising involvement—getting the AI involved in more conversations. Once involvement is high, I shift to resolution on the hardest remaining work, where each additional 1% of automation can represent several people’s worth of capacity.
In my 2026 plans, automation rate sits alongside projected inbound volume, average “output” per person for the more complex work that remains, and occupancy—how much time is allocated to customer-facing interactions versus operational and strategic work. Together, those inputs give a realistic picture of how many people you need and where they should spend their time.
First, plan boldly on automation, but match it with investment. I do not cap automation assumptions at 40–50% “because AI is new.” Many teams are already modeling 60%, 70%, even 80%+ for 2026—when they invest in AI ownership and content. The investment is non-negotiable: named ownership for AI performance (AI ops, knowledge management, conversation design), clear automation targets by work type (e.g., informational vs. personalized vs. actions vs. deep troubleshooting), realistic expectations for what’s easy to automate and what’s not, and a concrete plan to raise automation over time in monthly or quarterly steps rather than a single jump.
To decide where to invest first, I dig into the data. I start with the biggest volume drivers, separate content-led issues from those dependent on data or complex procedures, assume higher resolution potential for content-led topics once the knowledge base is in shape, and set more modest initial resolution expectations for system-dependent flows. Then I stair-step improvements as the systems, data contracts, and workflows mature.
In short, bold automation goals only work when paired with the team structure, content, and systems required to reach them—and the discipline to iterate.
Second, expect human “output” per person to go down. That’s a mindset shift. Historically, we assumed individual productivity would stay flat or tick up as tools improved. In an AI-first model, humans handle fewer conversations but more complex, cross-functional issues—and create more value despite lower case counts.
I model a lower “cases closed per person” than prior-year baselines, explicitly assume the remaining work is more complex and time-consuming, and redefine productivity to include system-level work like AI Agent improvements, content updates, and policy or workflow change management. I also report “capacity created” from automation alongside human outputs, so leadership sees the full picture.
Third, rethink occupancy: more time off the queues, on higher-value work. Traditional occupancy splits time between inbox and training, meetings, and breaks. Now there’s an expanding “out-of-inbox” portfolio that directly affects AI performance and overall capacity: reviewing AI-handled conversations, improving AI Agent triaging and handovers, contributing to content and procedures, feeding insights to product and engineering, and supporting system changes that reduce future volume.
I set lower inbox occupancy targets than before and make the rationale explicit. People aren’t working less—they’re working differently. In planning, I assume more time spent on improvement and system work, make it visible (for example, X% in inbox and Y% on AI and system improvement), and treat this as critical, not a “nice to have.” If you don’t proactively allocate it, it won’t happen—and your automation and performance targets will suffer.
Fourth, work with the finance team early, and treat your plan as a set of assumptions. Capacity planning with AI is a set of bets across automation rate, human output, demand growth, occupancy, and where surplus capacity (if any) goes. I bring finance in early, show that the plan is dynamic and directly tied to AI performance, and label every lever as an assumption with ranges.
I commit to a quarterly review cadence with finance to compare assumptions versus reality and adjust headcount, targets, and investment as needed. The risks are real: if automation grows slower than expected and you stop backfilling too early, you’ll be understaffed for months. Hiring and onboarding take time, so course-correcting late creates strain. If you do produce surplus capacity, have a clear strategy to reallocate those teammates to higher-value work—improving systems, feeding insights back to product, supporting new channels, and driving proactive CX—rather than defaulting to reductions.
I also set explicit guardrails—if automation rate misses by five points for two consecutive months, we pause planned reductions and revisit hiring gates. If it over-performs, we shift people into backlog eradication, content upgrades, or proactive outreach, so we bank compounding value.
To set your team up for success in 2026, anchor your plan on automation rate, be honest that humans will handle fewer but harder conversations, and protect time for system improvements. Partner early and often with finance, avoid shrinking too fast, and design a plan for surplus capacity so you’re never caught flat-footed.
If AI is going to handle the majority of your customer conversations, your plan has to be designed to help it do that well and to keep your team set up for meaningful, sustainable work. A 2026 plan built on adaptable assumptions—not fixed predictions—will hold up as your work, your systems, and your customers’ expectations continue to change.
If you’d like future editions like this, subscribe and stay close—I’ll keep sharing what’s working, what isn’t, and how to tune your customer support AI strategy in real time.
I’ve been closing the year with a deliberate reflection ritual for more than a decade, and this season I found fresh energy for it after listening to an insightful conversation with Teresa Torres and Petra Wille on All Things Product. Their approaches mirror the evolution many product leaders experience: moving from rigid annual goal-setting to values-led themes, longer time horizons, and a healthier respect for spaciousness. In my own practice, that shift has created better focus, less pressure, and far more meaningful outcomes.
Prefer to listen? You can find this episode here: Spotify | Apple Podcasts. I took notes with my team in mind and translated the discussion into a simple, values-driven framework that any product organization can adopt.
Why does annual reflection matter for product people? Because our work lives at the intersection of ambiguity, trade-offs, and time. If we only measure ourselves by shipped output or quarterly OKRs, we overlook the compounding value of learning, relationships, and judgement. I treat this ritual as a strategic reset: a chance to surface patterns, adjust expectations, and recommit to outcomes over output.
My own reflection habit started scrappy—paper notebooks, messy timelines, and even artful visualizations inspired by Dear Data by Giorgia Lupi & Stefanie Posavec. Like Petra, I’ve found that tactile, analog artifacts unlock insights I miss in a spreadsheet. Over time, I’ve kept the spirit and simplified the mechanics: a “what went well” review, a short list of hard lessons, and a handful of decisions that paid off—or didn’t.
The biggest evolution for me has been moving from rigid annual goals to values and themes. I still run OKRs, but I use them to track progress, not identity. The lens of process vs. outcome goals—reinforced by ideas from Atomic Habits—helped me set fewer, better commitments. For example, instead of “launch X by Y,” I’ll emphasize the cadence of customer discovery, the health of the product trio, and the quality of decisions made along the way.
One exercise that changed my practice is the “100 wishes” list. It’s powerful—and surprisingly difficult. Pushing past 30 or 40 wishes forces me to name latent interests and long-range intentions I rarely say out loud. Combined with decade-level themes, the list helps me balance ambition with patience. I don’t try to do it all next year; I use it to spotlight direction, not deadlines.
I also review patterns across years: Where did over-scheduling create hidden costs? When did I protect focus time and what did that unlock? Paul Graham’s Maker’s Schedule, Manager’s Schedule remains a useful calibration tool here. And when I feel the pull toward constant throughput, I revisit Stefan Sagmeister’s The Power of Time Off (TED Talk) to remind myself why strategically creating space often yields the most valuable ideas.
Of course, not every year follows plan—and that’s normal. Reflection helps me spot unrealistic expectations early and let them go. When setbacks hit, I’ll rewatch Dealing with Setbacks and re-ground in continuous discovery. The question isn’t “Did we do everything?” but “Did we learn fast, protect customer value, and make trade-offs aligned with our values?” That’s how empowered product teams compound impact.
My sharing philosophy has become more nuanced over time. Some reflections are public to invite dialogue and accountability; others stay private so I can process honestly. I’ve found it helpful to publish what I’m saying no to, capture a theme for the year ahead, and keep the rest for myself and my team. This balance preserves motivation while still contributing to the broader product management leadership community.
If you’re designing your own ritual, consider this lightweight flow: review wins and tough calls, write your “100 wishes,” extract a few values-based themes, then translate those into process goals for Q1. Revisit monthly, not just annually. If you like structured prompts, Chris Guillebeau’s How to Conduct Your Own Annual Review from The Art of Nonconformity offers a practical template you can adapt to your context.
For deeper dives and complementary ideas, I bookmarked these as part of my year-end reset: What I’m Saying No to This Year—And Why, Ask Teresa: My Leaders Still Want Roadmaps with Timelines—What Should I Do?, Scaling Impact: A Look at the Year Ahead (2022), Let’s Connect in 2025: A Look at the Year Ahead, The Interview Coach, and Petra’s own year-ahead reflections (here and her 2026 version). I also recommend revisiting the prior conversation on leadership and change: Role of Leadership in Transformations.
I’d love to hear how you approach your end-of-year reflection. What questions bring you the most clarity? Which practices help you set an intentional, values-driven path for the next year? Share your process—I’m always looking to learn from other product creators and leaders.
I rely on disciplined product benchmarks to turn strategic intent into measurable progress. In B2B technology, benchmarks give me and my team the clarity to prioritize what truly matters, align executives around shared outcomes, and course-correct before small gaps become growth-stalling problems.
Discover exclusive data and strategies from our Product Benchmark Report. Compare the B2B technology industry’s performance across key product metrics.
When I assess product health across a portfolio, I start with a core set of product benchmarks: activation rate, onboarding completion, time-to-first-value, weekly and monthly active accounts, feature adoption, cohort-based retention, expansion and contraction revenue, and support deflection. Together, these metrics show where the product creates value, where users get stuck, and which levers most efficiently drive product-led growth.
Benchmarks are only powerful if they inspire action. I instrument reliable analytics (Amplitude analytics) to capture consistent event data, pair that with in-app guides and product tours (Pendo, Intercom) to test friction hypotheses, and run A/B testing to validate changes with statistical rigor. From there, I translate insights into outcomes-based OKRs, so roadmapping and sprint planning focus on the few bets most likely to move our key product metrics.
I’ve seen this approach pay off repeatedly. When peer benchmarks revealed our user activation lagged, we simplified onboarding, clarified value propositions with sharper UX writing, and launched targeted in-app nudges. We didn’t just ship features—we improved the experience against a clear yardstick, and the subsequent lift in activation and early retention validated the strategy.
Cross-functional alignment is critical. I partner with customer success to interpret retention analysis by segment, with marketing to ensure messaging supports time-to-value, and with engineering to keep quality and reliability high. While product metrics lead, I also keep an eye on complementary signals like incident management and DORA metrics to ensure we’re not trading speed for stability.
If you’re leading a product organization, use benchmarks to calibrate ambition and create focus. Start by identifying the one or two metrics most predictive of long-term retention, set peer-informed targets, and iterate with continuous discovery. The result is a product strategy that is evidence-based, resilient to opinion cycles, and capable of compounding gains over time.
Ultimately, benchmarks aren’t about vanity; they are about velocity. With a shared view of where you stand against the B2B technology industry, you can make sharper bets, accelerate product-market fit, and turn your roadmap into a reliable engine for growth and customer value.
Inspired by this post on Amplitude – Perspectives.