How do I know when to make a hire vs. wait?

Bob Baxley's 'Hire People' essay names it explicitly: 'no hire is better than the wrong hire.' If after a full structured loop the calibration meeting cannot land on a confident hire decision, don't hire. Six months of an open req is cheaper than 12 months of a wrong hire's underperformance plus the team's morale damage. The canonical mistake is hiring desperately to fill the req.

How do I handle a portfolio that looks great but the candidate can't articulate decisions?

Strong no-hire signal at most levels. Visual portfolio without design-thinking depth means the candidate has executed against direction (likely from a strong design lead or design manager) but hasn't owned design decisions. They will struggle in scope-ambiguity. Probe specifically: 'tell me about a decision on this project that you made independently of your manager's direction.' If the answer is vague, the signal is confirmed.

What's the difference between portfolio screen and onsite craft round?

Portfolio screen tests whether the candidate has design-thinking depth across their past work. Craft-and-taste round tests how the candidate thinks about hypothetical problems in real-time. Both are needed. The portfolio screen alone misses the candidate who has strong past work but freezes in unfamiliar problems; the craft round alone misses the candidate who is articulate in interviews but hasn't actually done the work.

How important is leveling correctness?

Critical. The wrong-level hire (too senior or too junior) causes performance management problems within 3–6 months. The too-senior hire feels in over their head; the too-junior hire feels unrecognized. Both lead to attrition. Calibration is the single most leveraged hiring activity for leveling correctness; skipping it is the canonical leveling-drift cause.

How do I source senior-design-manager candidates externally?

Senior-network channels. Mia Blume's Design Dept network, executive search firms (Heidrick & Struggles, Spencer Stuart, Egon Zehnder design-leadership practices), and direct senior-design-leadership outreach are the dominant channels. Posting and waiting doesn't work at senior-design-manager+ tier. Director-of-design+ tier is almost exclusively executive-search-driven.

How do I run a debrief packet that produces clear hire/no-hire signal?

Structured template with: (1) specific signal observed (with examples), (2) specific concerns observed (with examples), (3) leveling recommendation (with reasoning), (4) hire/no-hire vote. The template forces specificity. Vibe-debriefs ('I liked them' / 'something felt off') are noise; structured debriefs are signal. Julie Zhuo's chapter 7 covers the debrief-packet mechanics.

How do I handle a hiring loop where 4 of 6 interviewers are hire and 2 are no-hire?

The strong default is no-hire. The 2 no-hire votes have specific concerns; the 4 hire votes typically have less specific signal. The calibration meeting works through the no-hire concerns explicitly: are they about the candidate or about the interviewer's own bias? Are they consistent across the 2 interviewers or different? Are they level-relevant or level-blocking? Bob Baxley's writing is explicit: 'one strong no-hire is more meaningful than three weak hires.'

Design Manager Hub

Hiring and Leveling Designers: The Most Leveraged Manager Activity (2026)

By Blake Crosley · Last verified 2026-04-30

In short

Hiring is the single most leveraged design-manager activity per Bob Baxley's 'Hire People' essay, Julie Zhuo's chapter 7 ('Hiring Well') in The Making of a Manager, and the publicly-discussed hiring practices at FAANG and design-strong consumer companies. The mechanics: structured portfolio screen, structured onsite (4–6 rounds), structured debrief packets, structured calibration. The leveling rubric: most companies have explicit leveling docs (Hello Interview's design-leveling rubric is the public reference) that map scope to title. The dominant failure modes: hiring desperately to fill an open req, hiring against vibe instead of structured signal, mis-leveling a hire to the open req level, skipping the calibration meeting.

Key takeaways

Hiring is the single most leveraged design-manager activity. Bob Baxley's 'Hire People' essay frames it explicitly: 'the manager's job is to hire well, then get out of the way; everything else is downstream of who is on the team.' Strong design managers spend 10–15% of their calendar on hiring during active req cycles.
The structured hiring loop reduces noise. Portfolio screen, hiring-manager screen, structured 4–6 round onsite, debrief packets in writing, structured calibration. Hello Interview's design-leveling guides and Julie Zhuo's chapter 7 in The Making of a Manager are the canonical references.
The portfolio screen is the load-bearing first step. 60–90 min, structured. The candidate presents 3–5 case studies. The hiring manager and 1–2 senior designers attend. The signal: can the candidate articulate design intent, design decisions, and design trade-offs in real-time? The failure mode is the gallery-walk portfolio review where the candidate narrates pretty pictures without articulating decisions.
Leveling rubrics map scope to title. Hello Interview's design-leveling guide (hellointerview.com/blog) is the public reference. Most companies have internal rubrics that include: scope of impact, complexity of problems, mentorship and team-leadership signal, and cross-functional partnership signal. Always ask the recruiter for the leveling rubric in interview.
Calibration prevents leveling drift. Most large tech companies run calibration meetings where hiring managers compare candidate files against an existing employee at the same level. The calibration meeting prevents 'this candidate is great so let's hire at L5' drift; it forces 'this candidate is at L5 because they perform comparably to existing L5 employees.' Skipping calibration is the canonical leveling-drift cause.
The dominant hiring failure mode is hiring desperately to fill an open req. The team has a vacancy. The manager hires the first acceptable candidate. Six months later the candidate is the team's biggest performance management problem. Bob Baxley's 'Hire People' essay names this explicitly: 'no hire is better than the wrong hire.'
External hiring at the senior-design-manager+ tier requires senior-network channels. Mia Blume's Design Dept network, executive search firms (Heidrick & Struggles, Spencer Stuart, Egon Zehnder), and direct senior-design-leadership outreach are the dominant channels. Posting and waiting doesn't work at director-of-design+ tier.

The structured design hiring loop

The structured design hiring loop drawn from Julie Zhuo's chapter 7 ('Hiring Well') in The Making of a Manager, Bob Baxley's 'Hire People' essay, the publicly-discussed hiring practices at FAANG and design-strong consumer companies, and Hello Interview's design-interview guides:

Recruiter screen (30 min). Logistics, role context, leveling calibration. The recruiter screens for: experience-relevant-to-the-role (years, scope), interest-in-the-company, and basic compensation alignment.
Hiring-manager screen (60 min). Behavioral plus initial portfolio walkthrough. The hiring design manager probes: past leadership decisions (for design-manager candidates) or past design decisions (for IC candidates), design point-of-view, and cross-functional working style.
Portfolio screen (60–90 min). The candidate presents 3–5 case studies. The hiring manager and 1–2 senior designers attend. The signal: can the candidate articulate design intent, design decisions, and design trade-offs in real-time?
Onsite (4–6 rounds, 60 min each):
- Craft and taste interviews (2–3 rounds): senior designers walk the candidate through hypothetical design problems.
- Behavioral / leadership panel (60 min): structured around past leadership decisions, hiring decisions, and difficult performance situations (for design-manager candidates).
- Cross-functional partnership round (60 min): typically with a senior PM and a senior engineering manager.
- Design-strategy round (60 min, for senior-design-manager+ candidates): a multi-quarter design-strategy problem.
Debrief (1 day after onsite). Each interviewer submits a written debrief packet within 24 hours. The packet includes specific signal: what did the candidate do well, what concerns emerged, what level does the interviewer recommend.
Hiring committee / calibration (within 1 week). The hiring manager runs a structured calibration meeting. The candidate's debrief packets are reviewed against an existing employee at the recommended level. Hire/no-hire decision is made; level is finalized.
Offer and negotiation. The recruiter and hiring manager handle the offer. Compensation negotiation follows the company's standard practice; leveling negotiation requires going back to calibration.

The portfolio screen: load-bearing first step

The portfolio screen is the load-bearing first step in design hiring. The mechanics:

Format. 60–90 min, structured. The candidate presents 3–5 case studies. The hiring manager and 1–2 senior designers attend.
Signal. Can the candidate articulate design intent, design decisions, and design trade-offs in real-time? Specifically: does the candidate frame the problem first (user, constraints, business context), then walk through decisions (what was tried, what was rejected, what shipped), then articulate trade-offs (what was compromised, what would they do differently)?
Failure mode: gallery-walk. The candidate narrates pretty pictures without articulating decisions. 'Here's the homepage, we made it dark, then we made it light.' No problem-framing, no decision-articulation, no trade-off-acknowledgment. The candidate has visual portfolio but not design-thinking depth. Bob Baxley's hiring writing names this pattern explicitly.
Failure mode: process-tour. The candidate spends 30 min walking through their company's design-process framework (research-then-ideation-then-prototyping-then-delivery) without showing specific work. Process literacy without specific decisions means the candidate has not actually owned design decisions.
Strong-signal patterns. The candidate names a specific user constraint ('the user has 30 seconds in the elevator before their meeting'), shows two or three tried-and-rejected directions, articulates the trade-off they made and would not make again, and acknowledges what's still unfinished. Strong candidates show vulnerability in their work — not defensiveness.
Questions to ask. 'What was the hardest decision you made on this project?' 'What would you change if you could re-do it?' 'What did you learn that you've applied since?' These questions probe design-thinking depth past the artifact.

Leveling rubrics: mapping scope to title

Leveling rubrics map scope to title. The mechanics drawn from Hello Interview's design-leveling guides, Julie Zhuo's chapter 7, and the publicly-discussed leveling rubrics at FAANG:

The dimensions. Most company rubrics evaluate: (1) scope of impact (single feature → multi-team product → company-wide design language); (2) complexity of problems (well-defined → ambiguous → novel); (3) mentorship and team-leadership signal (none → informal → formal); (4) cross-functional partnership signal (executes against PM direction → influences PM direction → drives multi-team strategy).
The mapping. Junior / IC1: single-feature scope, well-defined problems, no mentorship, executes against PM direction. Mid / IC2: feature-area scope, mostly-defined problems, informal junior mentorship, partners with PM peer. Senior / IC3: cross-feature scope, ambiguous problems, formal mentorship of 1–2 designers, influences PM direction. Staff / IC4: multi-team scope, novel problems, leads design-system or design-language work, drives cross-functional strategy. Principal / IC5: company-wide scope, novel problems, drives the design language and the design-org strategic direction.
The manager track. Line-design-manager (M1): manages 3–8 designers, owns team-level scope. Senior-design-manager (M2): manages 2–4 line-managers, owns multi-team scope. Group-design-manager (M3): manages 2–4 senior-managers, owns multi-product-surface scope. Director (D1): manages multiple GDMs and senior-managers, owns design-org strategic direction. VP (D2): owns the entire design org, sits at C-suite tier.
Hello Interview as public reference. Hello Interview's design-leveling guide (hellointerview.com/blog) is the most-cited public reference on FAANG-tier design-leveling. The guide includes specific scope examples and behavior-rubrics at each level. Working through the Hello Interview guide is essential prep for any FAANG-tier design hiring or interview process.
Asking for the rubric. Always ask the recruiter for the leveling rubric in interview. The strong-signal company shares it; the weak-signal company says 'we evaluate holistically' or 'we don't share rubrics.' Companies that don't share rubrics typically have leveling drift and harder calibration.

Calibration: preventing leveling drift

Calibration prevents leveling drift. The mechanics:

The structured calibration meeting. Most large tech companies run calibration meetings where hiring managers compare candidate files against an existing employee at the same level. The calibration meeting prevents 'this candidate is great so let's hire at L5' drift; it forces 'this candidate is at L5 because they perform comparably to existing L5 employees.'
The cross-team calibration. At senior-design-manager+ tier, calibration is cross-team. Multiple hiring managers review the candidate file together; the candidate is leveled relative to existing employees across multiple teams. The cross-team check prevents single-team drift.
The calibration anchors. The strong-signal company has 1–2 named existing employees per level as 'calibration anchors.' The hiring manager makes the case: 'this candidate performs comparably to [Anchor X] who is a confirmed L5.' The named anchor forces specificity rather than vibe-leveling.
Skipping calibration. The canonical leveling-drift cause is skipping calibration in service of urgency. The team has an open req, the candidate is acceptable, the hiring manager bypasses calibration to make the offer fast. Six months later the candidate is at the wrong level relative to the team and either un-derperforms (hired too senior) or feels frustrated (hired too junior).
Calibration as a development opportunity. Strong design managers use calibration as a development opportunity for their team's senior ICs. Including a senior IC in the calibration meeting (as observer or contributor) builds their judgment about leveling, which is preparation for their own management transition.

Frequently asked questions

How do I know when to make a hire vs. wait?: Bob Baxley's 'Hire People' essay names it explicitly: 'no hire is better than the wrong hire.' If after a full structured loop the calibration meeting cannot land on a confident hire decision, don't hire. Six months of an open req is cheaper than 12 months of a wrong hire's underperformance plus the team's morale damage. The canonical mistake is hiring desperately to fill the req.
How do I handle a portfolio that looks great but the candidate can't articulate decisions?: Strong no-hire signal at most levels. Visual portfolio without design-thinking depth means the candidate has executed against direction (likely from a strong design lead or design manager) but hasn't owned design decisions. They will struggle in scope-ambiguity. Probe specifically: 'tell me about a decision on this project that you made independently of your manager's direction.' If the answer is vague, the signal is confirmed.
What's the difference between portfolio screen and onsite craft round?: Portfolio screen tests whether the candidate has design-thinking depth across their past work. Craft-and-taste round tests how the candidate thinks about hypothetical problems in real-time. Both are needed. The portfolio screen alone misses the candidate who has strong past work but freezes in unfamiliar problems; the craft round alone misses the candidate who is articulate in interviews but hasn't actually done the work.
How important is leveling correctness?: Critical. The wrong-level hire (too senior or too junior) causes performance management problems within 3–6 months. The too-senior hire feels in over their head; the too-junior hire feels unrecognized. Both lead to attrition. Calibration is the single most leveraged hiring activity for leveling correctness; skipping it is the canonical leveling-drift cause.
How do I source senior-design-manager candidates externally?: Senior-network channels. Mia Blume's Design Dept network, executive search firms (Heidrick & Struggles, Spencer Stuart, Egon Zehnder design-leadership practices), and direct senior-design-leadership outreach are the dominant channels. Posting and waiting doesn't work at senior-design-manager+ tier. Director-of-design+ tier is almost exclusively executive-search-driven.
How do I run a debrief packet that produces clear hire/no-hire signal?: Structured template with: (1) specific signal observed (with examples), (2) specific concerns observed (with examples), (3) leveling recommendation (with reasoning), (4) hire/no-hire vote. The template forces specificity. Vibe-debriefs ('I liked them' / 'something felt off') are noise; structured debriefs are signal. Julie Zhuo's chapter 7 covers the debrief-packet mechanics.
How do I handle a hiring loop where 4 of 6 interviewers are hire and 2 are no-hire?: The strong default is no-hire. The 2 no-hire votes have specific concerns; the 4 hire votes typically have less specific signal. The calibration meeting works through the no-hire concerns explicitly: are they about the candidate or about the interviewer's own bias? Are they consistent across the 2 interviewers or different? Are they level-relevant or level-blocking? Bob Baxley's writing is explicit: 'one strong no-hire is more meaningful than three weak hires.'

Sources

About the author. Blake Crosley founded ResumeGeni and writes about design management, hiring technology, and ATS optimization. More writing at blakecrosley.com.