How to evaluate an outbound lead generation agency (before you sign anything)
Choosing an outbound lead generation agency? This guide covers what to ask, what metrics matter, and how to avoid wasting 3 months of budget on the wrong setup.
How to evaluate an outbound lead generation agency (before you sign anything)
Most outbound lead generation agencies will show you a deck with open rates and logos before they can tell you a single client's positive reply rate. That's a problem. This page explains what to ask instead, which numbers actually matter, and how to tell the difference between an agency that ships pipeline and one that ships reports.
Why the standard agency pitch is broken
Open rates have been noise since Apple's Mail Privacy Protection launched in 2021. Apple prefetches tracking pixels before the user reads the email, which inflates open-rate data to the point where it tells you almost nothing about actual engagement. Any agency leading their pitch with "we hit 58% open rates" is either unaware of this or hoping you are. Either way, not a great sign.
The metrics that actually predict pipeline: positive reply rate (the share of contacted accounts that reply with genuine buying interest) and bounce rate (a deliverability health signal you want below 2%). Across our retainer programs, a positive reply rate above 2.5% on a cold list is a strong campaign. Above 4% means the targeting and copy are both working. Below 1% means something is wrong and needs to be fixed before you spend another dollar on volume.
Click-through rate is similarly unreliable when an agency strips tracking links to protect deliverability, which serious operators do. So if an agency is reporting CTR as a primary metric, ask whether they're using tracked links in outbound. If yes, that's a deliverability risk. If no, the CTR data is meaningless.
What a real outbound lead generation agency actually does
The job is not "send emails." The job is: build a targeted list of accounts that match your ICP, write copy that earns a reply, operate infrastructure that lands in the primary inbox, and iterate on what's working until the positive reply rate is consistently above threshold.
That's four distinct competencies. Most agencies are strong in one or two. The mistake I see most often is founders hiring an agency that writes great copy but has no deliverability infrastructure, so the emails never arrive. Or the reverse: technically clean infrastructure pointed at a list that's too broad to convert.
At minimum, the operational stack needs: dedicated sending domains (warmed for at least three weeks before any campaign volume), verified contact data with a bounce rate target under 2%, sending limits of 30-50 emails per mailbox per day, and reply handling fast enough that you don't lose a warm lead over a weekend.
The 7 things to evaluate before hiring an outbound lead generation agency
Positive reply rate benchmarks from live clients. Ask for the actual rate, not a range. If they can't answer, that's an answer. For context, 2-4% positive reply rate on cold outbound to a well-targeted list is achievable. Below 1% usually means list quality or copy issues.
Bounce rate discipline. Ask what their target bounce rate is and how they verify contacts before sending. If they're not aiming below 2%, their domains will degrade faster than you want.
Domain and mailbox setup. How many sending domains do they run per client? How long is the warmup? Serious agencies run separate infrastructure per client so one program's deliverability problems don't bleed into another's.
List-building methodology. Do they build lists manually, scrape databases, or use a verified data vendor? Ask specifically whether they verify emails before sending. The answer tells you a lot about how they think about deliverability.
Copywriting process. Do they write one sequence and let it run, or do they test variants? Ask to see a before-and-after from a sequence they iterated on. The agencies that treat copy as a variable to optimize are the ones worth hiring.
Reporting cadence and metrics. Weekly or biweekly check-ins with positive reply rate, bounce rate, and meetings booked per 1,000 contacts. If the report they show you during the sales process is full of open rates but light on reply data, expect the same once you've signed.
Market-specific experience. This matters more than it looks. An agency that has only run US-to-US outbound may not understand the nuances of European companies breaking into the American market, including time-zone sequencing, GDPR-adjacent compliance considerations, and cultural copy register. Ask for a specific client type they've run in your market.
Common problems that kill outbound programs (and what to do about them)
Deliverability collapse is the most common, and it usually shows up around weeks 6-10. Domains that were clean at launch start routing to spam as sending volume increases without proper warmup maintenance. The fix is spam placement testing via inbox placement tools before scaling volume, not after you notice replies dropping off. We run placement tests as a standard part of setup, not as a diagnostic tool after something breaks.
List exhaustion is the second problem nobody talks about. A tight ICP in a small market can burn through viable contacts faster than the program generates pipeline. For a US-targeted outbound program we ran for a European print-on-demand marketplace, we modeled the addressable account universe before launching and found the viable list was roughly 3,200 accounts. At 200 sends per week with standard sequencing, that's a 16-week runway before you're recycling contacts. Plan for that upfront or you'll be rebuilding strategy mid-engagement.
The third is a misaligned ICP definition between client and agency. The agency sends to "VP of Operations at manufacturing companies with 50-500 employees" because that's what the brief said. The client actually closes deals with the VP of Procurement at companies already using ERP software. That mismatch lives in the reply rate data, but only if someone is looking closely enough to notice and surface it. Ask how the agency handles ICP refinement after the first 200 sends.
If you want a deeper look at how deliverability is managed at the infrastructure level, our piece on cold email deliverability agency work covers the technical side in more detail.
Agency model comparison: retainer vs. performance vs. in-house
Model | Typical cost | What you get | Main risk |
|---|---|---|---|
Retainer agency | $4,000-$8,000/month | Full-service: list, copy, infrastructure, reporting | 3-month lag before you know if they're any good |
Performance agency | $500-$1,500/meeting booked | Aligned incentives on volume | May prioritize quantity over lead quality |
In-house SDR | $6,000-$9,000/month all-in | Full control, institutional knowledge builds | 6-9 month ramp, single point of failure |
Fractional/specialist | $2,000-$5,000/month | Strategy plus oversight, you handle execution | Only works if you have internal bandwidth to execute |
The retainer model makes sense if you want a program running without internal lift. The tradeoff is paying for 60-90 days before the data tells you whether anything is working. Performance models look attractive but often produce meetings that don't convert, because the incentive is to book the call, not qualify the prospect. In-house is the right long-term answer for most companies, but the ramp cost is real and the first hire is the hardest.
If the outbound lead generation agency route makes sense for where you are right now, book a discovery call and we'll tell you within 30 minutes whether we're the right fit.
When a cold email agency specifically makes sense
Cold email as the primary outbound channel works well when your ICP is reachable via verified email data, your deal size is large enough to justify 60-90 days of pipeline lag (roughly $5,000+ ACV is where the math starts working), and you don't need phone-based SDR support. For more on how the cold email side of this works, our cold email lead generation page goes deep on the mechanics.
Cold email is a worse fit when your ICP is hard to reach via email (consumer or SMB lists with high turnover), when the buying cycle requires relationship development that email alone can't carry, or when your market is small enough that burning contacts matters a lot. In that case, you probably want an outbound model that includes LinkedIn or phone touches alongside email.
European companies breaking into the US market tend to get more value from a cold email outbound agency than from trying to build US-based SDR capacity from scratch. The cost difference is significant, the time-to-first-meeting is faster, and hiring US sales reps remotely before you've proven the market is genuinely hard. We work with several European founders in exactly this situation, and the consistent pattern is that outbound email gets them to their first 10-20 US customer conversations 4-6 months faster than the SDR route would have.
For European companies considering this path, the European cold email agency page covers the compliance and market-entry nuances that US-focused agencies tend to miss.
What good looks like at 90 days
By the end of month three of a well-run outbound program, you should have: a positive reply rate above 2% on your primary sequence, a bounce rate below 2%, at least one sequence variant tested against the original, and enough meetings booked to draw conclusions about which ICP segments are converting. If the agency can't show you those four data points clearly at the 90-day mark, the engagement isn't working and you should say so directly.
For a US promotional products brand we work with, the 90-day mark looked like this: 3.1% positive reply rate on a sequence targeting procurement managers at mid-market companies, bounce rate at 1.4%, and 38 meetings booked from 1,100 contacts. That's what a tight program produces. Not magic, just consistent execution against a well-defined ICP with copy that iterates fast.
Five questions to ask any outbound lead generation agency on the first call
What's the positive reply rate on your best-performing active client, and what's the rate on your worst?
How do you handle domain burnout, and how quickly can you spin up new infrastructure?
What does your list-building process look like, and what's your target bounce rate?
How many sequence variants do you typically test in the first 60 days?
What would make you tell us this channel isn't working?
The last question is the most revealing. An agency with real conviction about what works will have a clear answer. One that hedges is optimizing for retaining the contract, not producing results.
We run programs for European companies entering the US market and for ecommerce brands using cold email to convert B2B buyers. If either of those fits where you are, book a discovery call. Bring your current ICP definition and your target account universe, and we'll give you a straight read on whether outbound is the right move and what a realistic 90-day number looks like.
