Think Tank
RSS

Building a Carrier Scorecard That Actually Changes Behavior

HEAVY DUTY TRUCKS DRIVE IN A LINE DOWN A HIGHWAY — Tractor trailers outside of the Port of Philadelphia. Photographer: Hannah Beier/Bloomberg

June 11, 2026

Most shippers have a carrier scorecard. Far fewer have one that does anything. The document gets built, distributed once or twice a quarter, and then quietly ignored by the carriers it was supposed to influence. Performance doesn’t improve; conversations stay the same, and the scorecard becomes a reporting artifact rather than a management tool.

Most transportation teams have access to plenty of data: on-time pickup, on-time delivery, tender acceptance, billing accuracy, claims ratios, dwell time. The problem is how that data gets packaged, delivered, and tied to consequences. For shippers without dedicated transportation analytics resources, outsourced freight program management can fill that gap. A scorecard only changes behavior when carriers understand what’s being measured, agree the measurement is fair, see the numbers regularly, and know what happens when those numbers move in the wrong direction.

Below is a practical framework for building one that holds up.

Start with the metrics that map to business outcomes. A common mistake is measuring everything that can be measured. Carriers receiving a scorecard with eighteen key performance indicators will optimize for none of them. Pick the four to six metrics that genuinely affect your operation and your customer. For most shippers, the short list looks something like this:

On-time pickup and on-time delivery. Define "on-time" precisely. Is it within the appointment window, or to the minute? Does a fifteen-minute grace period apply? Carriers can’t hit a target they have to guess at.
Tender acceptance rate. This is particularly important for contracted lanes. A carrier with a 95% percent acceptance rate on paper but 70% in practice is creating routing guide depth costs that you are absorbing. The routing guide math matters. According to FreightWaves’ SONAR data, tender rejection rates surged to 8.36% in December, 2024, the highest level since June 2022, and 2025 started with rejections running above the prior two years’ levels. A carrier that accepts 95% of tenders in a soft market and 70% when capacity tightens isn’t a 95% carrier. The scorecard should reflect the full picture.
Billing accuracy. Invoices that match the rate confirmation the first time, without disputes or rebills. This is often overlooked and quietly expensive. To put a number on it: industry analysts cited by SupplyChainBrain estimate that between 15% and 66% of carrier invoices contain some kind of error, with one freight auditor reporting that one in four invoices it reviews gets rejected. Most of those rejections involve incorrect accessorial charges. This is a scorecard problem: If billing accuracy isn’t tracked and tied to consequences, there’s no signal for carriers to fix it.
Claims frequency and resolution time. Both matter. A carrier with low claims that drags resolution out for 90 days isn’t a low-claims carrier in any meaningful sense.
Communication responsiveness. Harder to quantify, but worth attempting. Ir includes average time to respond to status requests, tracking updates and exception notifications.

If a metric isn’t directly tied to cost, customer experience or operational risk, it probably doesn’t belong on the scorecard.

Weight the categories so the math reflects priorities. Not every metric carries equal weight. A shipper with strict retail compliance windows will weight on-time delivery far more heavily than billing accuracy. A shipper with thin margins and high invoice volume may flip that balance. Pick weights that reflect what actually hurts when it goes wrong, and document them in the scorecard itself so carriers can see the logic.

A simple weighted scoring model: Each metric gets a weight totaling 100; each carrier earns a percentage score on each metric, and the weighted average produces an overall grade. Carriers can then see exactly where they are losing points and where improvement would move the needle.

Benchmark against peers, not just absolutes. A carrier scoring 92% on-time delivery looks fine in isolation. But if the rest of your carrier base is averaging 96%, that 92% is a problem. Including peer benchmarks on the scorecard, even anonymized, gives carriers context. It also gives you cover during the conversation. You’re not telling them they’re bad; you’re telling them they’re below the group, which is achieving these numbers under the same conditions.

Deliver the scorecard in a structured cadence. Quarterly business reviews (QBRs) with detailed scorecards work. Annual reviews don’t. By the time 12 months of data is presented, the carrier has no memory of the lanes, loads and exceptions that produced it, and neither do you.

A workable rhythm:

Monthly: automated scorecard distribution, no meeting required
Quarterly: formal QBR with top carriers, performance discussion, action items
Annually: contract review, pricing discussion, lane reallocation

Carriers begin to anticipate the cadence and adjust accordingly. Those who don’t adjust become candidates for replacement, and the scorecard provides the documentation to support that decision.

Tie scores to consequences and rewards. This is where most scorecard programs fall apart. If a carrier scoring 75% gets the same volume next quarter as a carrier scoring 95%, the scorecard is theater. Behavior only changes when scores produce outcomes.

Common mechanisms include:

Tier-based volume allocation, where higher-performing carriers receive first-tender priority on premium lanes;
Performance-based rate adjustments at contract renewal;
Probationary status for carriers below a threshold, with a defined improvement window, and
Removal from the routing guide for sustained underperformance

The mechanism matters less than the consistency. Carriers need to believe the scores translate into real decisions. Once they do, the scorecard becomes self-enforcing.

The incentive to act on scorecard data is only increasing. According to FreightWaves, the truckload market reached near-equilibrium by mid-2025 after years of carrier attrition, with tender rejection rates in major Southeast markets crossing 10% for the first time in nearly three years. Shippers who waited to build their routing guides around performance data found themselves with fewer options and less leverage when capacity tightened. The scorecard is the tool that tells you which carriers are worth holding before the market shifts.

Build the operational capability to act on what you measure. A scorecard creates obligations on your side too. If you tell a carrier their tender acceptance is 78%, you need clean data to back that up, a process for reviewing exceptions they flag, and the bandwidth to follow up on the action items that come out of the QBR. Many shippers build elaborate scorecards and then quietly stop using them because the operational lift was underestimated.

This is one of the areas where outsourced freight program management services can take pressure off internal teams. Outsourcing the data aggregation, scorecard production, carrier QBR coordination and routing guide enforcement to a partner who already has the systems in place lets the shipper focus on the strategic conversations rather than the spreadsheet maintenance. For organizations without dedicated transportation analytics resources, this division of labor is often what makes a scorecard program sustainable past the first six months.

Audit the scorecard itself, annually. Metrics drift. A KPI that mattered three years ago may no longer reflect current operations. A peer benchmark calibrated against a different carrier mix can quietly mislead. Once a year, sit down and ask whether the scorecard still measures the things that matter, whether the weights still reflect priorities, and whether the consequences are still being applied. A scorecard that goes unaudited becomes a scorecard that goes unread.

The carriers you most want to retain are the ones who appreciate a rigorous scorecard. They already know they’re performing well, and they want a buyer that can recognize the difference between them and a marginal competitor. The carriers that push back hardest on measurement are usually the ones the data was going to expose anyway.

Build the scorecard for the carriers you want; hold everyone to it consistently, and the routing guide largely sorts itself out.

Mike Broeckling is vice president of operations at Alpha Zero Logistics.

LTL/Truckload Services Supply Chain Finance & Revenue Management Supply Chain Visibility Transportation Management Quality & Metrics Regulation & Compliance