Skip to content

AI PROFICIENCY ASSESSMENT

Hire for how people actually use AI.

Pick a role, pick a level, see the Leverage Profile. Caliber scores how a candidate redesigns the job around AI, not whether they can name the tools.

Built by Inflection Group, the operator-led MSP and advisory firm. Pilot cohorts open monthly by role.

Everyone says they want AI-first people. Almost no one can test for it.

A resume keyword tells you nothing about how someone actually works. A coding test misses every non-technical role. Self-reported quizzes reward confidence, not competence.

SAMPLE ASSESSMENT

Pick a role. See how the same job gets done at four levels of AI leverage.

One scenario per role. Four candidates at L1 Manual through L4 Architect. The Leverage Profile underneath is what Caliber would score for top-tier work at that role.

BRIEF · SALES · WEEKLY-PIPELINE-REPORT

Give me a weekly pipeline report every Friday.

You are a mid-level AE on a 12-person sales team. Every Friday by 4pm your VP wants a one-page pipeline report covering stage movement, the five biggest deals, and anything stalled 14+ days. The source is a 4,000-row Salesforce export. The audience is the head of sales and the CRO. Produce the report. Tell us how you did it.

L1 · MANUALDoes it by hand. AI is absent or cosmetic.

BUILD LOG

I open the Salesforce export in Excel, sort by stage and amount, and pull the top deals into a Google Doc. I write the narrative myself. I do not use AI on any of it. I prefer to see every number with my own eyes before it goes to the CRO. About two and a half hours every Friday.

OUTPUT

WEEKLY PIPELINE / WEEK OF JUNE 3
Top 5 deals by amount:
  Acme Corp     $480K    Negotiation
  Globex        $310K    Proposal
  Initech       $290K    Discovery
  Hooli         $245K    Closed Won
  Umbrella      $210K    Proposal
Stage totals: Disc 14, Prop 9, Neg 4, CW 2, CL 3.
Notes: Acme legal still reviewing MSA. Hooli closed Tuesday.
Stalled: 3 deals untouched 14+ days. Will follow up Monday.
Submitted to head of sales 3:48pm.
L2 · ASSISTEDUses AI as a faster search box. One-off prompts.

BUILD LOG

I export the CSV, paste it into ChatGPT, and ask it to summarize the pipeline by stage and surface anything stalled. I copy the answer into a doc and add the headline coverage number I grabbed from a Salesforce report. I did not double-check the totals against the source. About 35 minutes start to finish.

OUTPUT

WEEKLY PIPELINE / WEEK OF JUNE 3
Pipeline coverage: 3.4x quota (per ChatGPT summary of export).
Top movement: Acme Corp moved Proposal -> Negotiation.
Stalled deals (no activity 10+ days): 7 listed below.
Narrative: "Pipeline is healthy heading into Q3 close week" (ChatGPT).
Watch item flagged by ChatGPT: "Acme legal exposure may slip the quarter."
[Note from manager: Which 7 stalled deals? Send list.]
L3 · AUGMENTEDReusable prompt, verified output, humans on edges.

BUILD LOG

I wrote a reusable prompt that takes the Salesforce export, returns the report in our fixed format, and flags anomalies (no activity 14+ days, amounts that swung by more than 30%, stage regressions). I spot-check the top five numbers against the Salesforce dashboard every Friday before I post. Anything flagged anomalous routes to me for a one-line human judgment before it goes out. About 20 minutes including the verification step.

OUTPUT

WEEKLY PIPELINE / WEEK OF JUNE 3
Coverage: 3.4x (verified vs Salesforce dashboard, 3:31pm).
Stage movement: 4 deals advanced, 1 regressed.
Anomalies flagged for human review (3):
  Globex amount changed $260K -> $310K (rep override, accepted).
  Pied Piper $180K stage regressed Discovery -> Disqualified (real signal, followed up with rep).
  Vandelay $415K new in Discovery, 24 hours old, large for stage (asked rep to confirm sizing).
Stalled (14+ days, no activity): 3.  Names + last-touch dates attached.
Action items routed to reps in Slack with deal context. Posted 3:42pm.
L4 · ARCHITECTRedesigned the workflow around AI. Reusable systems.

BUILD LOG

I connected Claude to Salesforce and Slack. A skill I built pulls the week's pipeline every Friday at 2pm, generates the report in our format, posts a draft to a private Slack channel, and waits for me to verify the headline metrics. After my green light it auto-posts to the leadership channel and DMs each AE their stalled-deal list with proposed next steps. I review the prompt monthly and verify the top-line numbers every week. Edge cases (regressions, large amount changes, brand-new deals over $250K) escalate to me as a Slack DM. I spend about 10 minutes on this end-to-end.

OUTPUT

WEEKLY PIPELINE / WEEK OF JUNE 3   [auto-generated 14:02 PT, verified 14:11 PT]
Coverage 3.4x   Stage movement +4 / -1   New >$250K this week: 2
Top 5: Acme $480K (Neg), Globex $310K (Prop), Initech $290K (Disc), Hooli $245K (Won), Umbrella $210K (Prop)
Escalations to head of sales (3):
  1. Pied Piper $180K stage regressed Discovery -> Disqualified
  2. Vandelay $415K new in Discovery (24h old, large for stage)
  3. Acme MSA in legal 18 days (above 14d SLA)
Rep DMs sent: 7 AEs, total 14 stalled deals with proposed next steps.
Time spent by AE this week: 11 min.   Time spent prior method (Q1 baseline): 142 min.

LEVERAGE PROFILE · SALES · L4 BENCHMARK

What “great” looks like for this role.

Tool Fluency
4/4
Workflow Architecture
4/4
Judgment & Verification
4/4
Leverage Ceiling
4/4
Responsible Use
3/4

WHAT WE MEASURE

Five dimensions. One picture of how this person works.

  • 01

    Tool Fluency

    Can they drive the tools: prompting, context, connectors, automation.

  • 02

    Workflow Architecture

    Do they redesign the job around AI, or bolt it on as a faster search box.

  • 03

    Judgment & Verification

    Do they know when AI is wrong, how they check it, where humans stay in the loop.

  • 04

    Leverage Ceiling

    How far they push: one-off prompts versus reusable systems and agents.

  • 05

    Responsible Use

    How they handle data, confidentiality, and disclosure.

WHO IT IS FOR

Built for the people doing the hiring, not the testing industry.

  • Hiring teams

    Building AI-first benches who need a real signal, not a quiz.

  • Operating partners

    Assessing the talent inside a portfolio company.

  • Founders

    Who need every early hire to multiply themselves with AI.

Stop guessing who is actually AI-first.

Pilot cohorts open monthly by role. Sales, Operations, and Marketing first. We email you when your role unlocks, send a five-question intake, and book a 20-minute walkthrough.

We will not share your email. One message when your role opens. Unsubscribe in one click.

HIRING NOW · MANUAL SERVICE

Need a Leverage Profile this week, not next quarter?

Inflection Group runs Caliber as a manual service today. We author the scenario for your role, score the candidates you send, and deliver each Leverage Profile inside five business days. Three pilot orgs per quarter run free while we onboard the self-serve product.

Email Shane to start a pilot