Senior Technical Program Manager, AI Platforms · Product.ai
The Commerce Truth Scientist. Bets on the human against the hype machine — building the systems that hand shoppers verified truth instead of a promise broken in the fine print. Checkout codes are where it starts; the method generalizes anywhere people are sold hype instead of facts.
Turns raw checkout signal into a measurable win for the shopper — proven on live traffic, not asserted. An ad-tech systems engineer turned commerce-intelligence architect who documents the spine, ships the code, and drives the number until it’s the right number. The north star is a metric that can’t be gamed without betraying the shopper. Because if the code actually works, both sides win.
The engineer who decides what 1.25 million checkout codes do every day — and refuses to let a single one lie to a shopper.
Specs the spine before the code — the schemas, contracts, and data lineage a verification platform stands on. The architecture is documented before the first line is written.
Trained the models that predict whether a checkout code will work — beating the prior approach on both accuracy and calibration, then locking a production-grade candidate.
Took the scoring pipeline live and kept it live through a run of failure modes in a single day. The fix was the infrastructure that prevents the whole class of failure — not four patches.
Designs the live A/B experiments that prove verified intelligence moves shopper outcomes — measured on real users, over real windows. Validation, not vibes.
The empirical instinct, forged across a career in ad-tech: point-in-time correctness, reproducibility, and a refusal to operate against a number he hasn’t verified.
Mapped ~35 production systems into one dependency picture to plan a platform cutover — every hazard ranked, every surface accounted for before anything moves.
Three generations of Graham’s family have been in medicine. What he inherited was not the profession — it was a conviction: that data and design exist to produce clarity for people in distress.
A shopper at checkout is a small version of that. They want the code to work. They want the price to be fair. They want the transaction clean. Validated intelligence means they get it.
That is why the success rate is the right metric to be measured against — it cannot be gamed without betraying the shopper. If the code actually works, both sides win. The covenant runs all the way down to the number Graham will let himself be judged by.
A code is deployed to win the click, then broken in the fine print. I build the forensics engine that drags that fine print into the light — so the shopper gets the truth, not the hype. Graham, on the work
Architecture documented before code — the schemas, the contracts, the spine. Once the spine is clear, move fast and iterate.
“I document architecture before I touch code. Once the spine is clear, I move fast and iterate.”
Explore the problem and the solution space through several models in parallel, then synthesize — covering ground without sacrificing rigor.
“I run the core questions through multiple models in parallel, then synthesize. It’s how I cover ground without losing rigor.”
When a system has known failure modes, the load-bearing fix is the infrastructure that prevents recurrence — not the one-off hotfix. Build the thing that catches the whole class of failure.
“The meta-fix breaks the find-it, patch-it, repeat cycle.”
Point-in-time correctness, typed schemas, reproducibility. No corners cut on the parts that matter — even working solo.
“I don’t cut corners on the parts that matter, even solo.”
No new features built on the legacy stack — triage only, to keep the lights on. Every hour spent on the old machine is a tax paid to buy time for the new one.
“Every hour spent on the legacy stack is a tax paid to buy time for the new machinery.”
The systems must be grounded in truth and beauty — real outcomes for real people. Not dashboards for their own sake. Not metrics that flatter the org.
“If the code actually works, both sides win.”
Most of the work is the infrastructure behind the surface. What it adds up to: a verification engine that closes the loop from raw checkout signal to a measurable win for the shopper.
The pipeline that scores whether a code will work — in production, at scale.
Owns it end-to-end: validating the data, training the predictive model, designing the live shopper experiment, and the scoring service itself. Now scoring roughly 1.25 million codes every day.
The migration to a sovereign commerce-verification platform.
Drives the architecture and the build — schema design, shadow validation against live traffic, and intelligent routing. The systems the rest of the platform reads its ground truth from.
The number the whole team now operates against.
It exists because Graham refused to accept the wrong one — finding and fixing a bug that had been under-counting results by more than half. A metric that can’t be gamed without betraying the shopper.
One dependency picture for the entire platform cutover.
Mapped ~35 production systems, ranked every hazard, and grouped surfaces by migration risk — the plan that lets the cutover happen without breaking what shoppers rely on.
A platform specification one day, a schema the next, a production fix the day after. Most operators sit at one tier; Graham moves across all three in the same week.
Diagnosed and fixed a bug that had under-counted results by more than half while measuring the first true baseline. The number the team now operates against exists because he refused the wrong one.
When a run of failure modes surfaced in a single day, the response was infrastructure that prevents the whole class of failure — not a stack of one-off patches.
The covenant runs all the way down to which metrics he picks. The success rate is the right north star precisely because it cannot be gamed without betraying the shopper.
Years of supervised and unsupervised ML for ad pacing produced the systems instinct; years across publishing, data, and data science produced the cross-layer fluency. Both forged the architect.
Physics and economics gave him the measurement instinct; music — orchestral and jazz — gave him the operating posture: play in the same key, hit the changes together, trust the room.
Document the spine, then ship the code that proves it on real users. Refuse work that corrodes the shopper. Drive measurement until the number you stake your name on is the right number.
Product.ai builds with operators like Graham — engineers who own the spine and the shipped result both. See open roles →