Product.ai / Join / Projects / Agent-API Recommendation Attribution (Layer 2) — instrument the recommendation-decision moment
Project Open to Alpha Team

Agent-API Recommendation Attribution (Layer 2) — instrument the recommendation-decision moment

Build Layer 2 instrumentation — the API gateway agent attribution layer that captures which AI agents (Claude Code, Cursor, ChatGPT, Gemini, custom agents) recommend Product.ai or SimplyCodes APIs in their reasoning loops. MCP session headers, user-agent strings, request-pattern fingerprints. Three-layer instrumentation: developer-intent prompt battery + API-gateway agent attribution + code-generation trace. Output is a working attribution pipeline plus a public methodology paper that establishes Product.ai as the category convener.
Project Overview
Discipline
brand-content-producer · content-engineer · AI Systems — Data Scientist / ML Engineer · AI Systems — AI Engineer
Duration
3 weeks
Compensation
Your stated freelance rate
Surface
Product.ai · Agent commerce · Truth Graph
Kernels
productai · agent-commerce · truth-graph
Outcomes
dev-integrate · agent-infra · brand-entity
Tier
Consequential
Alpha Team
Open to alpha members who want to take this on
Tooling
Claude Code or Co-work

Why we want this done

The Phase 3 Content briefing identifies agent-API recommendation observability as the highest-leverage uncovered measurement surface in 2026. Existing AEO vendors (Profound, Peec, Siftly, Omnia, AirOps, Scrunch, Brandlight) and agent-observability vendors (Datadog LLM Observability, Honeycomb BubbleUp, Speakeasy, Braintrust) measure execution, not the recommendation-decision moment. Mintlify reports 45.3% of all doc requests come from AI agents (Claude Code 199.4M + Cursor 142.3M = 95.6% of agent traffic). The first-mover window terminates in first-party absorption by Anthropic and OpenAI within 18-24 months (Stripe-precedent: model providers will not leave the recommendation analytics surface to third parties indefinitely). Layer 2 is the structurally distinctive layer because it requires deployment surface that pure observability vendors don't have. Open-source Layers 1 and 3, proprietary Layer 2 — Stripe-precedent: run the test, publish the data, make Product.ai the standard. The candidate ships Layer 2 instrumentation on Product.ai's MCP / API surface and writes the methodology paper that becomes the category-convener artifact.

Scope

  1. Layer 1 — developer-intent prompt battery (200-500 prompts, manually authored — not LLM-generated per Phase 3 axiom A2/E2). Each prompt is a realistic developer-asking-an-agent-for-help scenario relevant to Product.ai's commerce-knowledge surface
  2. Automated execution harness against the major frontier agent surfaces (Claude Code, Cursor, ChatGPT, Gemini) with statistical floor (10 samples per prompt, 7-day rolling, ±5% threshold)
  3. Layer 2 — API gateway agent attribution. Capture MCP session headers, user-agent strings, request-pattern fingerprints. Build the attribution pipeline that maps inbound API traffic to originating agent class
  4. Layer 3 — code-generation trace. Parse the code agents generate when they invoke Product.ai's APIs; classify correctness, token efficiency, error patterns
  5. Storage + dashboard — Datadog LLM Observability or Honeycomb (not custom time-series infrastructure per Phase 3 brief)
  6. Drift instrumentation — track citation/recommendation drift week-over-week with statistical confidence intervals
  7. Methodology paper — public-facing, Stripe-precedent, opens with the category gap (recommendation decision moment uncovered) and the empirical methodology Product.ai has pioneered

What success looks like

  • All three layers produce real data on at least three frontier agent surfaces
  • Statistical methodology applied — sample size, rolling window, signal-vs-noise threshold all defended
  • Drift signature captured for at least one prompt cluster
  • Methodology paper drafted (not necessarily published during trial) — concrete enough that an external reader can apply it
  • Layer 1 and Layer 3 are open-source-ready (the methodology and harness can be released)
  • Layer 2 is proprietary (Product.ai's deployment surface advantage)
  • One real surprise emerges in the trial — a recommendation pattern the team did not previously see

References

references.md
Content Phase 3 briefing axiom F-set (Agent-API Recommendation Observability), F4 (Statistical Methodology Load-Bearing), F5 (Citation Drift Structural Forcing Function)
AI Engineering Phase 3 briefing axiom C3 (MCP Bifurcation), F2 (Substrate-vs-Harness)
Mintlify usage data (45.3% of doc requests from AI agents)
Stripe-precedent open-source-and-publish playbook
Product.ai MCP server, SimplyCodes MCP server
Anthropic MCP specification for header conventions
Datadog LLM Observability documentation; Honeycomb BubbleUp documentation

Constraints

  • Claude Code or Co-work as primary substrate
  • Layer 1 prompts must be manually authored — LLM-generated context files perform net-negative (Phase 3 axiom A2/E2)
  • Statistical methodology is non-negotiable — 10 samples per prompt floor, 7-day rolling, ±5% threshold
  • Layer 2 deployment is proprietary; Layers 1 and 3 are open-source-ready
  • Self-hostable observability tooling preferred; Datadog/Honeycomb acceptable but not Braintrust or LangSmith
  • Privacy-respecting agent-attribution — user-agent strings and request fingerprints are inferential, not personally-identifying
  • 3-week duration cap
  • IP separation: API gateway and MCP servers are application-layer; methodology paths are out of scope
Apply
01

Read the Codex (10 min)

The operating principles we work by. If they resonate, the rest of this will land. Open the Codex →

02

12-minute video screen

Hireflix, async. Questions are calibrated to this project specifically.

03

Chemistry call (30-60 min)

Direct call with the CEO. Strategic alignment and mutual fit. No problem-solving exercise.

04

Project begins within 2-3 weeks

1099 contractor agreement, NDA, paid at your stated rate. Day 1 in Santa Monica.

Alpha Team members can take this project without the screen-and-call sequence. Reach out via the Alpha Team channel.