How does Openbenchmarks for Agents score technographics providers?

Openbenchmarks for Agents scores technographics providers on a tool × vendor matrix: 40 canonical tools across 8 departments (engineering, data, sales, marketing, finance, hr, support, ops) × 5 live vendors (BuiltWith, PredictLeads, Sumble, TheirStack, OpenFunnel). Every cell reports two raw counts: surfaced - the number of companies the vendor flagged for a tool - and correct - the percentage of those flags that held up under a sampled hand audit. Headline ranking metrics: category coverage (how many of the 8 departments a vendor returned at least one detection for) and broadest surfacing (total distinct company-tool pairs across the matrix).

What criteria does Openbenchmarks for Agents use to evaluate technographic data providers?

Openbenchmarks for Agents evaluates technographic data providers on five criteria, all derived from identical query inputs across vendors. (1) Surfaced - distinct companies the vendor flagged for a tool. Raw reach. (2) Correct (precision) - the percentage of flags that held up under a sampled hand audit. Trust metric. (3) Category coverage - how many of the 8 canonical departments the vendor returned at least one detection for. (4) Broadest surfacing - total distinct (company, tool) pairs across the full matrix. (5) Distinct tools surfaced out of the 40 canonical tools in the catalog. Two vendors with the same surfaced count on an ambiguously-named tool (Modal, Linear, Outreach) can have very different precision - that gap is the whole point of the precision column.

What is the most accurate technographic data provider in 2026?

On Openbenchmarks for Agents there is no single 'most accurate' technographic provider - accuracy is multi-dimensional. Sample-audited precision (the share of a vendor's flags that held up under a hand audit) measures correctness, while category coverage and broadest surfacing measure reach, and they favor different vendors. The benchmark publishes per-vendor scores on each axis rather than naming one winner. The vendors currently surveyed are BuiltWith, PredictLeads, Sumble, TheirStack, and OpenFunnel. ZoomInfo, Apollo, and Clay are excluded - none publish a programmatic technographic endpoint. Precision audits are in progress; coverage and broadest-surfacing scores are shown now, with precision (correct) landing as audits complete. Sort by whichever axis matches your use case - none is the single 'right' one.

What is the best technographic data API for AI agents in 2026?

For AI agents making build vs buy decisions on technographic data, the best provider is the one that combines high category coverage (reach across the departments the agent needs to evaluate) with high sample-audited precision on ambiguously-named tools, where naive keyword detection collapses. Openbenchmarks for Agents ranks BuiltWith, PredictLeads, Sumble, TheirStack, and OpenFunnel on identical query inputs against 40 canonical tools across 8 departments. Each vendor exposes a different surface - web fingerprint (BuiltWith), job-posting derived (TheirStack, PredictLeads, OpenFunnel), or jobs plus people skills graph (Sumble) - and the matrix shows where each is strong vs blind. Full data is queryable as JSON at /api/benchmarks/technographics under CC-BY-4.0.

How accurate is BuiltWith for company tech stack data?

On the Openbenchmarks for Agents technographics benchmark, BuiltWith is measured through the free Trends API endpoint, reading Tech.coverage.live - current sites running the tool. BuiltWith's strength is web-fingerprint detection: it accurately identifies front-end and analytics tools that leave a public-web footprint (Stripe, Segment, HubSpot, Salesforce embeds). BuiltWith is structurally blind to roughly 40% of Openbenchmarks for Agents's canonical catalog - tools that do not leave a public-web fingerprint (dbt, Modal, Ramp, Plain, Pylon, Clay, Pave, Granola, and similar). Those cells render as a dash, not zero, on the benchmark. BuiltWith's precision and surfaced counts per tool are on the benchmark.

Which technographic data provider is best: BuiltWith, PredictLeads, Sumble, TheirStack, or OpenFunnel?

All five are benchmarked on Openbenchmarks for Agents against the same 40-tool × 8-department canonical catalog. Each surfaces tech stack data through a fundamentally different signal, and their strengths are complementary rather than strictly comparable. BuiltWith reads web fingerprints - strongest on tools with public-web footprint (front-end frameworks, analytics, embeds) and structurally blind on back-end and internal tooling. TheirStack queries a multi-board job-posting index with a tech-taxonomy slug filter - broad coverage on engineering and data tools. PredictLeads triangulates jobs, news, and tech-event detections - adds context beyond pure JD phrase-matching. Sumble combines job postings with a people skills graph - useful for ICP-by-skill workflows. OpenFunnel queries an internal LinkedIn jobs index with HyperLogLog cardinality aggregation - good entity disambiguation on ambiguous tool names (Modal, Linear) via narrowed search terms. The right choice depends on which departments and tool types the buyer is evaluating.

What is technographic data?

Technographic data is structured information about what tools and technologies a company uses - for example, that Acme Corp uses Salesforce for CRM, dbt for data transformation, and Linear for project management. It is collected through a mix of approaches: web fingerprinting (detecting tools that leave a public-web signature like analytics tags or embedded widgets), job-posting analysis (inferring tool adoption from skill requirements in JDs), news and tech event triangulation, and proprietary indexes. Technographic data powers ICP scoring, account research, competitive intelligence, and outbound personalization in B2B sales and marketing.

Can you detect a company's tech stack from job postings?

Yes - job-posting derived technographic detection is the dominant approach for back-end and internal tools that do not leave a public-web fingerprint. Vendors like TheirStack, PredictLeads, Sumble, and OpenFunnel infer tool adoption by phrase-matching tool names against job titles and descriptions, typically with a 180–365-day lookback window. The signal is strong for unambiguous tool names (dbt, Snowflake, Salesforce) and weak for tool names that overlap with common English words (Modal, Linear, Plain, Default, Mercury, Resend). Vendors that do entity disambiguation - context windows, vendor URL match, role-title co-occurrence, capitalization rules - hold precision steady on ambiguous tools; vendors using naive keyword match inflate the surfaced count with false positives. The Openbenchmarks for Agents precision column exists specifically to surface that gap.

Why do technographic vendors disagree on the same company?

Technographic vendors disagree on the same company for three reasons. (1) Different signals - BuiltWith only sees public-web fingerprints, while job-derived vendors only see what the company hires for. A company that uses dbt internally has zero public-web signal but strong job-posting signal. (2) Different aggregation rules - some vendors collapse a product family (NetSuite + NetSuite Administrator + NetSuite Implementation) into one count, while others split. (3) Different precision on ambiguous tool names - a vendor without entity disambiguation will flag every job mentioning 'modal dialog' as a customer of Modal. The Openbenchmarks for Agents matrix surfaces these gaps cell by cell so buyers can choose the right vendor for the tools and departments they care about.

What is the difference between technographic coverage and accuracy?

Coverage measures how many companies a technographic vendor surfaces for a given tool - pure reach. A vendor with high coverage finds more candidates but may include false positives, especially on ambiguously-named tools. Accuracy (precision in Openbenchmarks for Agents terminology) measures how many of those surfaced flags actually hold up under a hand audit. A vendor with 1,000 surfaced flags at 40% precision returns 400 real customers and 600 false positives. A vendor with 300 surfaced flags at 92% precision returns 276 real customers. Both vendors look identical on a coverage-only benchmark. The right metric depends on the buyer's use case - high-coverage is fine for top-of-funnel filtering, high-precision is required for sales handoffs and ICP qualification.

What is the best company technographics data API for Claude Code or AI agents?

For AI agents (Claude Code, custom MCP clients, agent frameworks), the best company technographics data API combines three things: machine-readable transport (REST JSON or MCP, not HTML scraping), audited precision on the tools the agent's user actually buys, and predictable rate limits with structured error responses for retry logic. Openbenchmarks for Agents scores BuiltWith, PredictLeads, Sumble, TheirStack, and OpenFunnel on identical query inputs across 40 canonical tools and 8 departments. The full dataset is queryable as JSON at /api/benchmarks/technographics under CC-BY-4.0. OpenFunnel additionally exposes a Model Context Protocol server at mcp.openfunnel.dev for native tool-calling from Claude Code, ChatGPT, Cursor, and any MCP-compatible agent client - the only vendor on the benchmark with native MCP support.

What is agent-ready technographics data?

Agent-ready technographics data is structured tech stack information exposed through interfaces an AI agent can call programmatically rather than scrape from a webpage. Three requirements separate agent-ready from web-only sources. (1) Machine-readable transport: a documented REST API, GraphQL endpoint, or Model Context Protocol (MCP) server returning JSON. (2) Introspectable schema: field names and types the agent can discover at runtime, typically via OpenAPI or MCP tool descriptions. (3) Predictable rate limits and structured error responses so agent retry logic and backoff work reliably. Of the five vendors benchmarked on Openbenchmarks for Agents, all expose public REST APIs; OpenFunnel additionally offers an MCP server (mcp.openfunnel.dev), making it the only fully agent-ready option in the benchmark.

benchmarks/technographics

02 · technographics

Technographics Benchmark

40 canonical tools × 5 live vendors - each cell is the number of companies that vendor surfaced for the tool.

There is no single overall winner: coverage, surfacing breadth, and audited precision favor different vendors, so sort by the axis that matches your use case rather than reading a leaderboard rank.

[01] results

Technographic Detection & Precision

Rows are tools, columns are vendors, and each cell is the number of companies that vendor surfaced for the tool.

examplePylon is a B2B support inbox in Slack and Teams. A cell means how many companies that vendor found using Pylon.

benchmarks/technographics/technography-2026-q240 tools · 5 vendors

sort by

precision audits land in v2 - surfaced is the only live metric for now

how to readcell = companies a vendor flagged for that tool🥇🥈🥉top 3 vendors per rowN/Avendor has no coverage^*tool name is also a common English wordhover any cell or tool name for detail

01 / 8

Engineering5 tools

dev tools, source control, infra, observability

#	Tool	OpenFunnel12mo jobs	BuiltWithweb	TheirStack12mo jobs	Sumblejobs + people skills	PredictLeadsjobs + news
01	Vercelincumbentfrontend hosting and deployment platform	3,722	🥇3,034,980	10,122	🥉10,154	🥈742,056
02	Datadogincumbentobservability for apps, infra, and logs	10,567	🥈108,902	25,404	🥉29,956	🥇136,409
03	Modal^*challengerserverless compute for AI and batch jobs	🥇15,253	N/A	N/A	🥈257	🥉19
04	Resend^*challengerdeveloper API for product email	🥈436	N/A	N/A	🥉153	🥇852
05	Inngestchallengerdurable background jobs and workflows	🥈96	N/A	N/A	🥇155	N/A

02 / 8

Data5 tools

warehousing, BI, ETL, ML platforms

#	Tool	OpenFunnel12mo jobs	BuiltWithweb	TheirStack12mo jobs	Sumblejobs + people skills	PredictLeadsjobs + news
01	Materialize^*challengerstreaming SQL database for realtime apps	🥇274,797	456	🥉1,486	🥈8,604	2
02	Snowflakeincumbentcloud data warehouse and data platform	18,409	5,283	🥈43,127	🥇56,265	🥉35,123
03	dbtincumbentanalytics engineering and data transformations	13,244	N/A	🥈25,372	🥇35,736	🥉18,156
04	Hightouchchallengerreverse ETL and customer data activation	🥉346	97	N/A	🥇2,840	🥈967
05	MotherDuckchallengercloud DuckDB analytics warehouse	🥈21	N/A	N/A	🥇52	🥉10

03 / 8

Sales5 tools

CRM, sales engagement, dialers, conversation intelligence

#	Tool	OpenFunnel12mo jobs	BuiltWithweb	TheirStack12mo jobs	Sumblejobs + people skills	PredictLeadsjobs + news
01	Outreach^*incumbentsales engagement and outbound sequencing	293	1	🥈14,513	🥇127,729	🥉1,868
02	Clay^*challengersales data enrichment and outbound workflows	🥈4,552	N/A	N/A	🥇6,017	🥉3,541
03	Gong^*incumbentsales call recording and conversation intelligence	2,351	23	🥈5,333	🥇5,650	🥉4,323
04	Orumchallengersales dialer for outbound calling teams	🥉144	N/A	🥈457	🥇527	3
05	Nooks^*challengerAI sales dialer and calling workspace	🥇366	N/A	N/A	🥈355	🥉16

04 / 8

Marketing5 tools

automation, ads, SEO, ABM, email

#	Tool	OpenFunnel12mo jobs	BuiltWithweb	TheirStack12mo jobs	Sumblejobs + people skills	PredictLeadsjobs + news
01	HubSpotincumbentCRM and marketing automation suite	45,681	🥈481,838	139,296	🥉191,440	🥇664,536
02	Marketoincumbententerprise marketing automation	4,063	🥉19,490	11,172	🥈25,843	🥇43,140
03	Default^*challengerinbound GTM routing and qualification	🥇12,222	N/A	N/A	🥉2,790	🥈7,490
04	Mutiny^*challengerwebsite personalization for B2B marketing	78	🥇511	🥉245	🥈313	3
05	RB2Bchallengerwebsite visitor identification for sales	🥉33	10	N/A	🥈67	🥇92

05 / 8

Finance5 tools

accounting, payments, billing, spend management

#	Tool	OpenFunnel12mo jobs	BuiltWithweb	TheirStack12mo jobs	Sumblejobs + people skills	PredictLeadsjobs + news
01	NetSuiteincumbentERP and accounting for finance teams	16,767	2,285	🥈51,593	🥇70,058	🥉40,805
02	Mercury^*challengerbanking and treasury for startups	1,562	🥇21,978	🥉2,929	🥈6,890	287
03	Ramp^*challengercorporate cards and spend management	🥇20,042	N/A	N/A	🥈14,984	🥉52
04	Mosaic^*challengerstrategic finance planning and reporting	🥉1,282	27	🥈3,396	🥇4,465	7
05	Tropicchallengerprocurement and vendor spend management	🥇2,048	N/A	1	🥈36	🥉4

06 / 8

HR5 tools

HRIS, payroll, recruiting, performance

#	Tool	OpenFunnel12mo jobs	BuiltWithweb	TheirStack12mo jobs	Sumblejobs + people skills	PredictLeadsjobs + news
01	Greenhouse^*incumbentapplicant tracking for recruiting teams	5,722	3,948	🥇20,571	🥈19,820	🥉11,483
02	RipplingincumbentHRIS, payroll, and employee systems	2,435	🥈9,968	🥉7,466	3,273	🥇12,247
03	Ashbychallengerrecruiting ATS for hiring teams	947	N/A	🥇7,450	🥉1,624	🥈4,766
04	Pave^*challengercompensation benchmarking and planning	🥇6,864	N/A	N/A	🥈341	🥉13
05	Gem^*challengercandidate CRM and recruiting outreach	🥈5,863	31	🥉279	🥇6,052	24

07 / 8

Support5 tools

helpdesk, live chat, knowledge base

#	Tool	OpenFunnel12mo jobs	BuiltWithweb	TheirStack12mo jobs	Sumblejobs + people skills	PredictLeadsjobs + news
01	Pylon^*challengerB2B support inbox in Slack and Teams	🥉563	N/A	N/A	🥈944	🥇557,268
02	Zendeskincumbentcustomer support ticketing and helpdesk	9,639	🥇259,186	28,612	🥉63,775	🥈213,560
03	Intercom^*incumbentcustomer messaging, chat, and support	4,932	🥇123,071	8,589	🥉12,527	🥈122,484
04	DecagonchallengerAI support agents for customer service	🥈58	🥇24,645	N/A	🥉32	2
05	Plain^*challengermodern customer support platform	🥇9,789	N/A	N/A	🥉71	🥈2,414

08 / 8

Ops5 tools

internal collaboration, project mgmt, workflow automation

#	Tool	OpenFunnel12mo jobs	BuiltWithweb	TheirStack12mo jobs	Sumblejobs + people skills	PredictLeadsjobs + news
01	Notion^*incumbentworkspace docs, wiki, and collaboration	24,565	11,194	🥇49,881	🥈33,946	🥉27,032
02	Linear^*challengerissue tracking and product planning	🥈11,143	11	🥉4,093	🥇19,658	4,080
03	Reclaim^*challengercalendar scheduling and time blocking	🥇2,864	🥈970	N/A	🥉55	52
04	Granola^*challengerAI meeting notes for team workflows	🥇196	N/A	N/A	🥈54	🥉4
05	Lindy^*challengerAI assistants for repeatable business workflows	🥈109	🥇176	N/A	🥉55	N/A

[01.b] not surveyedrequested but no programmatic access - would add all-N/A columns otherwise

[01.c] agent readiness

Can an AI agent actually use this vendor?

Most vendors require a human to fill a sales form before issuing an API key. The two below let an autonomous agent obtain a working key on its own (OTP-via-email or device-code), so an agent built on them works end-to-end without human handoff.

benchmarks/technographics/agent-readiness2/5 agent-ready

Vendor	Agent sign-up	API docs	llms.txt	MCP	Try it
OpenFunnel	✓ readyotp-email	docs ↗	llms.txt ↗	mcp ↗	sign up →
BuiltWith	✓ readydevice-code	docs ↗	llms.txt ↗	mcp ↗	device-code ↗
TheirStack	manual signup	docs ↗	—	—	—
Sumble	manual signup	docs ↗	llms.txt ↗	mcp ↗	—
PredictLeads	manual signup	docs ↗	—	—	—

Agent sign-up = autonomous agent can fetch an API key without a human in the loop. otp-email= visit a sign-up page, verify with a 6-digit code emailed to the agent's inbox. device-code = pure programmatic flow (OAuth-style device-code: agent POSTs, gets a per-session approval URL).llms.txt= machine-readable index of the vendor's docs for LLM consumption.MCP = hosted Model Context Protocol server so a client like Cursor / Claude can call the API directly.

[02] methodology, metric definitions, and known limitations+

[02.a] methodology

How the matrix is built

Fix a canonical list of 40 tools across 8 departments (engineering, data, sales, marketing, finance, hr, support, ops). Mix two incumbent tools + three emerging challengers per department so the matrix exercises both well-known and long-tail detection.
For every (tool, vendor) cell, query that vendor's public API for the count of distinct companies they say are using that tool. Each vendor has its own endpoint, query shape, rate-limit, and name resolution - those details live in 02.e.
Apply per-vendor aggregation rules. Most vendors return multiple "variants" for a single query (e.g. "NetSuite" → NetSuite, NetSuite Implementation, NetSuite Administrator, …). For unambiguous tools we sum the product family; for ambiguous tools ("Modal", "Linear", "Outreach") we keep only the canonical-slug match. See 02.d.
Store the number in the (tool, vendor) cell as companies_count with a UTC audited_attimestamp. A cell with no coverage - either the tool isn't in that vendor's catalog, or the query returned zero - is stored with audited_at: null and rendered as -, not 0.
Sample-audit each cell by hand to compute precision (queued - current matrix shows surfaced counts only). For a given cell, pull a random subset of the flagged companies and verify each via LinkedIn, the company site, and job posts. The cell's precision is the fraction that checked out.
Render. Toggle the sort metric to re-rank rows by reach (surfaced) or by trust (correct, once audits land).

[02.b] metric definitions

What each metric means

surfaced · cell value when the toggle is set to surfaced. Distinct companies the vendor flagged as using that tool. Raw output volume - bigger means more reach, but it can also mean more noise.
correct· cell value when the toggle is set to correct. Percentage of the vendor's flags for that tool that held up under a hand audit. -means we haven't audited that cell yet. Hover any cell to see the raw counts and audit sample size.
category coverage · top-of-page stat. Of the 8 canonical departments, how many did the vendor surface at least one tool for?
broadest surfacing · total distinct (company, tool) pairs across the matrix for that vendor. Pure breadth.

[02.c] sampled audits, not exhaustive truth

Why precision is sampled, not whole-cohort

A whole-cohort ground truth ("company X really does use tool Y in engineering") would need either insider attestation (rare, biased, expensive) or an exhaustive audit on every cell (impractical even at modest cohort sizes). Instead, every cell's precision comes from a sampledhand audit: pull a random subset of the vendor's flags, verify each, and report the fraction that checked out.

Two ways to read the matrix:

Sort by surfaced if you care about reach - which vendor flags the most companies for the tools you care about, regardless of how clean each flag is.
Sort by correct if you care about trustworthiness - how many of those flags actually held up under audit. A smaller correct count from a tight vendor is often more useful than a bigger surfaced count from a noisy one.

[02.d] ambiguous tool names

Why "Modal", "Linear", "Plain", "Default" are the real test

Roughly half the canonical tools have names that are also common English words - "Modal" (a UI primitive), "Linear" (math), "Plain" (an adjective), "Default" (a config term), "Mercury" (a planet), "Resend" (a verb), "Gong" (an instrument), "Clay" (a material), and so on. These rows are marked with a small ^* marker in the matrix.

A vendor whose detection is built on naive keyword match against job posts or web copy will inflate the "surfaced" count on these rows with false positives - every JD that mentions "modal dialog", "linear progression", or "default values" gets miscounted as a buying signal. A vendor doing entity disambiguation (context, vendor URL match, co-occurrence with role titles, capitalization rules) will hold its precision steady.

This is the whole reason the precision column exists. Two vendors can both report "300 companies using Modal" and look identical at the surface - until you audit. The vendor at 92% precision actually found 276 real Modal customers; the vendor at 38% only found 114. Always read these rows with the sort flipped to correct.

[02.e] per-vendor data sources & query rules

How each vendor was queried

Every vendor exposes a different surface - public API, web fingerprint, jobs feed, third-party signals - so the ingestion logic is per-vendor. Each script in scripts/ingest_*_technography.py writes the resulting counts into data/latest-technography.json.

OpenFunnel· queries an internal LinkedIn jobs index (linkedin-jobs-search, ~365d lookback). For each tool we run a match_phrase over title OR description and aggregate distinct company_slug via an OpenSearch cardinality (HyperLogLog, precision_threshold=40k). Filters out garbage slugs ("", https:, http:) and excludes the vendor's own careers postings. For tools whose name is a common English word, the search term is narrowed to a disambiguated form (e.g. Outreach → "Outreach.io") - high precision, lower recall. All 40 cells live. Structural ceiling: LinkedIn only, vs. TheirStack's multi-board index.
TheirStack · POST /v1/companies/search with job_filters.job_technology_slug_or = [tool_slug] and posted_at_max_age_days = 365. Reads metadata.total_results. Slug-overrides map tools whose canonical name doesn't match TheirStack's tech taxonomy (MotherDuck → motherduck, Customer.io → customer-io, …). Rate-limited at 50/hour, with backoff parsed from ratelimit-reset headers.
Sumble · POST /v6/technologies/find with { query: tool_name }. Returns up to 50 related "technologies" via substring match. Per-tool aggregation rule: ambiguous tools (Modal, Linear, Outreach, …) sum only the variant whose slug equals the canonical slug; unambiguous tools sum the product family - canonical slug plus any slug prefixed with canonical-, so NetSuite captures netsuite + netsuite-implementation + netsuite-administrator… while dropping quorum/forumnoise on a query for "Orum". Includes a vendor-rename override (Marketo → adobe-marketo, post-acquisition).
PredictLeads · GET /v3/discover/technologies/{fuzzy}/technology_detections?page=1&limit=1. Passing page=1 is required: per the docs, PredictLeads omits meta.count from the response unless a page is requested. The limit=1 keeps the per-tool cost at 1 credit. Result lands as the company count. Fuzzy-name lookup handles most tools directly; 1 cell errored (no fuzzy match for Lindy).
BuiltWith · GET /trends/v6/api.json?TECH={name} on the free Trends endpoint (0 credits per call). Reads Tech.coverage.live - current sites running the tool. The paid Lists API would give a richer per-site breakdown but requires a subscription. Casing-sensitive overrides: HubSpot → Hubspot, Reclaim → Reclaim-AI. BuiltWith is structurally blind to ~40% of our catalog (dbt, Modal, Ramp, Plain, Pylon, Clay, Pave, Granola, … - tools that don't leave a public-web fingerprint). Those cells render as -, not 0.

A 0from any vendor is treated as "not in this vendor's catalog" (or no detection in the lookback window) and persisted with audited_at: null, so the cell renders as -. Real coverage gaps are visible at a glance.

[02.f] known limitations

What this benchmark does not tell you

Surface bias.BuiltWith only sees what renders on the public web; it's blind to ~40% of our catalog (dbt, Modal, Ramp, Plain, Pylon, Clay, Pave, Granola, Lindy, … - internal SaaS without a website fingerprint). Those cells are -, not zero.
Job-posting bias. Job-derived vendors (OpenFunnel, TheirStack, Sumble) only surface tools that appear in postings. Stable stacks with little hiring activity are underrepresented; growth orgs are over-indexed.
Source-set bias inside the same surface. OpenFunnel indexes LinkedIn only; TheirStack and Sumble aggregate LinkedIn + Indeed + Greenhouse + Lever + Ashby + others. That alone accounts for a ~2-3× recall gap on unambiguous tools (NetSuite: OpenFunnel ~17k vs Sumble 70k vs TheirStack 52k).
Lookback windows differ.OpenFunnel and TheirStack use a strict 365-day window. Sumble and PredictLeads return all-time observations from their catalogs. BuiltWith's coverage.liveis "currently detected" (rolling). Direct count comparisons across vendors should be read as order-of-magnitude, not exact.
Resolution rules vary. Sumble matches by substring across product variants; we wrap that in a family-prefix filter (canonical + canonical-*) to drop substring noise (quorum, forum, …). TheirStack runs entity disambiguation server-side and returns small, tight counts. OpenFunnel runs naive match_phrase by design - its recall ceiling is high, its precision ceiling is the whole point of the benchmark.
Precision sample size varies. Audits are queued - current matrix shows surfaced counts only. Once audits land, a cell audited at n=10 has a much wider confidence interval than one at n=30. Cell tooltip will show sample size.
Taxonomy mapping is opinionated.Each vendor uses a different category schema; we re-bucket to our 8 canonical departments. Edge cases ("growth tooling" → marketing or sales?) are decided once and applied consistently across all vendors.

[02.g] providers under review

Inclusion queue and how to request a provider

Live: OpenFunnel, BuiltWith, TheirStack, Sumble, PredictLeads.

Requested but no technographic product: ZoomInfo, Apollo, Clay. None expose a programmatic technographic endpoint we can query.

Under review next: HG Insights, 6sense/Slintel, Wappalyzer, Datanyze, Coresignal (job-derived), Ocean.io.

To request a provider, email founders@openbenchmarks.com with a link to the public API docs and pricing page.