methodology

how ai race map builds its numbers · last updated 2026-05-08

scope

ai race map plots AI organisations by HQ and aggregates all-time AI funding by country. Each pin shows a single org; each country tier (S–D) is a function of the total funding of orgs with HQ inside it.

298 orgs
296 with HQ
225 with funding total
32 countries

inclusion

An organisation qualifies for the map if it satisfies any of:

The bar is uniform worldwide — a Berlin or Shanghai org clears or fails the same gate a San Francisco one does. Country-vs-country comparisons would be meaningless otherwise. New candidates surfaced by research, submissions, or rebuilds are evaluated against these three legs; orgs that fall below all three are excluded until they cross one.

What we exclude. Even when a company appears on a major AI list, we cut it if its core product isn't AI. Concretely:

ai_infra carve-out. Inference clouds and GPU-as-a-service (Baseten, Fireworks AI, FriendliAI, Together AI, Fal, Crusoe) stay on the map but don't count toward country tier totals — their funding reflects infra spend, not training spend. Datacenter operators are similarly excluded from tiers.

org types

Every org has a type: ai_native (private AI-focused company that raises VC), ai_infra (inference cloud or GPU-as-a-service whose product is AI compute, but who doesn't train models), oss_lab (nonprofit / research collective), sovereign (government-funded national lab), big_tech (diversified company with an AI arm), or datacenter (physical infra).
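Combining the type taxonomy with the ai_infra carve-out, a country's tier total reduces to a filtered sum over org types. A sketch under assumed field names (the map's actual schema may differ):

```python
# Types whose funding counts toward a country's tier total.
# ai_infra and datacenter are excluded per the carve-out above.
TIER_TYPES = {"ai_native", "oss_lab", "sovereign", "big_tech"}

def country_funding_totals(orgs: list[dict]) -> dict[str, float]:
    """Sum funding per HQ country, skipping carve-out types and
    orgs with no HQ or no known funding total."""
    totals: dict[str, float] = {}
    for org in orgs:
        if org.get("type") not in TIER_TYPES:
            continue
        country = org.get("hq_country")
        funding = org.get("funding_usd")
        if country is None or funding is None:
            continue
        totals[country] = totals.get(country, 0.0) + funding
    return totals
```

This is also why the org counts at the top differ: orgs without an HQ or a funding total stay on the map as pins but simply contribute nothing to any tier.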

ranking & color

Countries rank independently per metric (funding, compute, power). The choropleth uses three buckets: the rank-1 country pops in saturated color, the rest of the top 25% by rank glow in a medium tone, and everything below fades to a low-saturation fill. Each metric has its own palette — green for funding, amber for compute, cyan for power — so the active metric reads at a glance even in a static screenshot.
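The three-bucket assignment can be sketched as a function of a country's 1-based rank (bucket names are illustrative, not the app's actual style tokens):

```python
def color_bucket(rank: int, n_countries: int) -> str:
    """Assign a choropleth bucket from a country's 1-based rank:
    rank 1 -> saturated; rest of top 25% -> medium; rest -> low."""
    if rank == 1:
        return "saturated"
    if rank <= n_countries * 0.25:
        return "medium"
    return "low"
```

With the current 32 funding-ranked countries, ranks 2-8 land in the medium band and 9-32 in the low band.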

sources & trust order

Each org is merged from multiple datasets. When two sources disagree on a field (HQ, funding total, latest round), the higher-ranked source wins. Source rank:

  1. epoch-companies-2026-04-28: Multi-sourced rounds with confidence ratings. Authoritative for frontier-lab funding totals.
  2. manual-gaps-2026-05-08: Hand-curated entries with primary-source URLs (press releases, S-1s, official posts). Used where Epoch is silent.
  3. forbes-ai-50-2026-enriched: Forbes name list with best-effort HQ and funding figures sourced from public reporting.
  4. cbi-ai-100-2026: Name + category only. Listed orgs without other coverage are flagged for research; HQ backfilled separately.
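The field-level precedence rule ("higher-ranked source wins") can be sketched as a layered merge; record shapes here are assumed, and only the dataset ids come from the list above:

```python
# Trust order: lower index wins when sources disagree on a field.
SOURCE_RANK = [
    "epoch-companies-2026-04-28",
    "manual-gaps-2026-05-08",
    "forbes-ai-50-2026-enriched",
    "cbi-ai-100-2026",
]

def merge_org(records: dict[str, dict]) -> dict:
    """Merge per-source records for one org, field by field.
    For each field, the highest-ranked source holding a non-null
    value wins."""
    merged: dict = {}
    # Apply lowest-trust sources first so higher-trust values overwrite.
    for source in reversed(SOURCE_RANK):
        rec = records.get(source)
        if rec:
            merged.update({k: v for k, v in rec.items() if v is not None})
    return merged
```

Filtering out null values before the update is what lets a low-trust source contribute a field (say, a category) without a high-trust source's silence erasing it.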

HQ city/country comes from primary-source pages or, for orgs that lacked it, a separate manual backfill (hq-backfill-2026-05-08). Coordinates are looked up from a hand-maintained city table — orgs in a city not yet in the table show up in the data probe but not on the map.

compute & datacenters

Datacenters are a separate map layer alongside labs (toggle in the bottom-right). Each DC pin shows current power draw (MW) and derived FP16 compute (EFLOPS) — EFLOPS is computed as H100-equivalents × 989 TFLOP/s (NVIDIA H100 SXM dense FP16, no sparsity). The H100-equivalent count comes directly from Epoch.
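The EFLOPS derivation is a unit conversion on Epoch's H100-equivalent count (1 EFLOPS = 10^6 TFLOPS), which a few lines make explicit:

```python
H100_FP16_TFLOPS = 989  # NVIDIA H100 SXM dense FP16, no sparsity

def eflops_from_h100_equivalents(h100_eq: float) -> float:
    """Derived FP16 compute for a datacenter:
    H100-equivalents x 989 TFLOP/s, expressed in EFLOPS."""
    return h100_eq * H100_FP16_TFLOPS / 1e6
```

At this rate the map's 5059 EFLOPS total corresponds to roughly 5.1 million H100-equivalents across the 64 sites.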

64 datacenters
8.6 GW total power
5059 EFLOPS total FP16 compute
20 countries
  · epoch-frontier-datacenters-2026-05-08: Per-site power (MW), H100-equivalent compute, capital cost, owner, and known users for the largest AI training datacenters worldwide. Derived from satellite imagery + permit/regulatory filings, not press releases. Released under CC BY 4.0.
  · dc-coords-2026-05-08: Street-level lat/lng for each datacenter, geocoded from Epoch's published address. City-level fallback when the exact street address fails to resolve.

datacenter inclusion

A datacenter qualifies for the map if it satisfies any of:

Same uniform-bar logic as the org gate: a Narvik or Riyadh site clears or fails the same gate an Abilene one does. Sites below all three legs (generic colocation, crypto-mining campuses, sub-frontier inference POPs, "intent" announcements without site selection or named developer) are excluded.

Status. operational = at least partially live and consuming AI compute today. planned = announced or under construction with a site, developer, and groundbreaking confirmed but no current power draw.

Coverage caveat. Epoch's frontier list is explicitly US-focused — they cover the two-to-three largest sites for each major US frontier lab and estimate ~15% of global delivered AI compute. We layer non-US frontier sites on top of Epoch from a hand-curated global-data-centers source with primary-source URLs (press releases, EuroHPC pages, government announcements), so the same uniform gate applies to every country. Compute and MW figures from non-Epoch sources are self-reported; Epoch's satellite + permit derivation is treated as more authoritative when both cover the same site.

per-number footnotes

Every number you see in the side panel — total funding, latest round equity, valuation — carries a superscript that links to the source the figure came from. Click the number to read the original announcement; hover to see which snapshot the data was pulled from.

caveats

Funding totals are best-effort: undisclosed rounds, secondary-only sales, and non-equity capital (debt, compute commitments, partnership-style "investments" that aren't equity) are counted inconsistently across sources. We treat Epoch's totals as canonical for frontier labs and document anything else inline.

Country ranks shift whenever a single very large round closes (an OpenAI or xAI round can move the US funding total by tens of billions). Compute and power ranks are similarly sensitive to a handful of frontier datacenters coming online.