Scrutica
Every value traces to a primary source, carries an authority tier (T1–T4), and documents its full derivation chain. Parameters are adjustable in-browser; changing an assumption propagates through the entire estimation pipeline so you can read off the sensitivity of each output to each input.
This page documents how every number on Scrutica was produced. Each section covers one estimation methodology, its input data, the confidence we assign to the result, and the main sources of uncertainty. You do not need to read the formulas to follow the methodology; the plain-language explanations stand on their own.
The methodology applies across ~4,500 facilities in 128 countries, ~18,600 bilateral supply-chain edges (FactSet Revere, WRDS-FactSet, CSET ETO, WRDS Compustat, and chip-deployment chains), 34 sovereign AI programs, 3,349 export-control designations, and 13 GPU/accelerator generations plus two announced placeholders (R100 Rubin, Maia 200). Ten scheduled cron routes feed the tables; staleness is computed per record under four TTL classes (regulatory 1–6h, market 6–24h, structural 24–72h, narrative 24–168h).
The methodology applies across roughly 4,500 facilities in 128 countries, the ~18,600 supplier–customer relationships that connect them (drawn from financial-research databases and SEC filings), 34 sovereign AI programs, and 3,349 export-control designations. The site refreshes itself on a fixed schedule — ten scheduled jobs spread across each week — and every record carries its own freshness window: regulatory updates age within hours; structural facility data within days.
| Source & Description | Authority Tier | Update Cadence | Coverage |
|---|---|---|---|
| Primary source for US-listed company financials. Capex, revenue, and subsidiary disclosures from regulatory filings. | T1: Government / Filing | Quarterly (10-Q, 10-K) | US-listed companies: financials, capex, segment data, Exhibit 21 subsidiaries |
| US Department of Commerce list of entities subject to export license requirements. Primary source for export control status. | T1: Government / Filing | Monthly (Federal Register) | 3,400+ entities under US export controls |
| Government Accountability Office reports and SIA tracker for CHIPS Act funding recipients, amounts, and facility status. | T1: Government / Filing | Quarterly (SIA); annual (GAO) | 52 CHIPS Act funded projects across 35 companies |
| Peer-reviewed dataset of GPU clusters and frontier data center facilities. City-level location data; no coordinates for individual clusters. | T2: Peer-Reviewed | Monthly | 1,065 GPU clusters worldwide; 200 frontier data centers tracked in the Epoch frontier-DC dataset |
| Georgetown CSET analysis of semiconductor supply chain relationships and market share data. | T2: Peer-Reviewed | Annual | Semiconductor supply chain topology: equipment, foundry, packaging relationships |
| Quarterly earnings call transcripts with CFO revenue attribution by sovereign AI customer country. | T2: Peer-Reviewed | Quarterly | Sovereign AI revenue, customer disclosures, GPU shipment context |
| Private capital transaction database. Deal sizes undisclosed ~45% of the time; relationship data always available. | T3: Industry / Analyst | Daily | Private capital deals: DC investments, AI infrastructure funding rounds |
| Capital IQ corporate event feed. NLP extraction yields ~534 high-confidence facility/compute events from 15,810 unique events. | T3: Industry / Analyst | Daily | Corporate announcements: facility expansions, capacity additions, GPU deployments |
| Grid Queue Filings (PJM, NYISO): Regional transmission organization interconnection queue data. Early-stage intelligence on planned facilities; queued capacity is not the same as operational capacity. | T3: Industry / Analyst | Monthly | 6,093 PJM interconnection queue entries (MW, status, applicant); NYISO and CAISO ingest planned post-launch |
| Sovereign AI Research Files: Compiled from government announcements, NVIDIA earnings transcripts, CRS reports, and press coverage. All FX conversions documented. | T3: Industry / Analyst | As published | 34 country sovereign AI programs: announced investment, government vs. private split, deployed-vs-announced reconciliation |
| Press Reports / Industry Analysis: Secondary source. Used only when no primary source available. Always marked is_estimated: true. | T4: Press / Secondary | As published | Facility announcements, capacity rumors, deployment speculation |
Three independent estimation paths for facility compute capacity: hardware-based (GPU count + spec sheet), power-based (thermal envelope inversion), cost-based (capex disaggregation). Each produces a point estimate with propagated estimation bounds.
How much computing power does a facility have? Three different ways to answer that question, depending on what data is available. When we know the exact hardware, the estimate is tight; when we only know the investment amount, the range widens accordingly.
Hardware: narrow bounds. Power: medium bounds. Cost: widest bounds.
Four sub-indices tracking $/petaFLOP-day across procurement models: Cloud Spot, Cloud Reserved, On-Premises, and Sovereign. Derived from published hourly rates and Epoch AI hardware specs (dense BF16 TFLOP/s, without 2:4 structured sparsity).
What does a standard unit of AI training compute cost, and how does that cost vary by provider, geography, and procurement model? The same computing power can cost 3–10× more depending on where and how you buy it; that spread determines who can afford frontier model training.
Cloud pricing: Tier 1 (provider APIs). On-prem TCO: Tier 3 (modeled).
BFS propagation model over a weighted supply chain graph. Configurable shock severity, propagation delay, and per-category substitutability decay rates. Edge weights from FactSet Revere bilateral relationships.
If a key supplier is disrupted, who loses access and how badly? The simulation traces disruption through the AI supply chain; the output identifies which dependencies are hardest to replace and which countries face the greatest concentration risk.
Topology: Tier 2 (CSET, SEC filings). Decay rates: Tier 3 (expert assessment).
Every record carries data_source, source_url, and is_estimated. No value exists without attribution.
Every number on Scrutica links back to where it came from. You can always trace a value to its original source document, and every estimated value is labeled as such.
Append-only compute_capacity_snapshots track when values change. Every update preserves the previous state and records when the new value was learned.
When a value changes, we keep the old one. This means you can see how a facility’s reported capacity has evolved over time and when each update was recorded, not just the latest figure.
Values are set to null rather than guessed. A missing value is honest; a fabricated one is worse than useless.
If we do not have reliable data for a value, we leave it blank rather than fill it with a guess. A gap in the data is more useful than a confident-sounding number with no basis.
Three independent estimation paths, each usable when different input data is available. Hardware path takes GPU count and spec sheet directly; power path inverts the thermal envelope; cost path disaggregates capex. Estimation bounds widen as the input data becomes more indirect.
A facility’s compute capacity determines what models can be trained there and whether regulatory thresholds (such as the EU AI Act’s 10²⁵ FLOP trigger) are met. Three estimation approaches accommodate different levels of available data; when less is known, the uncertainty range widens accordingly.
When GPU count and model are known. Narrowest estimation bounds. When the number and type of GPUs in a facility are known. Produces the most precise estimate.
When only power capacity is known. Derives GPU count from thermal envelope. When only the facility's electricity capacity is known. The estimate works backward from power consumption to GPU count.
When only investment amount is known. Widest estimation bounds. When only the total investment amount is known. Produces the widest range of uncertainty because hardware costs vary.
When GPU count and model are known. Narrowest estimation bounds. When the number and type of GPUs in a facility are known. Produces the most precise estimate.
| Parameter | Default | Range | Source |
|---|---|---|---|
| Interconnect Efficiency: Fraction of peak throughput achievable across multi-node interconnect. 0.85 for NVLink, 0.7 for InfiniBand/Ethernet backend. Ethernet surpassed InfiniBand in AI back-end network market share in 2025 (Dell'Oro Group); UEC 1.0 spec released June 2025. Meta validated Ethernet RoCE at 24K-GPU scale for LLaMA 3. | 0.85 | 0.60–0.95 | Epoch AI |
| Model FLOP Utilization (MFU): Fraction of theoretical peak FLOPs achieved during training. Typical range 30–50%. Chinchilla and PaLM reported 0.46–0.57 (depending on model size); Meta reported 38–43% for LLaMA 3 405B at 16K-GPU scale. | 0.40 | 0.20–0.65 | Epoch AI |
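As a sketch of how these parameters combine on the hardware path (the function name and example are ours; `peak_tflops` is the dense BF16 figure from the Epoch AI hardware dataset):

```python
def hardware_path_flop_per_day(gpu_count: int,
                               peak_tflops: float,
                               interconnect_eff: float = 0.85,
                               mfu: float = 0.40) -> float:
    """Delivered training FLOP per day for a known GPU fleet.

    peak_tflops is dense BF16 TFLOP/s per GPU (no 2:4 sparsity),
    matching the Epoch AI ml_hardware convention used site-wide.
    """
    sustained_flops = gpu_count * peak_tflops * 1e12 * interconnect_eff * mfu
    return sustained_flops * 86_400  # seconds per day

# Illustrative: 16,384 H100 SXM (989.5 dense BF16 TFLOP/s)
# -> hardware_path_flop_per_day(16_384, 989.5) ~ 4.8e23 FLOP/day
```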
When only power capacity is known. Derives GPU count from thermal envelope. When only the facility's electricity capacity is known. The estimate works backward from power consumption to GPU count.
| Parameter | Default | Range | Source |
|---|---|---|---|
| GPU Fraction of IT Load: Fraction of IT power draw attributable to GPU accelerators vs. CPUs, storage, networking. | 0.70 | 0.40–0.90 | Industry estimates |
| Interconnect Efficiency: Fraction of peak throughput achievable across multi-node interconnect. 0.85 for NVLink, 0.7 for InfiniBand/Ethernet backend. Ethernet surpassed InfiniBand in AI back-end network market share in 2025 (Dell'Oro Group); UEC 1.0 spec released June 2025. Meta validated Ethernet RoCE at 24K-GPU scale for LLaMA 3. | 0.85 | 0.60–0.95 | Epoch AI |
| Model FLOP Utilization (MFU): Fraction of theoretical peak FLOPs achieved during training. Typical range 30–50%. Chinchilla and PaLM reported 0.46–0.57 (depending on model size); Meta reported 38–43% for LLaMA 3 405B at 16K-GPU scale. | 0.40 | 0.20–0.65 | Epoch AI |
| Power Usage Effectiveness (PUE): Ratio of total facility power to IT equipment power. Lower is more efficient. Google: 1.10, industry average ~1.20. | 1.20 | 1.05–1.60 | Industry average; Google reports 1.10 |
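A corresponding sketch of the power-path inversion (defaults are the table values above; the 700 W per-GPU draw is an illustrative H100 SXM TDP, not a site parameter):

```python
def power_path_gpu_count(facility_mw: float,
                         pue: float = 1.20,
                         gpu_fraction_of_it: float = 0.70,
                         gpu_tdp_kw: float = 0.70) -> int:
    """Invert a facility's thermal envelope into an implied GPU count."""
    it_load_mw = facility_mw / pue                    # strip cooling/overhead
    gpu_load_kw = it_load_mw * gpu_fraction_of_it * 1_000
    return int(gpu_load_kw / gpu_tdp_kw)

# Illustrative: a 100 MW facility
# -> 100 / 1.20 * 0.70 = 58.3 MW of accelerator draw
# -> ~83,000 H100-class GPUs, which then feed the hardware path.
```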
When only investment amount is known. Widest estimation bounds. When only the total investment amount is known. Produces the widest range of uncertainty because hardware costs vary.
| Parameter | Default | Range | Source |
|---|---|---|---|
| GPU Fraction of Capex: GPUs as a fraction of total data center capital expenditure. SemiAnalysis estimates 40–50%. | 0.45 | 0.30–0.60 | SemiAnalysis |
| Interconnect Efficiency: Fraction of peak throughput achievable across multi-node interconnect. 0.85 for NVLink, 0.7 for InfiniBand/Ethernet backend. Ethernet surpassed InfiniBand in AI back-end network market share in 2025 (Dell'Oro Group); UEC 1.0 spec released June 2025. Meta validated Ethernet RoCE at 24K-GPU scale for LLaMA 3. | 0.85 | 0.60–0.95 | Epoch AI |
| Model FLOP Utilization (MFU): Fraction of theoretical peak FLOPs achieved during training. Typical range 30–50%. Chinchilla and PaLM reported 0.46–0.57 (depending on model size); Meta reported 38–43% for LLaMA 3 405B at 16K-GPU scale. | 0.40 | 0.20–0.65 | Epoch AI |
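And the cost-path disaggregation, in the same style (the $30K unit price is an illustrative H100-class ASP, not a site parameter; real ASPs vary widely, which is why this path carries the widest bounds):

```python
def cost_path_gpu_count(capex_usd: float,
                        gpu_capex_fraction: float = 0.45,
                        gpu_unit_price_usd: float = 30_000) -> int:
    """Disaggregate announced capex into an implied GPU count."""
    return int(capex_usd * gpu_capex_fraction / gpu_unit_price_usd)

# Illustrative: a $5B announcement
# -> 5e9 * 0.45 / 30_000 = 75,000 GPUs
```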
Models trained above this line must be reported under the EU AI Act as general-purpose AI with systemic risk, triggering red-teaming, incident reporting, and cybersecurity requirements.
Normalized pricing in $/petaFLOP-day derived from published hourly rates and Epoch AI hardware specs (dense BF16 TFLOP/s, without 2:4 structured sparsity). Four sub-indices cover cloud spot, cloud reserved, on-premises TCO, and sovereign procurement. All prices are converted to a common unit so effective compute cost is directly comparable across providers.
What does a standard unit of AI training compute actually cost, and how does that cost vary by provider, geography, and procurement model? This index converts all pricing to a single comparable unit. The same computing power can cost 3–10× more depending on where and how it is procured; that gap directly shapes who can afford to train frontier models and who cannot.
One petaFLOP-day is 10¹⁵ FLOP/s sustained for 86,400 seconds. This standardized unit enables direct cost comparison across hardware generations, cloud providers, and procurement models. The Cost Index expresses all prices as $/SCU (one SCU = one petaFLOP-day).
The SCU is a standardized measure of AI computing power sustained for one day. Converting all prices to this common unit makes it possible to compare the cost of training compute across different providers, chip generations, and purchasing arrangements. When a hyperscaler quotes $2.50/hr for a GPU instance and a neocloud quotes $1.80/hr for a different configuration, the $/SCU conversion reveals which actually delivers cheaper compute per unit of training work.
BF16 TFLOP/s dense, no 2:4 structured sparsity (source: Epoch AI ML Hardware dataset). On-prem defaults: MFU 0.40, PUE 1.20, $5K networking + $3K facility share per GPU (editorial estimates, tier 4).
Both the FLOP Capacity Engine and the Cost Index quote dense BF16 Tensor Core throughput without 2:4 structured sparsity (989.5 TFLOP/s per H100 SXM, 312 per A100, 2,250 per B200). This matches the Epoch AI ml_hardware reference and the throughput real training jobs achieve; most production workloads do not enable sparsity, and the MFU parameter is calibrated against this dense baseline. NVIDIA marketing and cloud-provider cut sheets sometimes quote the 2:4-sparsity figure (1,979 TFLOP/s for H100) — cost-per-petaFLOP-day values in this index will be roughly double a number computed against those sparsity-inclusive specs; that gap is the unit mismatch, not a pricing disagreement.
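To make the unit concrete, here is a minimal conversion from a quoted hourly rate to $/petaFLOP-day at peak dense throughput (a sketch; names are ours, and dividing by MFU would give an effective-compute cost):

```python
PFLOP_DAY_FLOP = 1e15 * 86_400  # one petaFLOP-day = 8.64e19 FLOP

def usd_per_pflop_day(hourly_rate_usd: float, dense_bf16_tflops: float) -> float:
    """Convert $/GPU-hour to $/petaFLOP-day at peak dense BF16 throughput."""
    pflop_days_per_gpu_day = dense_bf16_tflops * 1e12 * 86_400 / PFLOP_DAY_FLOP
    return hourly_rate_usd * 24 / pflop_days_per_gpu_day

# H100 at $2.50/hr: 60.00 / 0.9895 ~ $60.6 per petaFLOP-day
# A100 at $1.80/hr: 43.20 / 0.3120 ~ $138 -- the cheaper hourly rate
# is the more expensive compute, which is what the index surfaces.
```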
Cloud Spot: On-demand / spot pricing for GPU compute from major cloud providers (AWS, GCP, Azure, Lambda, CoreWeave).
Cloud Reserved: 1-year and 3-year reserved instance pricing, including committed-use discounts.
On-Premises TCO: Total cost of ownership for self-operated GPU clusters including hardware, power, cooling, facility, and labor.
Sovereign: Effective cost for government-funded sovereign AI compute including subsidies, grants, and below-market financing.
| Data Source | Cadence | Method |
|---|---|---|
| Cloud Provider APIs | Weekly | Automated API polling |
| Enterprise Contract Disclosures | Quarterly | SEC filing extraction |
| Hardware Vendor ASPs | Quarterly | Earnings transcript parsing |
| Electricity Tariff Data | Annually | EIA / regional utility databases |
| Construction Cost Indices | Annually | Turner / RSMeans indices |
BFS propagation over a weighted directed graph of supply chain relationships. Severity attenuates per hop via category-specific substitutability decay rates; edges weighted by FactSet Revere relationship values where available. When market correlation weighting is enabled, edge criticality blends editorial scores with FactSet 3-month stock price correlation to reflect market-revealed economic coupling. Configurable shock severity, propagation delay, and threshold cutoff.
If a critical supplier is disrupted, who loses access to what, and for how long? This simulation traces disruptions through the AI supply chain. Some components (EUV lithography equipment) have no substitutes; others (power delivery, construction) can be sourced from multiple vendors. The decay rate at each hop reflects that asymmetry.
The model starts at the disrupted company and traces outward through the supply chain, one step at a time. At each step, the severity of the disruption decreases based on how easy it is to find an alternative supplier for that category of goods. Equipment with no substitute (ASML’s EUV lithography machines) passes nearly full disruption downstream; categories with many vendors (construction, power delivery) absorb most of the shock. The process continues until the remaining severity falls below a 5% threshold or the cascade reaches six hops from the origin.
Severity attenuates at each hop based on the supply chain category's substitutability decay rate. Categories with near-zero substitutability (EUV lithography) propagate disruptions with minimal loss; categories with many alternatives (construction, power delivery) attenuate rapidly.
| Parameter | Default | Range | Description |
|---|---|---|---|
| propagation_delay | 30 days | 1–365 | Time for a disruption to propagate from one supply chain node to the next. |
| initial_shock_severity | 1 | 0.1–1 | Severity of the initial disruption at the source node (1.0 = complete disruption). |
Complete loss of TSMC Taiwan fab capacity (natural disaster, blockade).
Results from BFS propagation over the live supply chain graph (5 scenarios, 14 decay points each). For the full animated simulation, see the Cascade Simulation page. Lower decay = more severe propagation (each hop retains a larger fraction of the disruption).
Each supply chain category has a decay rate representing how much severity is absorbed per hop. Lower decay = harder to substitute = more severe cascade propagation.
Some parts of the supply chain can absorb disruptions because alternatives exist. Construction firms, power suppliers, and data center operators have multiple vendors. Semiconductor equipment is the opposite: ASML has no competitor for EUV lithography, so disruptions pass through with almost no loss. The decay rate for each category captures this difference.
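A minimal sketch of the propagation loop under the convention above (decay = fraction of severity absorbed per hop; the graph structure, default decay, and function names are hypothetical, and the live model additionally applies edge weights, correlation blending, and per-hop delays):

```python
from collections import deque

def cascade(graph, origin, shock=1.0, decay=None, threshold=0.05, max_hops=6):
    """BFS disruption propagation with per-category attenuation.

    graph: {node: [(downstream_node, category), ...]}
    decay: {category: fraction of severity absorbed per hop};
           near 0 for EUV lithography (no substitute), high for
           construction / power delivery (many vendors).
    """
    decay = decay or {}
    severity = {origin: shock}
    queue = deque([(origin, shock, 0)])
    while queue:
        node, sev, hops = queue.popleft()
        if hops >= max_hops:
            continue  # stop six hops from the origin
        for downstream, category in graph.get(node, []):
            s = sev * (1.0 - decay.get(category, 0.5))
            if s >= threshold and s > severity.get(downstream, 0.0):
                severity[downstream] = s
                queue.append((downstream, s, hops + 1))
    return severity
```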
Each supply chain layer holds a different amount of inventory that delays disruption propagation. Buffer data sourced from 10-K/annual report inventory line items (Tier 1), TrendForce/SemiAnalysis (Tier 2), and analyst estimates (Tier 3).
Real supply chains have stockpiles: chip makers hold weeks of inventory, memory suppliers pre-sell their output. These buffers determine how long downstream companies can operate before a disruption reaches them.
| Layer | Buffer | Range | Shape | Tier |
|---|---|---|---|---|
| EUV Lithography | None | 0–0 wk | cliff | T1 |
| Advanced Wafers (N3/N5) | 10 wk | 8–16 wk | linear | T1 |
| Packaged Chips (GPU) | 3 wk | 2–5 wk | cliff | T1 |
| Server Assembly | 14 wk | 8–26 wk | exponential | T3 |
| HBM Memory | 3 wk | 0–8 wk | cliff | T2 |
| ABF Substrates | 12 wk | 4–30 wk | linear | T2 |
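The three shape labels can be read as depletion curves roughly like the following (illustrative interpretations only; the production model's exact functional forms are not spelled out here):

```python
import math

def buffer_coverage(weeks_elapsed: float, buffer_weeks: float, shape: str) -> float:
    """Fraction of downstream demand still covered by inventory."""
    if buffer_weeks <= 0:
        return 0.0  # e.g. EUV lithography: no buffer at all
    t = weeks_elapsed / buffer_weeks
    if shape == "cliff":         # full coverage, then abrupt stockout
        return 1.0 if t < 1.0 else 0.0
    if shape == "linear":        # steady draw-down to zero
        return max(0.0, 1.0 - t)
    if shape == "exponential":   # long tail of partial coverage
        return math.exp(-t)
    raise ValueError(f"unknown shape: {shape}")
```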
When a supplier is disrupted, how fast can alternatives absorb demand? Qualification time is the lag before any substitute output appears; capacity ceiling is the maximum fraction absorbable. Ramp follows an S-curve from qualification to ceiling.
Not all supply chain links are equally fragile. Some disrupted suppliers can be replaced in months (Micron HBM); others have no substitute at all (ASML EUV lithography). This table shows how long replacement takes and how much demand alternative suppliers can absorb.
TSMC Advanced Node
ASML EUV: No viable substitute
SK Hynix HBM
NVIDIA Training GPUs
TSMC CoWoS Packaging
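A sketch of the ramp described above (the logistic parameterization is our assumption; only the qualification lag and capacity ceiling correspond to the table):

```python
import math

def substitute_absorption(weeks: float, qual_weeks: float, ceiling: float,
                          ramp_weeks: float = 8.0) -> float:
    """Fraction of lost supply absorbed by alternative suppliers.

    Zero until qualification completes, then an S-curve toward the
    capacity ceiling; ramp_weeks sets the (assumed) ramp midpoint.
    """
    if weeks <= qual_weeks or ceiling <= 0:
        return 0.0  # no substitute output before qualification
    x = (weeks - qual_weeks - ramp_weeks) / (ramp_weeks / 4)
    return ceiling / (1.0 + math.exp(-x))

# A supplier with no viable substitute (ASML EUV) is modeled as
# ceiling = 0: the disruption is never absorbed.
```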
Temporal parameters calibrated against three observed disruption events. Observed propagation delays and buffer absorption patterns validate the model's depletion and substitution functions.
We tested the model against real supply chain disruptions to verify that the predicted timelines match what actually happened. Three events span distinct disruption archetypes: sudden shutdown, commodity shock absorbed by buffers, and sustained capacity bottleneck.
Source: TrendForce March 2021; Samsung Q1 2021 earnings ($268–357M loss) (T1)
Source: USITC Executive Briefing (DeCarlo & Goodman, Apr 2022); CSIS March 2022; Reuters (T1)
Source: TSMC Chairman Mark Liu (Sep 2023); TrendForce Aug 2024 (SPIL order); UBS analyst estimates (T1)
Edge weights draw on three signals. Supply share (% of customer input from this supplier) is available on ~3.4% of edges; the remainder use a default of 30%. Criticality (1–10 replaceability) is editorial for all edges but can be blended with FactSet 3-month stock price correlation (Pearson r, available on ~78% of FactSet edges) when market correlation weighting is enabled. The blend formula is adjustedCriticality = w × 5(1+r) + (1−w) × editorial, where w defaults to 0.5. For edges without correlation data (WRDS, unlisted companies), editorial scores stand alone. Sole-source edges retain criticality ≥ 9 regardless of correlation. Price correlations are from FactSet Workstation downloads (vintage: April 2026). The structural centrality proxy — the product of flow share and criticality — follows Li et al. 2020 (PMC7546950).
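The blend is simple enough to state directly (a sketch mirroring the formula above; the function name is ours):

```python
def adjusted_criticality(editorial: float, r: float | None = None,
                         w: float = 0.5, sole_source: bool = False) -> float:
    """Blend editorial criticality (1-10) with market correlation.

    r: FactSet 3-month price correlation (Pearson, -1..1), or None
    when unavailable (WRDS edges, unlisted companies).
    """
    if r is None:
        score = editorial                      # editorial stands alone
    else:
        score = w * 5 * (1 + r) + (1 - w) * editorial
    return max(score, 9.0) if sole_source else score

# r = 0.8 with editorial 7: 0.5*5*1.8 + 0.5*7 = 4.5 + 3.5 = 8.0
```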
The “total affected compute” metric blends two weighting approaches: for nodes with facility-level FLOP/day estimates, impact is FLOP-weighted; for nodes without FLOP data, impact is weighted by network degree centrality (number of supply chain connections). Users should interpret this as an approximation — the topology-weighted component is a structural proxy, not a measurement.
The supply chain graph is assembled from multiple data sources; each edge carries its own provenance and authority tier.
The supplier/customer relationships that feed this simulation come from four sources of varying reliability. Relationships from SEC regulatory filings (Tier 1) are the most reliable; market-share-derived links from CSET (Tier 2) capture industry structure without bilateral specificity; press and analyst reports (Tier 3) fill gaps.
Four authority tiers (T1: primary measurement, T2: research database, T3: press/analyst, T4: estimated/inferred), assigned by the authority of the original source, not of any intermediary. A CRS report citing a Goldman Sachs estimate remains T3. Aggregate confidence for derived values takes the weakest input tier.
A number from a company’s SEC filing is more reliable than one from a press report, which in turn is more reliable than an analyst estimate. Every value on Scrutica carries a confidence label (T1 through T4) so you can judge how much weight to place on it. When a derived value combines sources of different quality, the overall confidence reflects the weakest input.
Value from a primary authoritative source with direct measurement or legal disclosure obligation.
Value from academic publication or research organization with documented methodology.
Value derived from proxy methods, industry analyst estimates, or press reports. Methodology documented.
Value from secondary inference, expert opinion, or interpolation. Widest estimation bounds.
When a derived value combines inputs from multiple tiers, the aggregate confidence is the lowest tier among all inputs. A GPU count estimate (Tier 3) combined with a verified power capacity (Tier 1) yields a Tier 3 aggregate, because the weakest link determines the chain's strength.
When Scrutica calculates a number from multiple inputs, the overall confidence reflects the least reliable source. If one input comes from a manufacturer's SEC filing (highly reliable) but another comes from a press estimate (less reliable), the combined result inherits the lower confidence. This prevents derived values from appearing more certain than their weakest ingredient.
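The weakest-link rule reduces to a one-liner (tiers as integers, 1 most authoritative):

```python
def aggregate_tier(input_tiers: list[int]) -> int:
    """Aggregate confidence = numerically highest (weakest) input tier."""
    return max(input_tiers)

# A Tier 3 GPU-count estimate combined with Tier 1 power capacity:
# aggregate_tier([3, 1]) -> 3
```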
When sources disagree on a factual value, Scrutica shows both values with source attribution rather than silently picking a winner. The protocol:
is_estimated Flag: This boolean means "this value was derived via a model or proxy method." It does not mean "this value might be null" or "this value has uncertainty."
Scrutica estimates are compared against independently published figures from Epoch AI, CSET, TrendForce, and MLPerf. Where estimates diverge, the discrepancy traces to different assumptions about GPU utilization rates, hardware configuration, or facility-level power draw.
| Validation Source | Data Type | Coverage | Methodology Difference |
|---|---|---|---|
| Epoch AI GPU Clusters | Facility PFLOP/s estimates | ~786 clusters, ~26 with PFLOP estimates | Epoch uses disclosed GPU counts; Scrutica uses three estimation paths (hardware, power, cost) and reports estimation bounds for each |
| CSET AI Indicators | Company-level AI investment | ~691 companies | CSET aggregates financial disclosures; Scrutica adds FactSet Revere supply chain edges for interdependence analysis |
| TrendForce / Counterpoint | Semiconductor market share | Quarterly updates | Analyst estimates vs. Scrutica concentration scoring derived from supply chain topology |
| MLPerf Inference | Hardware specifications (accelerator type, memory, count) | 92 datacenter system submissions (v4.1) | MLPerf publishes verified hardware configurations per submission; Scrutica cross-checks accelerator types and memory specs against its chip catalog |
MLPerf Hardware Cross-Check
20 unique system configurations from MLPerf Inference v4.1 submissions provide independent verification of accelerator specifications. Accelerator families represented: AMD Instinct MI300X, NVIDIA H100, NVIDIA L40S, NVIDIA Jetson AGX Orin 64G, NVIDIA H200, NVIDIA L4, NVIDIA B200, NVIDIA A100, NVIDIA GH200, TPU v5e, TPU v6, NVIDIA GH200 Grace Hopper Superchip 144GB, NVIDIA GH200 Grace Hopper Superchip 96GB, UntetherAI speedAI240 Preview, UntetherAI speedAI240 Slim.
Facility-level PFLOP/s cross-validation requires overlapping estimates from independent sources. As Scrutica and Epoch coverage expands, matched facility pairs will appear in the table above. Users who want to reproduce our estimates can adjust all parameters in the Interactive Methodology Explorer above.
When citing specific estimates, include the methodology version and estimation path (e.g., "Hardware Path, v1.2.0") so readers can reproduce the calculation.