FLOP Estimation Engine

Three independent paths to facility-level compute capacity: hardware inventory (GPU count x peak FLOP/s x MFU x utilization), power draw (thermal envelope, PUE-adjusted, to GPU count to FLOP/s), and capital cost (capex decomposition to GPU count to FLOP/s). Cross-validation across paths produces estimation bounds.

How much AI computing power does a data center have? Three independent estimation paths: hardware inventory, electricity consumption, and construction cost. Agreement across paths narrows the estimation bounds; disagreement widens them, and we show you the range with an explanation of why.

3 corrections · latest Apr 25, 2026

v1.2.0·2026-04-18

Estimation Path

GPU count and model known. Narrowest estimation bounds.When the number and type of GPUs in a facility are known. Produces the most precise estimate.

Assumed GPU Model

FP16: 990 TFLOP/s (dense)TDP: 700WCost: $30K ($25K–$40K)

Input Parameters

GPU Count

Source: User input

Shared Parameters

Interconnect efficiencyNetwork overhead (performance lost to communication between GPUs)

Source: Epoch AI|Fraction of peak throughput achievable across multi-node interconnect. 0.85 for NVLink, 0.7 for InfiniBand/Ethernet backend. Ethernet surpassed InfiniBand in AI back-end network market share in 2025 (Dell'Oro Group); UEC 1.0 spec released June 2025. Meta validated Ethernet RoCE at 24K-GPU scale for LLaMA 3.

MFU (Model FLOP Utilization)Training efficiency (fraction of peak speed achieved during AI training)

Source: Epoch AI|Fraction of theoretical peak FLOPs achieved during training. Typical range 30–50%. Chinchilla reported 0.46–0.57 (depending on model size); PaLM achieved 0.46–0.57; Meta reported 38–43% for LLaMA 3 405B at 16K-GPU scale.

Estimation Results

Path A: Hardware-Based

Peak FP16 ThroughputMaximum processing speed9.89e18 FLOP/s

Training ThroughputEffective training speed3.36e18 FLOP/s

Daily FLOP Budgetwhat is this?what is this?

2.91 × 10²³FLOP/day

Estimation bounds (log scale)0.7 orders of magnitude

1.03 × 10²³2.91 × 10²³5.28 × 10²³

Bounds vary interconnect efficiency and MFU across documented ranges. These are parameter sensitivity ranges derived from input bounds, not distributional estimates.

EU AI Act 10²⁵ Cumulative FLOP Threshold

This facility could reach the threshold in 34 days of continuous training at current estimated capacity.

At current estimated capacity, this facility could accumulate enough compute to cross the threshold in 34 days of continuous operation.

The 10²⁵ threshold is cumulative training FLOP (a total), not a daily rate. Actual training runs include downtime, checkpointing, and communication overhead.

AI models trained above this threshold (10²⁵ cumulative FLOP) must be reported under the EU AI Act as general-purpose AI systems with potential systemic risk, which triggers requirements for red-teaming, incident reporting, and cybersecurity measures. The days shown assume continuous operation; actual training includes downtime and restarts, so real timelines will be longer.

Full Methodology

Note on FP16 Performance FiguresNote on Performance Measurement

GPU TFLOP/s values use dense (no structured sparsity) FP16/BF16 Tensor Core specs: A100: 312, H100/H200: 989.5, MI300X: 1,307.4, B200: 2,250 TFLOP/s. NVIDIA’s 2:4 structured sparsity doubles these rates (H100 = 1,979; B200 = 4,500) and AMD’s matrix engines likewise publish a sparsity-inclusive 2,614.9 for MI300X. Most production training does not enable structured sparsity, so the dense baseline is the right reference and the MFU parameter is calibrated against it. Same convention as Epoch AI’s ml_hardware dataset.

The Compute Cost Index uses the same dense BF16 throughput (989.5 TFLOP/sper H100), so $/petaFLOP-day values compare directly to MFU-normalized throughput here. A reader deriving $/SCU from cloud $/hour against sparsity-inclusive vendor specs will read values ~2× lower than the index; that gap is the unit mismatch, not a pricing disagreement.

GPU manufacturers advertise peak performance numbers that vary depending on the measurement standard. This calculator uses a single consistent standard (dense FP16 operations without structured sparsity) aligned with Epoch AI’s methodology, which means the numbers here will not match what cloud providers advertise. The difference is methodological, not an error; both produce equivalent results when calibrated consistently. The separate Compute Cost Index uses the same dense FP16 standard, so $/petaFLOP-day values compare directly to the throughput numbers shown here.

Which facilities exceed regulatory FLOP thresholds?

Multi-regime threshold calculator: EU AI Act (10²⁵), US Executive Order, IFR. Reverse calculator, facility auto-populate, post-training compute analysis.

FLOP Compliance Calculator →

Data Provenance

Primary67%Research33%

Sources

Source	Tier
NVIDIA Spec Sheets	T1
Epoch AI GPU Specs	T2
MLPerf Benchmarks	T1

Processing

Three estimation paths: hardware (GPU count x FP16 spec), power (MW / PUE / GPU TDP), cost (capex / GPU unit price). Outputs are parameter sensitivity ranges, not statistical confidence intervals. Every assumption is adjustable and its source documented inline.

Reading this data

~valueEstimated — derived, modeled, or inferred. Hover for method and source.

valueDerived from another measured field (e.g. power from GPU count).

Source conflict — multiple sources disagree. Click for competing values.

hoverHover any annotated value to see authority tier, method, source, and vintage.

Full methodology

Related analysis

Facility DirectoryInfrastructure

The facilities this estimates

FLOP ComplianceGovernance

Apply regulatory thresholds to these estimates

MethodologyReference

Full derivation chains for every formula