CORE — Capital and Operations Resource Evaluator
The Capital and Operations Resource Evaluator (CORE) grades the hardware layer of your AI infrastructure — whether the GPUs you operate are the right hardware for the workloads you run, how old the fleet is relative to the GPU refresh cycle, and how efficiently embodied carbon is amortized across the compute capacity those GPUs deliver. CORE identifies whether hardware decisions are creating a structural ceiling on what ACE and PACE can achieve.
Primary metric
Section titled “Primary metric”core_efficiency_score — a number from 0.0 to 1.0 composed of three fleet-weighted components:
- Hardware fit (40%) — how well each GPU model matches the declared primary workload type
- Fleet age (35%) — GPU age in years, anchored to published refresh-cycle references
- Embodied carbon (25%) — carbon efficiency normalized by compute capacity delivered (kg CO₂e per TFLOP/s)
Formula
Section titled “Formula”core_efficiency_score = hardware_fit_score × 0.40 + fleet_age_score × 0.35 + embodied_carbon_score × 0.25All three component scores are GPU-count-weighted fleet averages. A fleet with 800 H100s and 200 A100s weights H100 at 80% and A100 at 20%.
Hardware-workload fit
Section titled “Hardware-workload fit”Scores how well each GPU model suits the declared primary workload. Derived from MLPerf Training v3.1 and MLPerf Inference v4.0 benchmark results (mlcommons.org/results), normalized so the best available result per workload = 1.00. AMD and Intel scores from their respective MLPerf submissions.
Selected fit scores:
| GPU | training_large | training_small | inference | mixed |
|---|---|---|---|---|
| B200 | 1.00 | 0.92 | 0.95 | 0.97 |
| GB200 | 1.00 | 0.90 | 0.95 | 0.97 |
| B100 | 0.95 | 0.90 | 0.92 | 0.93 |
| H200 | 0.90 | 0.89 | 0.93 | 0.91 |
| H100 | 0.88 | 0.88 | 0.90 | 0.89 |
| MI300X | 0.85 | 0.88 | 0.90 | 0.87 |
| Gaudi3 | 0.72 | 0.75 | 0.78 | 0.75 |
| A100 | 0.52 | 0.72 | 0.60 | 0.62 |
| L40S | 0.42 | 0.72 | 0.83 | 0.68 |
| Gaudi2 | 0.55 | 0.62 | 0.60 | 0.58 |
| A40 | 0.38 | 0.70 | 0.75 | 0.60 |
| RTX Pro 6000 | 0.40 | 0.72 | 0.88 | 0.65 |
| RTX 4090 | 0.28 | 0.58 | 0.72 | 0.50 |
| V100 | 0.22 | 0.42 | 0.38 | 0.32 |
| unknown | 0.40 | 0.40 | 0.40 | 0.40 |
Consumer GPUs (RTX 4090, RTX 3090) have no MLPerf Datacenter submission — their scores are extrapolated from published FLOP/s ratios and carry higher uncertainty.
MIG utilization multiplier
Section titled “MIG utilization multiplier”When MIG partitioning is enabled on a GPU record, the fit score is multiplied by active_instances / total_instances. Idle MIG partitions represent configured but unused compute capacity.
mig_multiplier = mig_instances_active / mig_instance_countfit_score_effective = fit_score × mig_multiplierFleet age scoring
Section titled “Fleet age scoring”Age is computed from assessment_year − purchase_year per GPU record, then weighted by GPU count across the fleet. Scoring is anchored to published GPU hardware refresh-cycle references.
| Age (years) | Score | Rationale |
|---|---|---|
| ≤ 2 | 1.00 | Current generation, full performance, full software support |
| ≤ 4 | 0.80 | One generation behind, still widely supported |
| ≤ 6 | 0.60 | Two generations behind, approaching EOL driver support |
| ≤ 8 | 0.35 | EOL or near-EOL, limited workload support |
| > 8 | 0.15 | Legacy, significant efficiency penalty |
Sources: public hyperscaler sustainability reports; GPU generation cadence references; DOE ESIF hardware lifecycle documentation.
Embodied carbon scoring
Section titled “Embodied carbon scoring”Embodied carbon is normalized by compute capacity delivered — kg CO₂e per TFLOP/s — rather than absolute carbon per GPU. This correctly reflects that a high-performance GPU with higher embodied carbon may be more carbon-efficient per unit of useful compute than a legacy GPU with lower absolute carbon.
kg_per_tflops = embodied_carbon_kg / tflops_per_gpuscore = 1 − (kg_per_tflops / 2.24) [clamped 0.0 – 1.0]Reference baseline: A100 at 700 kg CO₂e / 312 TFLOP/s = 2.24 kg/TFLOP/s → score 0.0. GPUs with better compute-per-carbon score above 0.0. GPUs below A100 density also score 0.0 (clamped).
Selected embodied carbon values and scores:
| GPU | TFLOP/s (FP16) | CO₂e (kg) | kg/TFLOP/s | Score |
|---|---|---|---|---|
| B200 | 4,500 | 1,200 | 0.267 | 0.881 |
| H100 | 1,979 | 700 | 0.354 | 0.842 |
| MI300X | 2,614 | 950 | 0.364 | 0.838 |
| H200 | 1,979 | 950 | 0.480 | 0.786 |
| Gaudi3 | 1,835 | 900 | 0.490 | 0.781 |
| L40S | 733 | 420 | 0.573 | 0.744 |
| A100 | 312 | 700 | 2.244 | 0.0 (reference) |
| V100 | 125 | 500 | 4.000 | 0.0 (clamped) |
Sources: NVIDIA Product Carbon Footprint reports; Dell/HPE lifecycle data; Gupta et al. 2021; Patterson et al. 2022. Blackwell values extrapolated from die-size scaling.
Worked example
Section titled “Worked example”Fleet: 512 H100s purchased 2023, 256 A100s purchased 2021. Primary use: training_large. Assessment year: 2026.
H100 record (512 GPUs, age = 3 years): fit_score = 0.88 (H100 × training_large) age_score = 0.80 (age 3 ≤ 4) embodied_score = 0.842 (700 kg / 1979 TFLOP/s = 0.354 kg/TFLOP/s)
A100 record (256 GPUs, age = 5 years): fit_score = 0.52 (A100 × training_large) age_score = 0.60 (age 5 ≤ 6) embodied_score = 0.00 (reference baseline — clamped)
Fleet-weighted averages (512 H100 + 256 A100 = 768 total): hardware_fit_score = (512×0.88 + 256×0.52) / 768 = (450.6 + 133.1) / 768 = 0.760 fleet_age_score = (512×0.80 + 256×0.60) / 768 = (409.6 + 153.6) / 768 = 0.733 embodied_carbon_score = (512×0.842 + 256×0.00) / 768 = 431.1 / 768 = 0.562
core_efficiency_score = 0.760 × 0.40 + 0.733 × 0.35 + 0.562 × 0.25= 0.304 + 0.257 + 0.141= 0.702 → Grade B (Good)The A100s drag down all three components. Refreshing the 256 A100s to H100s would push composite to ~0.85 (Grade A).
Letter grade
Section titled “Letter grade”| Grade | Condition | Label |
|---|---|---|
| A | score ≥ 0.80 | Excellent |
| B | score ≥ 0.65 | Good |
| C | score ≥ 0.50 | Adequate |
| D | score ≥ 0.35 | Poor |
| F | score < 0.35 | Failing |
Findings exported to GRADE
Section titled “Findings exported to GRADE”| Metric | Unit | Description |
|---|---|---|
core_efficiency_score | score_0_to_1 | Weighted composite — GRADE input |
hardware_fit_score | score_0_to_1 | Fleet-weighted fit for declared primary workload |
fleet_age_score | score_0_to_1 | Fleet-weighted age band score |
embodied_carbon_score | score_0_to_1 | Fleet-weighted kg CO₂e per TFLOP/s score |
fleet_age_years_weighted | years | GPU-count-weighted mean fleet age |
hardware_efficiency_grade | grade_points_0_to_4 | A=4, B=3, C=2, D=1, F=0 |
ACE cross-reference
Section titled “ACE cross-reference”CORE reads gpu_efficiency_score from an ACE output when ace_report_path is provided. This is informational — it does not affect the CORE composite formula in v0.1, but it appears in the GRADE report alongside CORE findings to give context on whether a hardware fit gap is being exposed by actual workloads.
Input schema
Section titled “Input schema”CORE takes a JSON file with a hardware fleet list and a workload profile. The fleet is a list of HardwareRecord objects — one per distinct GPU model and purchase cohort.
| Field | Type | Required | Description |
|---|---|---|---|
organization | string | yes | Organization name |
period | string | yes | Assessment year (YYYY) |
scheduler | string | yes | slurm, kubernetes, or unknown |
hardware_fleet | list | yes | List of hardware records (see below) |
workload_profile | object | yes | Workload characteristics |
ace_report_path | string | no | Path to ACE ptl_output for cross-reference |
Hardware record fields:
| Field | Type | Required | Description |
|---|---|---|---|
gpu_model | string | yes | h100, a100, b200, mi300x, v100, etc. |
gpu_count | int | yes | Number of GPUs in this record |
purchase_year | int | yes | Year purchased or last substantially upgraded |
tdp_watts | int | yes | GPU TDP in watts |
embodied_carbon_kg | float | no | Total embodied CO₂e for record; uses default if absent |
mig_enabled | bool | no | Whether MIG partitioning is active |
mig_instance_count | int | no | Total MIG instances configured |
mig_instances_active | int | no | MIG instances currently in use |
Workload profile fields:
| Field | Type | Required | Description |
|---|---|---|---|
primary_use | string | yes | training_large, training_small, inference, mixed, or unknown |
avg_gpu_utilization | float | yes | Average GPU utilization (0.0–1.0) |
avg_job_duration_minutes | float | yes | Average job duration in minutes |
CLI usage
Section titled “CLI usage”# Score a single organization from a core_input JSON filecore analyze \ --input core_input.json \ --output core_output.json
# Run all synthetic hardware fleets and print gradescore demo
# Validate a ptl_output_v1.json export against the PTL schemacore validate --input core_output.jsonExample core_input.json:
{ "organization": "National AI Research Center", "period": "2026", "scheduler": "slurm", "hardware_fleet": [ { "gpu_model": "h100", "gpu_count": 512, "purchase_year": 2023, "tdp_watts": 700 }, { "gpu_model": "a100", "gpu_count": 256, "purchase_year": 2021, "tdp_watts": 400 } ], "workload_profile": { "primary_use": "training_large", "avg_gpu_utilization": 0.72, "avg_job_duration_minutes": 240 }}