Skip to content

CORE — Capital and Operations Resource Evaluator

The Capital and Operations Resource Evaluator (CORE) grades the hardware layer of your AI infrastructure — whether the GPUs you operate are the right hardware for the workloads you run, how old the fleet is relative to the GPU refresh cycle, and how efficiently embodied carbon is amortized across the compute capacity those GPUs deliver. CORE identifies whether hardware decisions are creating a structural ceiling on what ACE and PACE can achieve.

core_efficiency_score — a number from 0.0 to 1.0 composed of three fleet-weighted components:

  • Hardware fit (40%) — how well each GPU model matches the declared primary workload type
  • Fleet age (35%) — GPU age in years, anchored to published refresh-cycle references
  • Embodied carbon (25%) — carbon efficiency normalized by compute capacity delivered (kg CO₂e per TFLOP/s)
core_efficiency_score =
hardware_fit_score × 0.40 +
fleet_age_score × 0.35 +
embodied_carbon_score × 0.25

All three component scores are GPU-count-weighted fleet averages. A fleet with 800 H100s and 200 A100s weights H100 at 80% and A100 at 20%.

Scores how well each GPU model suits the declared primary workload. Derived from MLPerf Training v3.1 and MLPerf Inference v4.0 benchmark results (mlcommons.org/results), normalized so the best available result per workload = 1.00. AMD and Intel scores from their respective MLPerf submissions.

Selected fit scores:

GPUtraining_largetraining_smallinferencemixed
B2001.000.920.950.97
GB2001.000.900.950.97
B1000.950.900.920.93
H2000.900.890.930.91
H1000.880.880.900.89
MI300X0.850.880.900.87
Gaudi30.720.750.780.75
A1000.520.720.600.62
L40S0.420.720.830.68
Gaudi20.550.620.600.58
A400.380.700.750.60
RTX Pro 60000.400.720.880.65
RTX 40900.280.580.720.50
V1000.220.420.380.32
unknown0.400.400.400.40

Consumer GPUs (RTX 4090, RTX 3090) have no MLPerf Datacenter submission — their scores are extrapolated from published FLOP/s ratios and carry higher uncertainty.

When MIG partitioning is enabled on a GPU record, the fit score is multiplied by active_instances / total_instances. Idle MIG partitions represent configured but unused compute capacity.

mig_multiplier = mig_instances_active / mig_instance_count
fit_score_effective = fit_score × mig_multiplier

Age is computed from assessment_year − purchase_year per GPU record, then weighted by GPU count across the fleet. Scoring is anchored to published GPU hardware refresh-cycle references.

Age (years)ScoreRationale
≤ 21.00Current generation, full performance, full software support
≤ 40.80One generation behind, still widely supported
≤ 60.60Two generations behind, approaching EOL driver support
≤ 80.35EOL or near-EOL, limited workload support
> 80.15Legacy, significant efficiency penalty

Sources: public hyperscaler sustainability reports; GPU generation cadence references; DOE ESIF hardware lifecycle documentation.

Embodied carbon is normalized by compute capacity delivered — kg CO₂e per TFLOP/s — rather than absolute carbon per GPU. This correctly reflects that a high-performance GPU with higher embodied carbon may be more carbon-efficient per unit of useful compute than a legacy GPU with lower absolute carbon.

kg_per_tflops = embodied_carbon_kg / tflops_per_gpu
score = 1 − (kg_per_tflops / 2.24) [clamped 0.0 – 1.0]

Reference baseline: A100 at 700 kg CO₂e / 312 TFLOP/s = 2.24 kg/TFLOP/s → score 0.0. GPUs with better compute-per-carbon score above 0.0. GPUs below A100 density also score 0.0 (clamped).

Selected embodied carbon values and scores:

GPUTFLOP/s (FP16)CO₂e (kg)kg/TFLOP/sScore
B2004,5001,2000.2670.881
H1001,9797000.3540.842
MI300X2,6149500.3640.838
H2001,9799500.4800.786
Gaudi31,8359000.4900.781
L40S7334200.5730.744
A1003127002.2440.0 (reference)
V1001255004.0000.0 (clamped)

Sources: NVIDIA Product Carbon Footprint reports; Dell/HPE lifecycle data; Gupta et al. 2021; Patterson et al. 2022. Blackwell values extrapolated from die-size scaling.

Fleet: 512 H100s purchased 2023, 256 A100s purchased 2021. Primary use: training_large. Assessment year: 2026.

H100 record (512 GPUs, age = 3 years):
fit_score = 0.88 (H100 × training_large)
age_score = 0.80 (age 3 ≤ 4)
embodied_score = 0.842 (700 kg / 1979 TFLOP/s = 0.354 kg/TFLOP/s)
A100 record (256 GPUs, age = 5 years):
fit_score = 0.52 (A100 × training_large)
age_score = 0.60 (age 5 ≤ 6)
embodied_score = 0.00 (reference baseline — clamped)
Fleet-weighted averages (512 H100 + 256 A100 = 768 total):
hardware_fit_score = (512×0.88 + 256×0.52) / 768 = (450.6 + 133.1) / 768 = 0.760
fleet_age_score = (512×0.80 + 256×0.60) / 768 = (409.6 + 153.6) / 768 = 0.733
embodied_carbon_score = (512×0.842 + 256×0.00) / 768 = 431.1 / 768 = 0.562
core_efficiency_score =
0.760 × 0.40 +
0.733 × 0.35 +
0.562 × 0.25
= 0.304 + 0.257 + 0.141
= 0.702 → Grade B (Good)

The A100s drag down all three components. Refreshing the 256 A100s to H100s would push composite to ~0.85 (Grade A).

GradeConditionLabel
Ascore ≥ 0.80Excellent
Bscore ≥ 0.65Good
Cscore ≥ 0.50Adequate
Dscore ≥ 0.35Poor
Fscore < 0.35Failing
MetricUnitDescription
core_efficiency_scorescore_0_to_1Weighted composite — GRADE input
hardware_fit_scorescore_0_to_1Fleet-weighted fit for declared primary workload
fleet_age_scorescore_0_to_1Fleet-weighted age band score
embodied_carbon_scorescore_0_to_1Fleet-weighted kg CO₂e per TFLOP/s score
fleet_age_years_weightedyearsGPU-count-weighted mean fleet age
hardware_efficiency_gradegrade_points_0_to_4A=4, B=3, C=2, D=1, F=0

CORE reads gpu_efficiency_score from an ACE output when ace_report_path is provided. This is informational — it does not affect the CORE composite formula in v0.1, but it appears in the GRADE report alongside CORE findings to give context on whether a hardware fit gap is being exposed by actual workloads.

CORE takes a JSON file with a hardware fleet list and a workload profile. The fleet is a list of HardwareRecord objects — one per distinct GPU model and purchase cohort.

FieldTypeRequiredDescription
organizationstringyesOrganization name
periodstringyesAssessment year (YYYY)
schedulerstringyesslurm, kubernetes, or unknown
hardware_fleetlistyesList of hardware records (see below)
workload_profileobjectyesWorkload characteristics
ace_report_pathstringnoPath to ACE ptl_output for cross-reference

Hardware record fields:

FieldTypeRequiredDescription
gpu_modelstringyesh100, a100, b200, mi300x, v100, etc.
gpu_countintyesNumber of GPUs in this record
purchase_yearintyesYear purchased or last substantially upgraded
tdp_wattsintyesGPU TDP in watts
embodied_carbon_kgfloatnoTotal embodied CO₂e for record; uses default if absent
mig_enabledboolnoWhether MIG partitioning is active
mig_instance_countintnoTotal MIG instances configured
mig_instances_activeintnoMIG instances currently in use

Workload profile fields:

FieldTypeRequiredDescription
primary_usestringyestraining_large, training_small, inference, mixed, or unknown
avg_gpu_utilizationfloatyesAverage GPU utilization (0.0–1.0)
avg_job_duration_minutesfloatyesAverage job duration in minutes
Terminal window
# Score a single organization from a core_input JSON file
core analyze \
--input core_input.json \
--output core_output.json
# Run all synthetic hardware fleets and print grades
core demo
# Validate a ptl_output_v1.json export against the PTL schema
core validate --input core_output.json

Example core_input.json:

{
"organization": "National AI Research Center",
"period": "2026",
"scheduler": "slurm",
"hardware_fleet": [
{
"gpu_model": "h100",
"gpu_count": 512,
"purchase_year": 2023,
"tdp_watts": 700
},
{
"gpu_model": "a100",
"gpu_count": 256,
"purchase_year": 2021,
"tdp_watts": 400
}
],
"workload_profile": {
"primary_use": "training_large",
"avg_gpu_utilization": 0.72,
"avg_job_duration_minutes": 240
}
}