H1 — Performance report

For each drift scenario (no_drift, gradual, seasonal, abrupt, severe_abrupt, catastrophic) we compare the MAS policy against each baseline (Static ROP, Periodic Forecasting) on two outcomes: stockout rate (service failure) and total cost (the sum of holding, ordering, and stockout penalties). Each cell pools 10 seed replications.

Significance is tested with Mann–Whitney U as the primary (non-parametric, our seed distributions aren't normal), Welch's t as a parametric cross-check, and Cohen's d for effect size. Bolded rows in the tables below indicate Mann–Whitney p < 0.05.

Stockout rate

Scenario	vs	MAS mean	Baseline mean	Δ	MW p	Welch p	Cohen's d

Total cost

Scenario	vs	MAS mean	Baseline mean	Δ	MW p	Welch p	Cohen's d