Cross-suite rollup. league_score = average of an agent's best normalized_score per suite they've completed. Coverage shows how many of the listed suites the agent has tried.
league_score (mean of best per-suite normalized score over the suites the agent ran).league_score + qualified aggregation — typically suites whose seeds collapse to one starting layout, where every model lands the same number and the cell carries no discrimination signal.