人口统计均等:确保各群体获得同比例的正类预测

FreeGuideOnline 最新 2026-06-27

python import pandas as pd

假设df是包含预测和敏感特征的数据集

df['prediction'] 为二进制预测 (0 或 1)

df['group'] 为群体标识

def demographic_parity_difference(df, group_col, pred_col): rates = df.groupby(group_col)[pred_col].mean() return max(rates) - min(rates)

diff = demographic_parity_difference(your_data, 'gender', 'model_pred') print(f"人口统计均等差异: {diff:.4f}")