模型解释性

很多实际应用中，模型可解释性(interpretability)非常重要

方法

线性方程
- weight代表了与目标的关系
树模型
- 所有树中被用作分裂特征的次数
- 每个特征在所有树中被用作分裂特征时，所贡献的增益（即信息增益）
- 特征在所有树中被用作分裂特征时，所覆盖的样本数
特征重要性
- Boruta
- LOFO
- Permutation Importance
SHAP
- https://github.com/shap/shap
- Shapley值用于公平分配多个参与者在合作中所带来的收益，机器学习模型解释中，SHAP用于计算每个特征对模型预测结果的贡献
LIME
- The overall goal of LIME is to identify an interpretable model over the interpretable representation that is locally faithful to the classifier
Eli5
- 工具: https://github.com/eli5-org/eli5
permutation importance

import numpy as np

def permutation_importances(model, X, y, metric):
    baseline = metric(model, X, y)
    imp = []
    for col in X.columns:
        save = X[col].copy()
        X[col] = np.random.permutation(X[col])
        m = metric(model, X, y)
        X[col] = save
        imp.append(baseline - m)
    return np.array(imp)

参考

Previous异常检测与风控 Next多任务学习

Last updated 7 months ago