打开黑箱：机器学习模型的可解释性技术言理, 道理讲通了就可以了

打开黑箱：机器学习模型的可解释性技术

date

Jun 6, 2020

slug

explanatory-machine-learning

status

Published

summary

刷 TDS(TowardDataScience.com) 和统计之都发现三个好东西： - 从任何机器学习模型中抽取人类可理解的洞察（Extracting human understandable insights from any Machine Learning model，作者 Parul Pandey），文章介绍了最为常用的 Permutation Importance, Partial Dependence Plots, SHAP values，Advanced Uses of SHAP Values 四种方法； - 可解释的机器学习指南（Guide to Interpretable Machine Learning，作者 Matthew Stewart, PhD Researcher），主要讲可视化，相对更加专业。 - Christoph Molnar 的新书 Interpretable Machine Learning，该书籍对机器学习可解释性有较为系统的介绍。最近没时间仔细学习，故快速浏览了一下两篇文章和这本书目录，以 [1]、[2]、[3 ] 分别标注来源作简单小结，以备后用。

tags

AI

Academic

type

Post

auth

刷 TDS(TowardDataScience.com) 和统计之都发现三个好东西：

从任何机器学习模型中抽取人类可理解的洞察（Extracting human understandable insights from any Machine Learning model，作者 Parul Pandey），文章介绍了最为常用的 Permutation Importance, Partial Dependence Plots, SHAP values，Advanced Uses of SHAP Values 四种方法；

可解释的机器学习指南（Guide to Interpretable Machine Learning，作者 Matthew Stewart, PhD Researcher），主要讲可视化，相对更加专业。

Christoph Molnar 的新书 Interpretable Machine Learning，该书籍对机器学习可解释性有较为系统的介绍。

最近没时间仔细学习，故快速浏览了一下两篇文章和这本书目录，以 [1]、[2]、[3 ] 分别标注来源作简单小结，以备后用。

可解释的模型(Interpretable Models)

Linear Regression 线性回归 [2] [3]

Logistic Regression 逻辑回归 [3]

GLM, GAM and more 广义线性模型、广义加法模型和更多 [3]

Decision Tree 决策树 [2] [3]

Decision Rules 判定准则（不仅是贝叶斯判定准则） [3]

RuleFit 规则拟合 [3]

Other Interpretable Models 其他可解释模型 [3]

Naive Bayes Classifier 朴素贝叶斯分类器 [3]
K-nearest Neighbors K 近邻法 [3]

模型不可知方法(Example-Based Explanations)

Partial Dependence Plots 部分依赖图 [1] [2] [3]

Individual Conditional Expectation (ICE) [2] [3]

Accumulated Local Effects (ALE) Plot [2] [3]

Feature Interaction [2] [3]

Permutation Importance 排列重要性 [1] [3]

Global Surrogate [3]

Local Surrogate (LIME) [2] [3]

Scoped Rules (Anchors) [2] [3]

Shapley Values [3]

SHAP Values SHapley Additive exPlanation 值 [1] [2] [3]

Advanced Uses of SHAP Values SHAP 值的高级应用 [1] [3]

Dimensional Reduction Techniques (PCA, t-SNE) 降维技术 [2]

Model Distillation 模型蒸馏 [2]

基于例子的解释(Example Based Explanations)

Counterfactual Explanations 反事实解释 [2] [3]

Adversarial Examples 对抗样本 [3]

Prototypes and Criticisms 原型和批评 [3]

Influential Instances 有影响力的实例 [3]

神经网络的解释(Neural Network Interpretation)

Learned Features 学成特征 [3]

参考

第一篇文章 https://towardsdatascience.com/interpretable-machine-learning-1dec0f2f3e6b

第二篇文章 https://towardsdatascience.com/guide-to-interpretable-machine-learning-d40e8a64b6cf

那本书 https://christophm.github.io/interpretable-ml-book/

刷 TDS(TowardDataScience.com) 和统计之都发现三个好东西：

从任何机器学习模型中抽取人类可理解的洞察（Extracting human understandable insights from any Machine Learning model，作者 Parul Pandey），文章介绍了最为常用的 Permutation Importance, Partial Dependence Plots, SHAP values，Advanced Uses of SHAP Values 四种方法；

可解释的机器学习指南（Guide to Interpretable Machine Learning，作者 Matthew Stewart, PhD Researcher），主要讲可视化，相对更加专业。

Christoph Molnar 的新书 Interpretable Machine Learning，该书籍对机器学习可解释性有较为系统的介绍。

© Reynard 2021 - 2024