Tag

Adversarial Testing

0 views collected around this technical thread.

Tencent Cloud Developer
Tencent Cloud Developer
Sep 23, 2020 · Artificial Intelligence

NLP Model Interpretability: White-box and Black-box Methods and Business Applications

The article reviews NLP interpretability techniques, contrasting white‑box approaches that probe model internals such as neuron analysis, diagnostic classifiers, and attention with black‑box strategies like rationales, adversarial testing, and local surrogates, and argues that black‑box methods are generally more practical for business deployment despite offering shallower insights.

Adversarial TestingBERTInterpretability
0 likes · 12 min read
NLP Model Interpretability: White-box and Black-box Methods and Business Applications