Beyond Patches: Mining Interpretable Part-Prototypes for Explainable AI

Mahdi Alehdaghi; Rajarshi Bhattacharya; Pourya Shamsolmoali; Rafael M.O. Cruz; Maguelonne Heritier; Eric Granger

doi:10.1609/aaai.v40i44.41052

Beyond Patches: Mining Interpretable Part-Prototypes for Explainable AI

Mahdi Alehdaghi
, Rajarshi Bhattacharya
, Pourya Shamsolmoali
, Rafael M.O. Cruz
, Maguelonne Heritier
, Eric Granger

École de technologie supérieure
University of York
Genetec Inc.

Research output: Contribution to Book/Report types › Contribution to conference proceedings › peer-review

Abstract

As AI systems grow more capable, it becomes increasingly important that their decisions remain understandable and aligned with human expectations. A key challenge is the limited interpretability of deep learning models. Post-hoc methods like GradCAM offer heatmaps but provide limited conceptual insight, while prototype-based approaches offer example-based explanations yet often rely on rigid region selection and lack semantic consistency. To address these limitations, we introduce PCMNet, a part-prototypical concept mining network that learns human-comprehensible prototypes from semantically meaningful image regions without additional supervision. By clustering these prototypes into coherent concept groups and extracting concept activation vectors, PCMNet provides structured, concept-level explanations and enhances robustness to occlusion and challenging conditions, which are both critical for building reliable and aligned AI systems. Experiments on multiple image classification benchmarks show that PCMNet outperforms state-of-the-art methods in interpretability, stability, and robustness. This work contributes to AI alignment by enhancing transparency, controllability, and trustworthiness in AI systems.

Original language	English
Title of host publication	Proceedings of the AAAI Conference on Artificial Intelligence
Editors	Sven Koenig, Chad Jenkins, Matthew E. Taylor
Publisher	Association for the Advancement of Artificial Intelligence
Pages	37213-37221
Number of pages	9
Edition	44
ISBN (Print)	9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067, 9781577359067
DOIs	https://doi.org/10.1609/aaai.v40i44.41052
Publication status	Published - 2026
Event	40th AAAI Conference on Artificial Intelligence, AAAI 2026 - Singapore, Singapore Duration: 20 Jan 2026 → 27 Jan 2026

Publication series

Name	Proceedings of the AAAI Conference on Artificial Intelligence
Number	44
Volume	40
ISSN (Print)	2159-5399
ISSN (Electronic)	2374-3468

Conference

Conference	40th AAAI Conference on Artificial Intelligence, AAAI 2026
Country/Territory	Singapore
City	Singapore
Period	20/01/26 → 27/01/26

Access to Document

10.1609/aaai.v40i44.41052

Cite this

Alehdaghi, M., Bhattacharya, R., Shamsolmoali, P., Cruz, R. M. O., Heritier, M., & Granger, E. (2026). Beyond Patches: Mining Interpretable Part-Prototypes for Explainable AI. In S. Koenig, C. Jenkins, & M. E. Taylor (Eds.), Proceedings of the AAAI Conference on Artificial Intelligence (44 ed., pp. 37213-37221). (Proceedings of the AAAI Conference on Artificial Intelligence; Vol. 40, No. 44). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v40i44.41052

Alehdaghi, Mahdi ; Bhattacharya, Rajarshi ; Shamsolmoali, Pourya et al. / Beyond Patches : Mining Interpretable Part-Prototypes for Explainable AI. Proceedings of the AAAI Conference on Artificial Intelligence. editor / Sven Koenig ; Chad Jenkins ; Matthew E. Taylor. 44. ed. Association for the Advancement of Artificial Intelligence, 2026. pp. 37213-37221 (Proceedings of the AAAI Conference on Artificial Intelligence; 44).

@inproceedings{24317965bc2b4ee7b7d70d83932d6b6c,

title = "Beyond Patches: Mining Interpretable Part-Prototypes for Explainable AI",

abstract = "As AI systems grow more capable, it becomes increasingly important that their decisions remain understandable and aligned with human expectations. A key challenge is the limited interpretability of deep learning models. Post-hoc methods like GradCAM offer heatmaps but provide limited conceptual insight, while prototype-based approaches offer example-based explanations yet often rely on rigid region selection and lack semantic consistency. To address these limitations, we introduce PCMNet, a part-prototypical concept mining network that learns human-comprehensible prototypes from semantically meaningful image regions without additional supervision. By clustering these prototypes into coherent concept groups and extracting concept activation vectors, PCMNet provides structured, concept-level explanations and enhances robustness to occlusion and challenging conditions, which are both critical for building reliable and aligned AI systems. Experiments on multiple image classification benchmarks show that PCMNet outperforms state-of-the-art methods in interpretability, stability, and robustness. This work contributes to AI alignment by enhancing transparency, controllability, and trustworthiness in AI systems.",

author = "Mahdi Alehdaghi and Rajarshi Bhattacharya and Pourya Shamsolmoali and Cruz, \{Rafael M.O.\} and Maguelonne Heritier and Eric Granger",

note = "Publisher Copyright: {\textcopyright} 2026, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.; 40th AAAI Conference on Artificial Intelligence, AAAI 2026 ; Conference date: 20-01-2026 Through 27-01-2026",

year = "2026",

doi = "10.1609/aaai.v40i44.41052",

language = "English",

isbn = "9781577359067",

series = "Proceedings of the AAAI Conference on Artificial Intelligence",

publisher = "Association for the Advancement of Artificial Intelligence",

number = "44",

pages = "37213--37221",

editor = "Sven Koenig and Chad Jenkins and Taylor, \{Matthew E.\}",

booktitle = "Proceedings of the AAAI Conference on Artificial Intelligence",

edition = "44",

}

Alehdaghi, M, Bhattacharya, R, Shamsolmoali, P, Cruz, RMO, Heritier, M & Granger, E 2026, Beyond Patches: Mining Interpretable Part-Prototypes for Explainable AI. in S Koenig, C Jenkins & ME Taylor (eds), Proceedings of the AAAI Conference on Artificial Intelligence. 44 edn, Proceedings of the AAAI Conference on Artificial Intelligence, no. 44, vol. 40, Association for the Advancement of Artificial Intelligence, pp. 37213-37221, 40th AAAI Conference on Artificial Intelligence, AAAI 2026, Singapore, Singapore, 20/01/26. https://doi.org/10.1609/aaai.v40i44.41052

Beyond Patches: Mining Interpretable Part-Prototypes for Explainable AI. / Alehdaghi, Mahdi; Bhattacharya, Rajarshi; Shamsolmoali, Pourya et al.
Proceedings of the AAAI Conference on Artificial Intelligence. ed. / Sven Koenig; Chad Jenkins; Matthew E. Taylor. 44. ed. Association for the Advancement of Artificial Intelligence, 2026. p. 37213-37221 (Proceedings of the AAAI Conference on Artificial Intelligence; Vol. 40, No. 44).

Research output: Contribution to Book/Report types › Contribution to conference proceedings › peer-review

TY - GEN

T1 - Beyond Patches

T2 - 40th AAAI Conference on Artificial Intelligence, AAAI 2026

AU - Alehdaghi, Mahdi

AU - Bhattacharya, Rajarshi

AU - Shamsolmoali, Pourya

AU - Cruz, Rafael M.O.

AU - Heritier, Maguelonne

AU - Granger, Eric

PY - 2026

Y1 - 2026

N2 - As AI systems grow more capable, it becomes increasingly important that their decisions remain understandable and aligned with human expectations. A key challenge is the limited interpretability of deep learning models. Post-hoc methods like GradCAM offer heatmaps but provide limited conceptual insight, while prototype-based approaches offer example-based explanations yet often rely on rigid region selection and lack semantic consistency. To address these limitations, we introduce PCMNet, a part-prototypical concept mining network that learns human-comprehensible prototypes from semantically meaningful image regions without additional supervision. By clustering these prototypes into coherent concept groups and extracting concept activation vectors, PCMNet provides structured, concept-level explanations and enhances robustness to occlusion and challenging conditions, which are both critical for building reliable and aligned AI systems. Experiments on multiple image classification benchmarks show that PCMNet outperforms state-of-the-art methods in interpretability, stability, and robustness. This work contributes to AI alignment by enhancing transparency, controllability, and trustworthiness in AI systems.

AB - As AI systems grow more capable, it becomes increasingly important that their decisions remain understandable and aligned with human expectations. A key challenge is the limited interpretability of deep learning models. Post-hoc methods like GradCAM offer heatmaps but provide limited conceptual insight, while prototype-based approaches offer example-based explanations yet often rely on rigid region selection and lack semantic consistency. To address these limitations, we introduce PCMNet, a part-prototypical concept mining network that learns human-comprehensible prototypes from semantically meaningful image regions without additional supervision. By clustering these prototypes into coherent concept groups and extracting concept activation vectors, PCMNet provides structured, concept-level explanations and enhances robustness to occlusion and challenging conditions, which are both critical for building reliable and aligned AI systems. Experiments on multiple image classification benchmarks show that PCMNet outperforms state-of-the-art methods in interpretability, stability, and robustness. This work contributes to AI alignment by enhancing transparency, controllability, and trustworthiness in AI systems.

UR - https://www.scopus.com/pages/publications/105034975515

U2 - 10.1609/aaai.v40i44.41052

DO - 10.1609/aaai.v40i44.41052

M3 - Contribution to conference proceedings

AN - SCOPUS:105034975515

SN - 9781577359067

T3 - Proceedings of the AAAI Conference on Artificial Intelligence

SP - 37213

EP - 37221

BT - Proceedings of the AAAI Conference on Artificial Intelligence

A2 - Koenig, Sven

A2 - Jenkins, Chad

A2 - Taylor, Matthew E.

PB - Association for the Advancement of Artificial Intelligence

Y2 - 20 January 2026 through 27 January 2026

ER -

Alehdaghi M, Bhattacharya R, Shamsolmoali P, Cruz RMO, Heritier M, Granger E. Beyond Patches: Mining Interpretable Part-Prototypes for Explainable AI. In Koenig S, Jenkins C, Taylor ME, editors, Proceedings of the AAAI Conference on Artificial Intelligence. 44 ed. Association for the Advancement of Artificial Intelligence. 2026. p. 37213-37221. (Proceedings of the AAAI Conference on Artificial Intelligence; 44). doi: 10.1609/aaai.v40i44.41052

Beyond Patches: Mining Interpretable Part-Prototypes for Explainable AI

Abstract

Publication series

Conference

Access to Document

Other files and links

Fingerprint

Cite this