Multimodal Industrial Anomaly Detection by Crossmodal Feature Mapping

Accepted at CVPR2024







Multimodal Industrial Anomaly Detection by Crossmodal Feature Mapping
Alex Costanzino*, Pierluigi Zama Ramirez*, Giuseppe Lisanti, Luigi Di Stefano
*Equal Contribution

Recent advancements have shown the potential of leveraging both point clouds and images to localize anomalies. Nevertheless, their applicability in industrial manufacturing is often constrained by significant drawbacks, such as the use of memory banks, which leads to a substantial increase in terms of memory footprint and inference times. We propose a novel light and fast framework that learns to map features from one modality to the other on nominal samples and detect anomalies by pinpointing inconsistencies between observed and mapped features. Extensive experiments show that our approach achieves state-of-the-art detection and segmentation performance in both the standard and few-shot settings on the MVTec 3D-AD dataset while achieving faster inference and occupying less memory than previous multimodal AD methods. Furthermore, we propose a layer pruning technique to improve memory and time efficiency with a marginal sacrifice in performance.


    title = {Multimodal Industrial Anomaly Detection by Crossmodal Feature Mapping},
    author = {Costanzino, Alex and Zama Ramirez, Pierluigi and Lisanti, Giuseppe and Di Stefano, Luigi},
    booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
    note = {CVPR},
    year = {2024},

We gratefully acknowledge the support of SACMI Imola.


Alex Costanzino
PhD Student
University of Bologna
Pierluigi Zama Ramirez
Assistant Professor
University of Bologna
Giuseppe Lisanti
Associate Professor
University of Bologna
Luigi Di Stefano
Full Professor
University of Bologna