Intelligent Fusion of Multi-Modal Medical Imaging: A Comprehensive Review of Methods, Challenges, and Clinical Integration

  • Majda Maatallah Laboratory of Computer Science and Applied Mathematics, Dept. of Computer Science, Faculty of Science and Technology, Chadli Bendjedid, University, El-Tarf, Algeria https://orcid.org/0000-0002-3445-2940
  • Abdelmadjid Benmachiche Laboratory of Computer Science and Applied Mathematics, Dept. of Computer Science, Faculty of Science and Technology, Chadli Bendjedid, University, El-Tarf, Algeria. https://orcid.org/0000-0002-0690-2625
  • Khadija Rais Laboratory of Mathematics, Informatics and Systems (LAMIS), Echahid Cheikh Larbi Tebessi University, Tebessa, Algeria https://orcid.org/0009-0004-3907-7782
  • Salma Touam Laboratory of Physical-Chemistry of Materials, Dept. of Physics, Faculty of Science and Technology, Chadli Bendjedid University, El Tarf, Algeria. https://orcid.org/0000-0002-2513-7703
Keywords: Multimodal Medical Image Fusion (MMIF), Deep Learning, Feature-Level Fusion, Pixel-Level Fusion, Decision-Level Fusion, Medical Imaging

Abstract

Multimodal Medical Imaging Fusion (MMIF) is defined as the incorporation of information from multiple imaging modalities in a way that is mutually supplementary, thereby addressing limitations associated with using a single imaging modality to evaluate a patient and increasing diagnostic accuracy. Further, this review provides a dedicated synthesis of deep learning architectures in MMIF, examining CNN-based hybrids, attention-enhanced transformers, GAN-driven unsupervised fusion, and emerging diffusion models. The state of the art in MMIF can be classified into three levels of fusion: (1) pixel level, fusion of raw pixel intensity values to preserve spatial detail; (2) feature level, features are derived from textures, edges, and region-of-interest (ROI) descriptors; (3) decision level, fusing independent outputs of each source using ensemble or rule-based methods to produce a single, integrated output from all sources, potentially improving interpretability of the integrated output. The use of AI algorithms improves fusion outcomes by yielding higher-quality results. However, clinicians' confidence in deep-learning-based models is limited due to their inability to generalise across multiple scanners, protocols, and medical systems. This analysis demonstrates that clinical AI systems must be developed with interpretability as a core attribute, to provide an explanation of how each modality is contributing to the final decision, and to establish a fusion policy that preserves the ability to make accurate diagnostic determinations based on fused images. In addition to developing more sophisticated algorithms, future developments in MMIF will require collaborative partnerships between developers and clinicians to develop fused images into reliable diagnostic tools to be used in precision medicine.

Downloads

Download data is not yet available.

References

I. Soualmia, S. Maalem, A. Benmachiche, K. Rais, and M. Derdour, “Comparative Survey of AI-Driven Credit Card Fraud Detection: Machine Learning, Deep Learning and Hybrid Systems,” in 2025 International Conference on Networking and Advanced Systems (ICNAS), Oct. 2025, pp. 1–9. doi: 10.1109/ICNAS68168.2025.11298125.

S. O. Boufaida, A. Benmachiche, M. Maatallah, and C. Chemam, “An Extensive Examination of Varied Approaches in E-Learning and MOOC Research: A Thorough Overview,” in 2024 6th International Conference on Pattern Analysis and Intelligent Systems (PAIS), Apr. 2024, pp. 1–8. doi: 10.1109/PAIS62114.2024.10541129.

A. Benmachiche, A. Sahia, S. O. Boufaida, K. Rais, M. Derdour, and F. Maazouzi, “Enhancing learning recommendations in mooc search engines through named entity recognition,” Educ. Inf. Technol., vol. 30, no. 9, pp. 13041–13071, Jun. 2025, doi: 10.1007/s10639-024-13308-4.

M. Zubair, M. Hussain, M. A. Albashrawi, M. Bendechache, and M. Owais, “A comprehensive review of techniques, algorithms, advancements, challenges, and clinical applications of multi-modal medical image fusion for improved diagnosis,” Comput. Methods Programs Biomed., vol. 272, p. 109014, Dec. 2025, doi: 10.1016/j.cmpb.2025.109014.

N. Goswami, A. Dogra, S. Bakshi, and B. Goyal, “Multimodal medical image fusion: techniques, databases, evaluation metrics, and clinical applications—A comprehensive review,” The Open Neuroimaging Journal, vol. 18, no. 1, 2025, doi: 10.2174/0118744400417835251022042920.

B. K. Sedraoui, A. Benmachiche, A. Makhlouf, and C. Chemam, “Intrusion Detection with deep learning: A literature review,” in 2024 6th International Conference on Pattern Analysis and Intelligent Systems (PAIS), Apr. 2024, pp. 1–8. doi: 10.1109/PAIS62114.2024.10541191.

B. K. Sedraoui, A. Benmachiche, A. Makhlouf, D. Abbas, and M. Derdour, “Cybersecurity in E-Learning: A Literature Review on Phishing Detection Using ML and DL Techniques,” in 2025 International Conference on Networking and Advanced Systems (ICNAS), Oct. 2025, pp. 1–10. doi: 10.1109/ICNAS68168.2025.11298114.

W. Tan, P. Tiwari, H. M. Pandey, C. Moreira, and A. K. Jaiswal, “Multimodal medical image fusion algorithm in the era of big data,” Neural Comput. Appl., vol. 37, no. 28, pp. 22995–23015, 2025, doi: 10.1007/s00521-020-05173-2.

F. Zhao, C. Zhang, and B. Geng, “Deep Multimodal Data Fusion,” ACM Comput. Surv., vol. 56, no. 9, pp. 1–36, Oct. 2024, doi: 10.1145/3649447.

Y. Li et al., “A review of deep learning-based information fusion techniques for multimodal medical image classification,” Comput. Biol. Med., vol. 177, p. 108635, Jul. 2024, doi: 10.1016/j.compbiomed.2024.108635.

V. A. Barola, P. Singh, and M. Diwakar, “A Recent Survey on Multi-modal Medical Image Fusion,” Biomed. Inform. Smart Healthc., vol. 1, no. 3, pp. 89–97, 2025, doi: 10.62762/BISH.2025.414869.

G. Mirzaei, A. Gupta, and H. Adeli, “Data fusion of medical imaging in neurological disorders,” Rev. Neurosci., vol. 37, no. 1, pp. 43–60, 2026, doi: 10.1515/revneuro-2025-0062.

M. A. Saleh, A. A. Ali, K. Ahmed, and A. M. Sarhan, “A brief analysis of multimodal medical image fusion techniques,” Electronics, vol. 12, no. 1, p. 97, 2022, doi: 10.3390/electronics12010097.

M. Haribabu, V. Guruviah, and P. Yogarajah, “Recent advancements in multimodal medical image fusion techniques for better diagnosis: an overview,” Curr. Med. Imaging Rev., vol. 19, no. 7, pp. 673–694, 2023, doi: 10.2174/1573405618666220606161137.

M. Diwakar, P. Singh, V. Ravi, and A. Maurya, “A non-conventional review on multi-modality-based medical image fusion,” Diagnostics, vol. 13, no. 5, p. 820, 2023, doi: 10.3390/diagnostics13050820.

J. Sui, D. Zhi, and V. D. Calhoun, “Data-driven multimodal fusion: approaches and applications in psychiatric research,” Psychoradiology, vol. 3, p. kkad026, 2023, doi: 10.1093/psyrad/kkad026.

S. Ullah Khan, M. Ahmad Khan, M. Azhar, F. Khan, Y. Lee, and M. Javed, “Multimodal medical image fusion towards future research: A review,” J. King Saud Univ. - Comput. Inf. Sci., vol. 35, no. 8, p. 101733, Sep. 2023, doi: 10.1016/j.jksuci.2023.101733.

S. Steyaert et al., “Multimodal data fusion for cancer biomarker discovery with deep learning,” Nat. Mach. Intell., vol. 5, no. 4, pp. 351–362, Apr. 2023, doi: 10.1038/s42256-023-00633-5.

S. Kalamkar and G. M. A., “Multimodal image fusion: A systematic review,” Decis. Anal. J., vol. 9, p. 100327, Dec. 2023, doi: 10.1016/j.dajour.2023.100327.

S. Bhosekar, P. Singh, D. Garg, V. Ravi, and M. Diwakar, “A review of deep learning-based multi-modal medical image fusion,” The Open Bioinformatics Journal, vol. 18, no. 1, 2025, doi: 10.2174/0118750362370697250630063814.

T. M. Hayat and S. Madhavi D., “A Comprehensive Analysis of Medical Image Fusion Techniques: A Detailed Review:,” in Proceedings of the 1st International Conference on Artificial Intelligence for Internet of Things: Accelerating Innovation in Industry and Consumer Electronics, Virtual, India: SCITEPRESS - Science and Technology Publications, 2023, pp. 147–152. doi: 10.5220/0012603200003739.

M. A. Azam et al., “A review on multimodal medical image fusion: Compendious analysis of medical modalities, multimodal databases, fusion techniques and quality metrics,” Comput. Biol. Med., vol. 144, p. 105253, May 2022, doi: 10.1016/j.compbiomed.2022.105253.

B. Huang, F. Yang, M. Yin, X. Mo, and C. Zhong, “A Review of Multimodal Medical Image Fusion Techniques,” Comput. Math. Methods Med., vol. 2020, p. 8279342, Apr. 2020, doi: 10.1155/2020/8279342.

S. Liu, M. Wang, L. Yin, X. Sun, Y.-D. Zhang, and J. Zhao, “Two-Scale Multimodal Medical Image Fusion Based on Structure Preservation,” Front. Comput. Neurosci., vol. 15, Jan. 2022, doi: 10.3389/fncom.2021.803724.

M. M. Almasri and A. M. Alajlan, “Artificial Intelligence-Based Multimodal Medical Image Fusion Using Hybrid S2 Optimal CNN,” Electronics, vol. 11, no. 14, p. 2124, Jan. 2022, doi: 10.3390/electronics11142124.

B. K. Sedraoui, A. Benmachiche, A. Makhlouf, K. Rais, and C. Chemam, “CNN-OOA-Based Cyber Threat Detection: Protecting E-Learning from Phishing,” Arab. J. Sci. Eng., Apr. 2026, doi: 10.1007/s13369-026-11122-3.

S. O. Boufaida, A. Benmachiche, A. Bennour, M. Maatallah, M. Derdour, and F. Ghabban, “Enhancing MOOC Course Classification with Convolutional Neural Networks via Lion Algorithm-Based Hyperparameter Tuning,” SN Comput. Sci., vol. 6, no. 6, p. 707, Jul. 2025, doi: 10.1007/s42979-025-04179-8.

W. Kong, C. Li, and Y. Lei, “Multimodal medical image fusion using convolutional neural network and extreme learning machine,” Front. Neurorobotics, vol. 16, p. 1050981, 2022, doi: 10.3389/fnbot.2022.1050981.

K. Vanitha, D. Satyanarayana, and M. N. G. Prasad, “Multi-modal Medical Image Fusion Algorithm Based on Spatial Frequency Motivated PA-PCNN in the NSST Domain,” Curr. Med. Imaging, vol. 17, no. 5, pp. 634–643, May 2021, doi: 10.2174/1573405616666201118123220.

S. Goyal, V. Singh, A. Rani, and N. Yadav, “Multimodal image fusion and denoising in NSCT domain using CNN and FOTGV,” Biomed. Signal Process. Control, vol. 71, p. 103214, 2022, doi: 10.1016/j.bspc.2021.103214.

K. Vanitha, D. Satyanarayana, and M. N. Giri Prasad, “Medical Image Fusion Based on Energy Attribute and PA-PCNN in NSST Domain,” in Soft Computing and Signal Processing, V. S. Reddy, V. K. Prasad, J. Wang, and K. T. V. Reddy, Eds., Singapore: Springer Nature, 2022, pp. 457–467. doi: 10.1007/978-981-16-7088-6_42.

P. Arora, R. Mehta, and P. K. Soni, “Multimodal medical image analysis using deep learning registration and LWT-SVD fusion,” Discov. Comput., vol. 29, no. 1, p. 4, 2026, doi: 10.1007/s10791-025-09857-y.

Q. Zuo, J. Zhang, and Y. Yang, “DMC-Fusion: Deep Multi-Cascade Fusion With Classifier-Based Feature Synthesis for Medical Multi-Modal Images,” IEEE J. Biomed. Health Inform., vol. 25, no. 9, pp. 3438–3449, Sep. 2021, doi: 10.1109/JBHI.2021.3083752.

A. S. Nisha and T. S. Siva Rani, “Novel hybrid CNN with Bi-LSTM multi-focus image fusion method based on modified tetrolet transform in MRI and CT images,” J. Intell. Fuzzy Syst., vol. 45, no. 4, pp. 6767–6783, Oct. 2023, doi: 10.3233/JIFS-224439.

D. K. Chaudhary, P. Singh, and A. Shankar, “RGF-DnCNN-GMM: Multi-Modal Medical Image Fusion Using Rolling Guidance Filtering, CNN Denoising, and Gradient-Based Adaptive Fusion,” in 2025 5th International Conference on Internet of Things: Smart Innovation and Usages (IoT-SIU), Nov. 2025, pp. 1–5. doi: 10.1109/IOT-SIU65919.2025.11402756.

V. S. Parvathy, S. Pothiraj, and J. Sampson, “Hyperparameter Optimization of Deep Neural Network in Multimodality Fused Medical Image Classification for Medical and Industrial IoT,” in Smart Sensors for Industrial Internet of Things: Challenges, Solutions and Applications, D. Gupta, V. Hugo C. de Albuquerque, A. Khanna, and P. L. Mehta, Eds., Cham: Springer International Publishing, 2021, pp. 127–146. doi: 10.1007/978-3-030-52624-5_9.

A. Benmachiche and A. Makhlouf, “Optimization of Hidden Markov Model with Gaussian Mixture Densities for Arabic Speech Recognition,” WSEAS Trans. Circuits Syst., vol. 15, pp. 85–95, Apr. 2019.

A. Benmachiche, A. Makhlouf, and T. Bouhadada, “Optimization learning of hidden Markov model using the bacterial foraging optimization algorithm for speech recognition,” Int. J. Knowl.-Based Intell. Eng. Syst., vol. 23, pp. 171–181, Oct. 2020, doi: 10.3233/KES-200039.

A. Benmachiche, B. Tahar, L. M. Tayeb, and Z. Asma, “A dynamic navigation for autonomous mobiles robots,” Intell. Decis. Technol., vol. 10, no. 1, pp. 81–91, Feb. 2016, doi: 10.3233/IDT-150239.

A. Benmachiche, A. A. Betouil, I. Boutabia, A. Nouari, K. Boumahni, and H. Bouzata, “A Fuzzy Navigation Approach Using the Intelligent Lights Algorithm for an Autonomous Mobile Robot,” in 12th International Conference on Information Systems and Advanced Technologies “ICISAT 2022,” M. R. Laouar, V. E. Balas, B. Lejdel, S. Eom, and M. A. Boudia, Eds., Cham: Springer International Publishing, 2023, pp. 112–121. doi: 10.1007/978-3-031-25344-7_11.

A. Mellouk and A. Benmachiche, “A survey on Navigation Systems in Dynamic Environments,” in Proceedings of the 10th International Conference on Information Systems and Technologies, in ICIST ’20. New York, NY, USA: Association for Computing Machinery, Mar. 2021, pp. 1–7. doi: 10.1145/3447568.3448527.

P.-H. Dinh, “A novel approach based on Three-scale image decomposition and Marine predators algorithm for multi-modal medical image fusion,” Biomed. Signal Process. Control, vol. 67, p. 102536, May 2021, doi: 10.1016/j.bspc.2021.102536.

S. O. Boufaida, A. Benmachiche, M. Derdour, M. Maatallah, M. S. Kahil, and M. C. Ghanem, “TSA-GRU: A Novel Hybrid Deep Learning Module for Learner Behavior Analytics in MOOCs,” Future Internet, vol. 17, no. 8, Aug. 2025, doi: 10.3390/fi17080355.

I. Boutabia, A. Benmachiche, A. Bennour, A. A. Betouil, M. Derdour, and F. Ghabban, “Hybrid CNN-ViT Model for Student Engagement Detection in Open Classroom Environments,” SN Comput. Sci., vol. 6, no. 6, p. 684, Jul. 2025, doi: 10.1007/s42979-025-04228-2.

B. K. Sedraoui, A. Benmachiche, A. Bennour, A. Makhlouf, M. Derdour, and F. Ghabban, “LSTM-SWAP: A Hybrid Deep Learning Model for Cheating Detection,” SN Comput. Sci., vol. 6, no. 7, p. 798, Sep. 2025, doi: 10.1007/s42979-025-04334-1.

P. Peng and Y. Luo, “Multimodal Medical Image Fusion Using a Progressive Parallel Strategy Based on Deep Learning,” Electronics, vol. 14, no. 11, p. 2266, Jan. 2025, doi: 10.3390/electronics14112266.

S. Yu, M. He, R. Nie, C. Wang, and X. Wang, An unsupervised hybrid model based on CNN and ViT for multimodal medical image fusion. 2021, p. 240. doi: 10.1109/CECIT53797.2021.00048.

W. Li, Y. Zhang, G. Wang, Y. Huang, and R. Li, “DFENet: A dual-branch feature enhanced network integrating transformers and convolutional feature learning for multimodal medical image fusion,” Biomed. Signal Process. Control, vol. 80, p. 104402, 2023, doi: 10.1016/j.bspc.2022.104402.

X. Xie et al., “Mrscfusion: Joint residual swin transformer and multiscale cnn for unsupervised multimodal medical image fusion,” IEEE Trans. Instrum. Meas., vol. 72, pp. 1–17, 2023, doi: 10.1109/TIM.2023.3317470.

B. Zou et al., “FocalNetFuse: enhancing multimodal image fusion quality with focal modulation networks: FocalNetFuse: enhancing multimodal image fusion quality…B. Zou et al.,” Vis. Comput., vol. 42, Jan. 2026, doi: 10.1007/s00371-025-04206-y.

Z. Ding, H. Li, Y. Guo, D. Zhou, Y. Liu, and S. Xie, “M4FNet: Multimodal medical image fusion network via multi-receptive-field and multi-scale feature integration,” Comput. Biol. Med., vol. 159, p. 106923, Jun. 2023, doi: 10.1016/j.compbiomed.2023.106923.

J. Di, W. Guo, J. Liu, L. Ren, and J. Lian, “AMMNet: A multimodal medical image fusion method based on an attention mechanism and MobileNetV3,” Biomed. Signal Process. Control, vol. 96, p. 106561, Oct. 2024, doi: 10.1016/j.bspc.2024.106561.

Y. Zhou, X. Yang, S. Liu, and J. Yin, “Multimodal Medical Image Fusion Network Based on Target Information Enhancement,” IEEE Access, vol. 12, pp. 70851–70869, 2024, doi: 10.1109/ACCESS.2024.3402965.

X. Sun, H. Liu, G. Chen, Y. Sheng, and C. Zhang, “Multimodal Medical Image Fusion via Manifold Structure Modeling and Information Geometry Enhancement,” IEEE Trans. Circuits Syst. Video Technol., 2026, doi: 10.1109/TCSVT.2026.3661187.

W. Li, P. Jia, D. He, S. Liu, G. Wang, and Y. Huang, “SAFusion: Scenario-Adaptive Network for Multimodal Medical Image Fusion,” IEEE J. Biomed. Health Inform., 2026, doi: 10.1109/JBHI.2026.3651957.

A. A. Kamara, S. He, and A. J. Fofanah, “FAMAFuse: Functional-Anatomical Multiscale Attention for Multimodal Image Fusion,” IEEE Trans. Circuits Syst. Video Technol., 2025, doi: 10.1109/TCSVT.2025.3626562.

J. Dhar et al., “Multimodal Fusion Learning with Dual Attention for Medical Imaging,” in 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Feb. 2025, pp. 4362–4371. doi: 10.1109/WACV61041.2025.00428.

M. Safari, A. Fatemi, and L. Archambault, “MedFusionGAN: multimodal medical image fusion using an unsupervised deep generative adversarial network,” BMC Med. Imaging, vol. 23, no. 1, p. 203, 2023, doi: 10.1186/s12880-023-01160-w.

C. Fan, H. Lin, and Y. Qiu, “U-Patch GAN: A Medical Image Fusion Method Based on GAN,” J. Digit. Imaging, vol. 36, no. 1, pp. 339–355, Feb. 2023, doi: 10.1007/s10278-022-00696-7.

N. Anita, M. R. Devi, R. A. M. Rose, and J. S. J. Lijha, “MIMO-TGAN: Multi-Modality Medical Image Fusion via Triple Generator Network for Brain Abnormality Detection,” Int. J. Comput. Intell. Syst., Mar. 2026, doi: 10.1007/s44196-026-01189-z.

C. Sui et al., “IG-GAN: Interactive Guided Generative Adversarial Networks for Multimodal Image Fusion,” IEEE Trans. Geosci. Remote Sens., vol. 62, pp. 1–19, 2024, doi: 10.1109/TGRS.2024.3433619.

K. Guo, X. Hu, and X. Li, “MMFGAN: A novel multimodal brain medical image fusion based on the improvement of generative adversarial network,” Multimed. Tools Appl., vol. 81, no. 4, pp. 5889–5927, Feb. 2022, doi: 10.1007/s11042-021-11822-y.

Y. Zhou, K. He, D. Xu, J. Gong, and W. Mei, “DIFusion: Multimodal medical image fusion based on detail preservation and invertible neural networks,” Biomed. Signal Process. Control, vol. 115, p. 109415, 2026, doi: 10.1016/j.bspc.2025.109415.

G. C. Kumar, K. M, J. S, and N. S, “Structured constraints based Deep guided Generative adversarial network(GAN) for deformable multimodal medical image fusion(MMIF) and enhancement,” in 2025 2nd International Conference on New Frontiers in Communication, Automation, Management and Security (ICCAMS), Jul. 2025, pp. 1–5. doi: 10.1109/ICCAMS65118.2025.11234098.

Z. Zhao et al., “DDFM: Denoising Diffusion Model for Multi-Modality Image Fusion,” presented at the Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023, pp. 8082–8093. Accessed: Mar. 26, 2026. [Online]. Available: https://openaccess.thecvf.com/content/ICCV2023/html/Zhao_DDFM_Denoising_Diffusion_Model_for_Multi-Modality_Image_Fusion_ICCV_2023_paper.html

J. Wang, M. Liu, W. Shen, R. Ding, Y. Wang, and E. Meijering, “EPDiff: Erasure Perception Diffusion Model for Unsupervised Anomaly Detection in Preoperative Multimodal Images,” IEEE Trans. Med. Imaging, vol. 45, no. 1, pp. 379–390, Jan. 2026, doi: 10.1109/TMI.2025.3597545.

L. Han, “AD-Diff: enhancing Alzheimer’s disease prediction accuracy through multimodal fusion,” Front. Comput. Neurosci., vol. 19, Mar. 2025, doi: 10.3389/fncom.2025.1484540.

A. Snani, M. Khadir, A. Pranolo, and M. Abdalla, “GAN-Enhanced multimodal fusion and ensemble learning for imbalanced chest X-Ray classification,” Int. J. Adv. Intell. Inform., vol. 11, p. 514, Aug. 2025, doi: 10.26555/ijain.v11i3.2092.

T. Zhou, L. Liu, H. Lu, M. Zhang, and Z. Zhang, “MAC-GAN: Medical advice condition GAN for multimodal lung tumor image fusion,” Biomed. Signal Process. Control, vol. 119, p. 109981, Jun. 2026, doi: 10.1016/j.bspc.2026.109981.

Y. Kang et al., “Deep learning-based multimodal fusion of MRI and whole slide image for predicting neoadjuvant therapy response in locally advanced head and neck squamous cell carcinoma,” BMC Med. Imaging, 2026, doi: 10.1186/s12880-026-02173-x.

Y. Akbari, F. Abdullakutty, S. Al-Maadeed, R. Al Saady, A. Bouridane, and R. Hamoudi, “A novel virtual patient approach for cross-patient multimodal fusion in enhanced breast cancer detection,” Comput. Med. Imaging Graph., vol. 127, p. 102687, 2026, doi: 10.1016/j.compmedimag.2025.102687.

D. Duenias, B. Nichyporuk, T. Arbel, T. R. Raviv, and others, “Hyperfusion: A hypernetwork approach to multimodal integration of tabular and medical imaging data for predictive modeling,” Med. Image Anal., vol. 102, p. 103503, 2025, doi: 10.1016/j.media.2025.103503.

C. Yu, J. Ye, Y. Liu, X. Zhang, and Z. Zhang, “AMF-MedIT: An efficient align-modulation-fusion framework for medical image–tabular data,” Biomed. Signal Process. Control, vol. 118, p. 109772, 2026, doi: 10.1016/j.bspc.2026.109772.

B. Zeng et al., “C2HFusion: Clinical context-driven hierarchical fusion of multimodal data for personalized and quantitative prognostic assessment in pancreatic cancer,” Med. Image Anal., p. 103937, 2026, doi: 10.1016/j.media.2026.103937.

Q. Ye, M. Luo, J. Zhou, C. Cheng, L. Peng, and J. Wu, “NMD-FusionNet: a multimodal fusion-based medical imaging-assisted diagnostic model for liver cancer,” J. King Saud Univ. Comput. Inf. Sci., vol. 37, no. 6, p. 147, 2025, doi: 10.1007/s44443-025-00162-8.

H. Xiang, H. Zhang, Y. Cheng, X. Quan, and W. Huang, “SMFusion: Semantic-Preserving Fusion of Multimodal Medical Images for Enhanced Clinical Diagnosis,” IEEE J. Biomed. Health Inform., 2025, doi: 10.1109/JBHI.2025.3649749.

G. Dai et al., “Prompt-Level Contrastive Learning for Context-Aware Multi-modal Image Representation in Medical Diagnosis,” Pattern Recognit., p. 113027, 2026, doi: 10.1016/j.patcog.2025.113027.

R. S. Rao, A. Mishra, and S. Swain, “CAMF-SkinNet: Cross-Attention Multimodal Fusion of Visual, Textual, and Dermatology-Specific Embeddings for Skin Disease Classification,” in International Conference on Distributed Computing and Intelligent Technology, Springer, 2026, pp. 425–435. doi: 10.1007/978-3-032-16632-6_27.

W. Guo, L. Wang, J. Zeng, Q. Han, K. Jin, and X. Wang, “SMAFusion: Multimodal medical image fusion based on spatial registration and local-global multi-scale feature adaptive fusion,” Neurocomputing, p. 131039, 2025, doi: 10.1016/j.neucom.2025.131039.

M. A. Azam, K. B. Khan, M. Ahmad, and M. Mazzara, “Multimodal Medical Image Registration and Fusion for Quality Enhancement,” Comput. Mater. Contin., vol. 68, pp. 821–840, Feb. 2021, doi: 10.32604/cmc.2021.016131.

Y. Wu, J. Chen, L. Hu, H. Xu, H. Liang, and J. Wu, “OmniFuse: A general modality fusion framework for multi-modality learning on low-quality medical data,” Inf. Fusion, vol. 117, p. 102890, 2025, doi: 10.1016/j.inffus.2024.102890.

D. M. Pathak et al., “Optimal feature selection for medical image fusion using deep learning with transformer,” Biomed. Signal Process. Control, vol. 111, p. 108377, 2026, doi: 10.1016/j.bspc.2025.108377.

S. Sangeetha et al., “An enhanced multimodal fusion deep learning neural network for lung cancer classification,” Syst. Soft Comput., vol. 6, p. 200068, 2024, doi: 10.1016/j.sasc.2023.200068.

J. Cheng, F. Liu, and S. Wei, “Multimodal fusion network with multi-scale structure and metabolic focus for enhancing Alzheimer’s disease prediction,” Appl. Intell., vol. 56, no. 2, p. 66, 2026, doi: 10.1007/s10489-026-07105-4.

J. Xu, S. Zhuang, Y. He, H. Wang, Z. Zhuang, and H. Zeng, “Multimodal Sparse Fusion Transformer Network with Spatio-Temporal Decoupling for Breast Tumor Classification,” Med. Image Anal., p. 103966, 2026, doi: 10.1016/j.media.2026.103966.

J. Huang et al., “UltraMamba: Mamba-based Multimodal Ultrasound Image Adaptive Fusion for Breast Lesion Segmentation,” IEEE Trans. Med. Imaging, 2026, doi: 10.1109/TMI.2026.3653779.

L. Li et al., “SymUnet-DynCFC: Multimodal MRI Fusion for Robust Cartilage Segmentation and Clinically Confirmed Moderate-to-Severe KOA Diagnosis,” Inf. Fusion, p. 104145, 2026, doi: 10.1016/j.inffus.2026.104145.

L. Dong et al., “AI-based prediction of best-corrected visual acuity in patients with multiple retinal diseases using multimodal medical imaging,” Br. J. Ophthalmol., vol. 110, no. 2, pp. 158–165, 2026, doi: 10.1136/bjo-2025-327189s.

J. Chen and J. Chen, “Multimodal image feature fusion for improving medical ultrasound image segmentation,” Biomed. Signal Process. Control, vol. 89, p. 105705, 2024, doi: 10.1016/j.bspc.2023.105705.

Q. Lu, L. Zheng, J. Su, W. Ma, H. Ma, and Y. Zhang, “MCAB-GFEResNet: A multimodal fusion model for pre-treatment prediction of neoadjuvant chemoradiotherapy response in rectal cancer,” Biomed. Signal Process. Control, vol. 117, p. 109672, 2026, doi: 10.1016/j.bspc.2026.109672.

P. Ravikumaran, K. Vimala Devi, and K. Valarmathi, “An Improved Kidney Tumor Prediction Using Deep Convolutional Neural Network-Restricted Boltzmann Machine Technique in Medical Image Segmentation,” J. Med. Imaging Health Inform., vol. 11, no. 12, pp. 3191–3198, Dec. 2021, doi: 10.1166/jmihi.2021.3917.

H. Lu, M. Yu, X. Wei, X. Xu, and J. Xu, “Unbiased Multimodal Fusion for Medical Image Segmentation Based on Dual-Stream Adapter,” Knowl.-Based Syst., p. 114653, 2025, doi: 10.1016/j.knosys.2025.114653.

M. Wang et al., “Task-generalized adaptive cross-domain learning for multimodal image fusion,” IEEE Trans. Multimed., 2026, doi: 10.1109/TMM.2026.3660142.

G. Zhang, R. Nie, J. Cao, L. Chen, and Y. Zhu, “FDGNet: A pair feature difference guided network for multimodal medical image fusion,” Biomed. Signal Process. Control, vol. 81, p. 104545, Mar. 2023, doi: 10.1016/j.bspc.2022.104545.

S. O. Boufaida, A. Benmachiche, and M. Maatallah, “Real-Time Image Processing Algorithms for Embedded Systems,” Jan. 09, 2026, arXiv: arXiv:2601.06243. doi: 10.48550/arXiv.2601.06243.

A. Bouamrane, M. Derdour, A. Bennour, A. Benmachiche, and M. Gasmi, “Machine Learning for Medical Image Analysis,” in AI for Medical Image Analysis: Reconciling Innovation and Ethical Considerations, N. Ben Aoun, S. Ahmad, and M. Hammad, Eds., Cham: Springer Nature Switzerland, 2026, pp. 97–125. doi: 10.1007/978-3-032-02963-8_4.

T. B. Nguyen-Tat, T. Q. Hung, P. T. Nam, and V. M. Ngo, “Evaluating pre-processing and deep learning methods in medical imaging: Combined effectiveness across multiple modalities,” Alex. Eng. J., vol. 119, pp. 558–586, Apr. 2025, doi: 10.1016/j.aej.2025.01.090.

L. He, “Non-rigid Multi-Modal Medical Image Registration Based on Improved Maximum Mutual Information PV Image Interpolation Method,” Front. Public Health, vol. 10, Jun. 2022, doi: 10.3389/fpubh.2022.863307.

A. Khorasani, N. Dadashi serej, M. Jalilian, A. Shayganfar, and M. B. Tavakoli, “Performance comparison of different medical image fusion algorithms for clinical glioma grade classification with advanced magnetic resonance imaging (MRI),” Sci. Rep., vol. 13, no. 1, p. 17646, Oct. 2023, doi: 10.1038/s41598-023-43874-5.

T. Tirupal, B. Mohan, and S. Kumar, “Multimodal Medical Image Fusion Techniques – A Review,” Curr. Signal Transduct. Ther., vol. 15, Feb. 2020, doi: 10.2174/1574362415666200226103116.

J. Chen, L. Chen, and M. Shabaz, “Image Fusion Algorithm at Pixel Level Based on Edge Detection,” J. Healthc. Eng., vol. 2021, p. 5760660, Aug. 2021, doi: 10.1155/2021/5760660.

K. P. Indira, R. Rani Hemamalini, and R. Indhumathi, “Pixel based Medical Image Fusion Techniques using Discrete Wavelet Transform and Stationary Wavelet Transform,” Indian J. Sci. Technol., vol. 8, no. 26, Oct. 2015, doi: 10.17485/ijst/2015/v8i26/56192.

B. Miles, M. W. K. Law, I. Ben-Ayed, G. Garvin, A. Fenster, and S. Li, “Pixel level image fusion for medical imaging: an energy minimizing approach,” presented at the SPIE Medical Imaging, B. Van Ginneken and C. L. Novak, Eds., San Diego, California, USA, Feb. 2012, p. 831511. doi: 10.1117/12.911613.

C. E. Ogbuanya, A. Obayi, S. Larabi-Marie-Sainte, A. O. Saad, and L. Berriche, “A hybrid optimization approach for accelerated multimodal medical image fusion,” PLOS One, vol. 20, no. 7, p. e0324973, Jul. 2025, doi: 10.1371/journal.pone.0324973.

S. Shehanaz, E. Daniel, S. R. Guntur, and S. Satrasupalli, “Optimum weighted multimodal medical image fusion using particle swarm optimization,” Optik, vol. 231, p. 166413, Apr. 2021, doi: 10.1016/j.ijleo.2021.166413.

A. A. Alzahrani, “Enhanced multimodal medical image fusion via modified DWT with arithmetic optimization algorithm,” Sci. Rep., vol. 14, no. 1, p. 19261, Aug. 2024, doi: 10.1038/s41598-024-69997-x.

I. Boutabia, A. Benmachiche, A. A. Betouil, C. Chemam, and K. Rais, “Advanced Text Prediction System Integrated Within the Search Engine for the Open Classroom Approach Based on Particle Swarm Optimization and Long Short-Term Memory Models,” Arab. J. Sci. Eng., Mar. 2026, doi: 10.1007/s13369-026-11247-5.

A. Benmachiche, M. Derdour, M. S. Kahil, M. C. Ghanem, and M. Deriche, “Adaptive Hybrid PSO–APF Algorithm for Advanced Path Planning in Next-Generation Autonomous Robots,” Sensors, vol. 25, no. 18, Sep. 2025, doi: 10.3390/s25185742.

A. Makhlouf, A. Benmachiche, and I. Boutabia, “Enhanced Autonomous Mobile Robot Navigation Using a Hybrid BFO/PSO Algorithm for Dynamic Obstacle Avoidance,” Informatica, vol. 48, no. 17, Nov. 2024, doi: 10.31449/inf.v48i17.6716.

J. Mi, L. Wang, Y. Liu, and J. Zhang, “KDE-GAN: A multimodal medical image-fusion model based on knowledge distillation and explainable AI modules,” Comput. Biol. Med., vol. 151, p. 106273, Dec. 2022, doi: 10.1016/j.compbiomed.2022.106273.

M. Z. Khan et al., “Multimodality medical image fusion using directional total variation based linear spectral clustering in NSCT domain,” Sci. Rep., vol. 16, no. 1, p. 5367, Feb. 2026, doi: 10.1038/s41598-025-26916-y.

K. S. S. V. V. Ramesh and S. S. Kumar, “YUV-based SVD-VGG hybrid fusion for multimodal MRI-PET image integration,” PLOS ONE, vol. 21, no. 1, p. e0340781, Jan. 2026, doi: 10.1371/journal.pone.0340781.

D. C. Lepcha et al., “Multimodal Medical Image Fusion based on Pixel Significance using Anisotropic Diffusion and Cross Bilateral Filter,” Hum.-Centric Comput. Inf. Sci., vol. 12, no. 0, pp. 190–206, Mar. 2022, doi: 10.22967/HCIS.2022.12.015.

L. Wei, R. Zhu, X. Li, L. Zhao, X. Hu, and X. Zhang, “Pixel-level structure awareness for enhancing multi-modal medical image fusion,” Biomed. Signal Process. Control, vol. 97, p. 106694, Nov. 2024, doi: 10.1016/j.bspc.2024.106694.

P. Kavita, D. R. Alli, and A. B. Rao, “Study of image fusion optimization techniques for medical applications,” Int. J. Cogn. Comput. Eng., vol. 3, pp. 136–143, Jun. 2022, doi: 10.1016/j.ijcce.2022.05.002.

T. Zhou, Q. Cheng, H. Lu, Q. Li, X. Zhang, and S. Qiu, “Deep learning methods for medical image fusion: A review,” Comput. Biol. Med., vol. 160, p. 106959, Jun. 2023, doi: 10.1016/j.compbiomed.2023.106959.

N. Liang, “Medical image fusion with deep neural networks,” Sci. Rep., vol. 14, no. 1, p. 7972, Apr. 2024, doi: 10.1038/s41598-024-58665-9.

F. Luo, D. Wu, L. R. Pino, and W. Ding, “A novel multimodel medical image fusion framework with edge enhancement and cross-scale transformer,” Sci. Rep., vol. 15, no. 1, p. 11657, Apr. 2025, doi: 10.1038/s41598-025-93616-y.

J. Duan, S. Mao, J. Jin, Z. Zhou, L. Chen, and C. L. P. Chen, “A Novel GA-Based Optimized Approach for Regional Multimodal Medical Image Fusion With Superpixel Segmentation,” IEEE Access, vol. 9, pp. 96353–96366, 2021, doi: 10.1109/ACCESS.2021.3094972.

W. Tang, F. He, Y. Liu, and Y. Duan, “MATR: Multimodal Medical Image Fusion via Multiscale Adaptive Transformer,” IEEE Trans. Image Process., vol. 31, pp. 5134–5149, 2022, doi: 10.1109/TIP.2022.3193288.

R. Prathipa and R. Ramadevi, “Feature Level Medical Image Fusion with Deep Learning,” in 2023 International Conference on Evolutionary Algorithms and Soft Computing Techniques (EASCT), Oct. 2023, pp. 1–8. doi: 10.1109/EASCT59475.2023.10393457.

L. Wang, J. Zhang, Y. Liu, J. Mi, and J. Zhang, “Multimodal Medical Image Fusion Based on Gabor Representation Combination of Multi-CNN and Fuzzy Neural Network,” IEEE Access, vol. 9, pp. 67634–67647, 2021, doi: 10.1109/ACCESS.2021.3075953.

N. Nagaraja Kumar, T. Jayachandra Prasad, and K. Satya Prasad, “Multimodal Medical Image Fusion with Improved Multi-Objective Meta-Heuristic Algorithm with Fuzzy Entropy,” J. Inf. Knowl. Manag., vol. 22, no. 1, p. 2250063, Feb. 2023, doi: 10.1142/S0219649222500630.

Z. Wang, H. Di, R. Zhang, and F. Liu, “DTCFormer: Deep tensor chain frequency guided transformer for multi-modal medical image classification,” Biomed. Signal Process. Control, vol. 113, p. 109152, Mar. 2026, doi: 10.1016/j.bspc.2025.109152.

S. Roheda, H. Krim, Z.-Q. Luo, and T. Wu, “Decision Level Fusion: An Event Driven Approach,” in 2018 26th European Signal Processing Conference (EUSIPCO), Sep. 2018, pp. 2598–2602. doi: 10.23919/EUSIPCO.2018.8553412.

P. Szczuko, A. Harasimiuk, and A. Czyżewski, “Evaluation of Decision Fusion Methods for Multimodal Biometrics in the Banking Application,” Sensors, vol. 22, no. 6, p. 2356, Mar. 2022, doi: 10.3390/s22062356.

A. Benmachiche, B. Hadjar, I. Boutabia, A. A. Betouil, M. Maatallah, and A. Makhlouf, “Development of a biometric authentication platform using voice recognition,” in 2022 4th International Conference on Pattern Analysis and Intelligent Systems (PAIS), Oct. 2022, pp. 1–7. doi: 10.1109/PAIS56586.2022.9946890.

S. O. Boufaida, A. Benmachiche, M. Maatallah, and C. Chemam, “Hybrid Multi-Factor Authentication (MFA) Using Biometrics and Behavioral Analysis,” Feb. 23, 2026, Social Science Research Network, Rochester, NY: 6295298. doi: 10.2139/ssrn.6295298.

A. Benmachiche, A. Makhlouf, and T. Bouhadada, “Evolutionary learning of HMM with Gaussian mixture densities for Automatic speech recognition,” in Proceedings of the 9th International Conference on Information Systems and Technologies, in ICIST ’19. New York, NY, USA: Association for Computing Machinery, Mar. 2019, pp. 1–6. doi: 10.1145/3361570.3361591.

V. Sireesha and K. Sandhyarani, “Overview of Fusion Techniques in Multimodal Biometrics,” Int. J. Eng. Res., 2014.

L. Huang, S. Ruan, P. Decazes, and T. Denoeux, “Deep evidential fusion with uncertainty quantification and contextual discounting for multimodal medical image segmentation,” no. arXiv:2309.05919. arXiv, Aug. 19, 2024. doi: 10.48550/arXiv.2309.05919.

N. A. Othman, M. A. Abdel-Fattah, and A. T. Ali, “A Hybrid Deep Learning Framework with Decision-Level Fusion for Breast Cancer Survival Prediction,” Big Data Cogn. Comput., vol. 7, no. 1, p. 50, Mar. 2023, doi: 10.3390/bdcc7010050.

T. Zheng, S. Sone, Y. Ushiku, Y. Oba, and J. Ma, “TNF: Tri-branch Neural Fusion for Multimodal Medical Data Classification,” no. arXiv:2403.01802. arXiv, Mar. 10, 2024. doi: 10.48550/arXiv.2403.01802.

S. Guo, L. Wang, Q. Chen, L. Wang, J. Zhang, and Y. Zhu, “Multimodal MRI Image Decision Fusion-Based Network for Glioma Classification,” Front. Oncol., vol. 12, Feb. 2022, doi: 10.3389/fonc.2022.819673.

A. Karthik et al., “Ensemble-based multimodal medical imaging fusion for tumor segmentation,” Biomed. Signal Process. Control, vol. 96, p. 106550, Oct. 2024, doi: 10.1016/j.bspc.2024.106550.

S. Diao, Y. Wan, S. Huang, and H. Ma, “Research on Cancer Prediction and Identification based on Multimodal Medical Image Fusion,” in Proceedings of the 2024 3rd International Symposium on Robotics, Artificial Intelligence and Information Engineering, in RAIIE ’24. New York, NY, USA: Association for Computing Machinery, Sep. 2024, pp. 120–124. doi: 10.1145/3689299.3689321.

B. Oumaima, A. Benmachiche, M. Majda, M. Redjimi, and M. Derdour, Examining Intelligent Tutoring Systems and Their AI Underpinnings in the Design of the Future of Learning. 2025, p. 8. doi: 10.1109/ICNAS68168.2025.11298109.

Published
2026-05-19
How to Cite
[1]
M. Maatallah, A. Benmachiche, K. Rais, and S. Touam, “Intelligent Fusion of Multi-Modal Medical Imaging: A Comprehensive Review of Methods, Challenges, and Clinical Integration”, j.electron.electromedical.eng.med.inform, vol. 8, no. 3, pp. 897-936, May 2026.
Section
Medical Informatics